
PROCEEDINGS 

OF THE 

ROYAL SOCIETY OF LONDON 


SERIES A. MATHEMATICAL AND PHYSICAL SCIENCES 


VOL 189 


LONDON 

Printed and published for the Royal Society 
By the Cambridge University Press 
Bentley House, N.W.1


3 June 1947 



Printed in Great Britain at the University Press, Cambridge
(Brooke Crutchley, University Printer) 
and published by the Cambridge University Press 
(Cambridge, and Bentley House, London) 

Agents for Canada and India: Macmillan 



CONTENTS 


SERIES A VOLUME 189 
No. A 1016. 27 March 1947

PAGE
Interaction of steam and charcoal at low pressures. By R. F. Strickland-Constable . . . 1
On the dimension of normal spaces. By P. Alexandroff . . . 11
The crystal structures of two polyamides ('nylons'). By C. W. Bunn and E. V. Garner. (Plates 1 and 2) . . . 39
Limitations of the Newtonian time scale in relation to non-equilibrium rheological states and a theory of quasi-properties. By G. W. S. Blair, B. C. Veinoglou and J. E. Caffyn . . . 69
Kinetic boundary friction. By J. R. Bristow. (Plate 3) . . . 88
A general kinetic theory of liquids. II. Equilibrium properties. By H. S. Green . . . 103
Tables of second virial and low-pressure Joule-Thomson coefficients for intermolecular potentials with exponential repulsion. By R. A. Buckingham and J. Corner . . . 118
The oblique reflexion of very long wireless waves from the ionosphere. By M. V. Wilkes . . . 130


No. A 1017. 17 April 1947

The fine structure of the spectral lines of H₂: measurements with a reflexion echelon. By E. W. Foster and Sir Owen Richardson, F.R.S. (Plate 4) . . . 149
The fine structure of the lines of the H₂ spectrum which go down to the states. By E. W. Foster and Sir Owen Richardson, F.R.S. . . . 176
Radiative equilibrium in the ionosphere. By R. v. d. R. Woolley . . . 218
Atmospheric tides in the ionosphere. I. Solar tides in the region. By D. F. Martyn . . . 241
Temperature changes in the crystal structure of barium titanium oxide. By H. D. Megaw. (Plate 5) . . . 261
Visual adaptation in relation to brief conditioning stimuli. By B. H. Crawford. (Abstract) . . . 284


No. A 1018. 1 May 1947

Effect of crossed electric and magnetic fields on the helium spectrum. By J. S. Foster, F.R.S., and E. K. Pounder. (Plate 6) . . . 287
The surprising amount of energy which can be collected from gases after the electric discharge has passed. By Lord Rayleigh, F.R.S. . . . 296
Fragmentation of shell cases. By N. F. Mott, F.R.S. . . . 300
On a method of increasing contrast in microscopy. By Sir Thomas Merton, Treas.R.S. (Plate 7) . . . 309
The spectra of flames containing oxides of sulphur. By A. G. Gaydon and G. Whittingham. (Plates 8 and 9) . . . 313
On the simple group of order 25920. By J. A. Todd . . . 326
The inversion spectrum of ammonia at centimetre wave-lengths. By B. Bleaney and R. P. Penrose . . . 358
Infinite irreducible representations of the Lorentz group. By Harish-Chandra . . . 372
Thermal diffusion in mixtures of the inert gases. By K. E. Grew . . . 402
A theory of the dependence of the rate of detonation of solid explosives on the diameter of the charge. By H. Jones . . . 415


No. A 1019. 3 June 1947

Antoine Laurent Lavoisier, 26 August 1743 – 8 May 1794. By Sir Harold Hartley, F.R.S. (Plates 10 and 11) . . . 427
Determination of the velocity coefficients for polymerization processes. I. The direct photopolymerization of vinyl acetate. By G. M. Burnett and H. W. Melville, F.R.S. . . . 456
Determination of the velocity coefficients for polymerization processes. II. The photosensitized polymerization of vinyl acetate by benzoyl peroxide. By G. M. Burnett and H. W. Melville, F.R.S. . . . 481
Determination of the velocity coefficients for polymerization processes. III. Effect of environment on polymerization processes. By G. M. Burnett and H. W. Melville, F.R.S. . . . 494
The dehydration of ammonium, potassium and some mixed alums. By G. P. Acock, W. E. Garner, F.R.S., J. Milsted and H. J. Willavoys. (Plates 12 to 14) . . . 508
Slow neutron-induced activity in gold. By K. Siegbahn . . . 527
Mixed boundary conditions in the relaxational treatment of biharmonic problems (plane strain or stress). By L. Fox . . . 535
Vertical transport of heat by turbulence in the atmosphere. By C. H. B. Priestley and W. C. Swinbank . . . 543

Index . . . 563













Interaction of steam and charcoal at low pressures 
By R. F. Strickland-Constable, D.Phil.
(Communicated by C. N. Hinshelwood, F.R.S. Received 16 April 1946)


The reaction between charcoal and water vapour at temperatures up to 700° C and at
pressures of less than 1 mm. has been studied.

Water vapour, hydrogen and carbon monoxide have all been shown to be reversibly 
adsorbed at the reaction temperatures. Hydrogen is adsorbed strongly and rapidly; carbon 
monoxide is adsorbed in considerably smaller quantity and much more slowly. 

When charcoal reacts with water vapour at low pressures, the pressure falls during the 
rapid initial adsorption of the water vapour, and subsequently rises during the desorption of 
the product gases. The proportions of carbon monoxide and hydrogen in the product gases 
were found to vary throughout the reaction in a special manner. The variations can, however, 
be simply explained on the basis of the adsorption data. 

The kinetics of the reaction at higher pressures are discussed: they can likewise be
adequately explained on the basis of the adsorption data.

Reasons are given for supposing that the water vapour is adsorbed in the molecular
condition.

The course of gas/solid reactions is generally explained as occurring in an adsorbed 
layer on the surface of the solid. The nature and extent of the adsorption of the 
reactant and product gases is often deduced from the course of the reaction without 
direct measurement. In the present research, by the use of low gas pressures and a 
solid with a large surface, an attempt has been made to measure the adsorption of 
the gases separately, and to relate these to the course of the reaction. 


Experimental 

The apparatus is illustrated in figure 1. It consists of: A, the reaction bulb
(vol. 185 ml.); B, the capacity bulb (vol. 940 ml.), which was used to measure out
charges of water vapour; C, connexion to the Macleod gauge; D, tap to reaction
bulb; E, connexion leading to the analysis bulb; F, tap leading to gas storage
apparatus; G, a thermocouple; H, furnace; K, tap leading to vacuum pumps.

The reaction bulb, of porcelain, was heated in an electric furnace whose temperature
was measured by means of the thermocouple. In the storage apparatus water
and the various gases required were stored in a series of bulbs. The apparatus, except
for the reaction bulb, was of soda glass. The usual precautions with regard to
cleanliness that are necessary in low-pressure technique were observed. The tap
K led to a Gaede mercury-vapour pump, by which the apparatus could be rapidly
evacuated.

To measure the pressure of water vapour the compression stem of the Macleod 
gauge was heated in a hot-water bath at 100° C. The latter consisted of an electrically 
heated glass jacket containing water to which a small amount of soap had been 
added to facilitate steady boiling. Thus as the water vapour which was trapped at 
room temperature in the body of the Macleod was compressed into the compression 


Vol. 189. A. (27 March 1947) 




stem, it was raised to 100° C, and was thus maintained above its dew-point. When
used in this way with permanent gases the readings were found to be increased in
the calculated ratio as compared with those read on an unheated instrument. For
use with water vapour the instrument was calibrated against known pressures
which were established by surrounding a portion of the main apparatus containing
water by a bath held at controlled low temperatures. In this way it was shown that
over the range of 150 to 1400 × 10⁻³ mm. the pressure read was fairly accurately equal
to 90 % of the value obtained with permanent gases. This correction was accordingly
made to all readings of pure water-vapour pressures.





In previous work with similar apparatus (Strickland-Constable 1944) and using
gas pressures ranging from 0.02 to 1.0 mm., it had been found that all the gases used,
including N₂O and CO₂, obeyed Boyle's law; thus if one vessel at a known pressure
was put into communication with a second evacuated vessel, the proportional change
in pressure was independent of the initial pressure in the first vessel, and of the



nature of the gas. But this was far from the case with water vapour, of which a 
large proportion appeared to reside on the surface of the glass, and not in the gas 
phase. 

Many experiments were performed to investigate the behaviour of water vapour,
and as the result of these and of the work of Frank (1929), the following conclusions
were found to apply to an apparatus of the dimensions used in the present work:

When the measured pressure is very low the greater part of the water vapour
present resides on the surface of the vessel, owing to the strong adsorption of the
first one or two layers of molecules. On increasing the pressure the proportion
present in the gas phase increases, and over about 300 × 10⁻³ mm. at least half the
total water is so present. As, however, the pressure approaches saturation the
situation is again reversed, and an ever greater proportion resides on the glass
surface. Two sets of experiments can be briefly described to illustrate these points.
Hydrogen and oxygen were introduced into the analysis bulb, their pressures being
measured on the Macleod. On heating the platinum filament, water vapour equal in
pressure to the figures given in column 1 of table 1 should have been produced: the
pressures actually observed are shown in column 2. Column 3 gives the observed as
a percentage of the expected values.

Table 1. Pressures in mm. × 10⁻³

expected    found    found/expected %
560         350      62½
408         220      54
200         75       37
30          0        0
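
The third column of table 1 is the found pressure expressed as a percentage of the expected pressure. A minimal Python sketch of that arithmetic is given here, using the four pairs of readings from the table; the function name is illustrative and not taken from the paper, and the printed column agrees with the computed values to within rounding.

```python
# Readings from table 1, in units of 10^-3 mm: (expected, found).
TABLE_1 = [(560, 350), (408, 220), (200, 75), (30, 0)]


def found_percent(expected, found):
    """Return the found pressure as a percentage of the expected pressure."""
    return 100.0 * found / expected


for expected, found in TABLE_1:
    print(f"{expected:>4} {found:>4} {found_percent(expected, found):5.1f} %")
```

The steady fall of the percentage with falling pressure is what the text attributes to the strong adsorption of the first layers of water on the glass.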


In another set of experiments a rather high pressure of water vapour was established
in a small vessel with a high surface-volume ratio, and the water vapour was
then expanded into the analysis bulb. When 10 mm. pressure was established in the
small vessel, the pressure developed on expansion into the analysis bulb was
325 × 10⁻³ mm., although with a permanent gas the pressure would have been only
108 × 10⁻³ mm. The temperature corresponded to a saturation pressure of about
13 mm. A layer of water at least 20 molecules thick on the walls of the small vessel was
calculated to be necessary to account for the above result.

In what follows the quantities of water vapour used are recorded by pressure 
measurements, though it must be remembered throughout that the actual quantities 
of water present are usually more than double those corresponding to the recorded 
pressures, and that the total amounts present are not even directly proportional to 
these pressures. 

The analysis bulb and the method of using it for analyzing mixtures of CO, CO₂,
and O₂ have been described in a previous paper by the author (1944). In the present
work the mixture could consist of H₂O, H₂, CO₂ and CO. Although it was useless
to attempt to determine accurately the amount of H₂O present, owing to the strong




adsorption on the walls of the apparatus, it was sometimes possible, by using the
P₂O₅ reagent tube, to obtain a minimum value. Usually, however, H₂O was first
absorbed in the P₂O₅, and the analysis then carried out on the dry mixture remaining
as follows:

CO₂ was first determined by absorption in the Sofnolite. The Sofnolite cut-off
was then closed (the P₂O₅ cut-off being left open throughout the analysis), oxygen
was added, and the H₂ and CO were burned on the platinum filament. The fall in
pressure was noted, the Sofnolite tube was then opened to absorb the CO₂ formed,
and a second fall in pressure was noted. These two pressure drops allow of the
calculation of the amounts of CO and H₂ present. The method was found to be
accurate at pressures of over 200 × 10⁻³ mm. At lower pressures H₂ tended to be
too low and CO too high. A sample test analysis is as follows:

total pressure 400 × 10⁻³ mm.

took     CO 48 %    H₂ 52 %
found    CO 46 %    H₂ 61 %
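
The calculation from the two pressure drops can be sketched as follows. The sketch assumes, as the description of the method suggests but does not state outright, that oxygen is in excess, that the water formed from H₂ is completely taken up by the open P₂O₅ tube, and that the CO₂ formed from CO stays in the gas phase until the Sofnolite tube is opened. Under those assumptions the first drop is 1.5 p(H₂) + 0.5 p(CO) and the second drop is p(CO). The function name and the test mixture are hypothetical.

```python
def analyse_dry_gas(drop_on_burning, drop_on_absorbing_co2):
    """Return (p_H2, p_CO) from the two pressure drops, all in the same units.

    Assumed stoichiometry:
      H2 + 1/2 O2 -> H2O (removed by P2O5): drop of 1.5 per unit of H2
      CO + 1/2 O2 -> CO2 (stays in the gas): drop of 0.5 per unit of CO
      opening the Sofnolite tube removes the CO2: drop of 1 per unit of CO
    """
    p_co = drop_on_absorbing_co2
    p_h2 = (2.0 * drop_on_burning - drop_on_absorbing_co2) / 3.0
    return p_h2, p_co


# Hypothetical check: a 400-unit dry mixture made up as 52 % H2 and 48 % CO.
p_h2_true, p_co_true = 0.52 * 400, 0.48 * 400
drop_1 = 1.5 * p_h2_true + 0.5 * p_co_true
drop_2 = p_co_true
p_h2, p_co = analyse_dry_gas(drop_1, drop_2)
print(round(p_h2, 6), round(p_co, 6))
```

Because H₂ is obtained by difference from both readings, any error in either drop falls mainly on the hydrogen figure, which may be one reason the method loses accuracy at the lowest pressures.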

It was established that at the temperature at which the platinum filament was
heated for the combustion of the H₂ and CO, no burning of CH₄ took place, so that
the latter gas would have remained as an unburnt residue in the analysis. No
methane was in fact ever found.

The charcoal consisted of 5 g. of a steam-activated coconut charcoal known as
Dorsite, as used in previous experiments (Strickland-Constable 1938).

Results 

Before giving the detailed experimental results, an outline of the general theory of
the reaction which was deduced from them will be given. Most of the experiments
were performed at 700° C, and this temperature will be assumed unless the contrary
is specified. It appears that even at this temperature H₂O is reversibly adsorbed on
the surface of charcoal, and this adsorption takes place very rapidly. In the adsorbed
condition the water reacts with the charcoal to form CO and H₂, and these products
are then desorbed. Both CO and H₂ are themselves reversibly adsorbed at this
temperature.

When therefore a low pressure of water vapour is allowed to react with a charcoal
previously outgassed at a much higher temperature, the water is adsorbed and the
pressure falls at once to a very low figure where it remains indefinitely. The water in
the adsorbed condition reacts to form CO and H₂, but these are likewise retained by
strong adsorption on the surface. Only by repeating this process a number of times
or by using a comparatively high pressure of water vapour does the charcoal
become more or less saturated with product gases, and the sequence of events, on
introducing a further charge of water vapour, is as follows:

The pressure falls at first to a rather low value and then begins to rise again. The 
initial fall is due to adsorption of water, and the subsequent rise to desorption of 
product. If a sample of the gas phase is taken early in the reaction, when only a 




small amount of product has been evolved, it is found to contain a much higher
proportion of H₂ than a sample taken later on, which consists largely of CO. This was
one of the principal features of the reaction. The explanation appears to depend on
the fact that at equilibrium H₂ is adsorbed much more strongly than CO; at the
same time the adsorption and desorption of H₂ are far more rapid than is the case
with CO, so that H₂ adsorption equilibrium is established very quickly, that of CO
much more slowly. In the early stages of the reaction the H₂ will desorb more rapidly
and hence be in excess in the gas phase. Later in the reaction, as adsorption equilibrium
is approached, CO will be in excess in the gas phase, owing to its much smaller
equilibrium adsorption.
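
The qualitative argument above can be illustrated with a toy model. This is only a sketch under stated assumptions, not the author's treatment: each product is taken to approach its own adsorption equilibrium by a first-order law, and the equilibrium gas-phase fractions and rate constants are invented for illustration (H₂ strongly held but fast, CO weakly held but slow).

```python
import math

# Illustrative parameters only (not fitted to the paper's data):
# fraction of each product left in the gas phase at adsorption equilibrium,
# and first-order rate constant (per minute) for approaching that equilibrium.
H2 = {"gas_fraction_at_equilibrium": 0.05, "rate_per_min": 5.0}
CO = {"gas_fraction_at_equilibrium": 0.50, "rate_per_min": 0.05}


def gas_amount(species, t_min):
    """Gas-phase amount at time t for one unit of product formed on the surface."""
    f = species["gas_fraction_at_equilibrium"]
    k = species["rate_per_min"]
    return f * (1.0 - math.exp(-k * t_min))


def h2_fraction(t_min):
    """Fraction of H2 in the H2 + CO gas mixture at time t."""
    h2, co = gas_amount(H2, t_min), gas_amount(CO, t_min)
    return h2 / (h2 + co)


for t in (0.5, 2, 10, 30):
    print(t, round(h2_fraction(t), 2))
```

With these assumed numbers the gas is mostly H₂ in the first minute and mostly CO after half an hour, which is the pattern the text describes.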

In the first set of experiments to be described the charcoal was outgassed at
1000° C, and thereafter a series of runs was carried out. In these runs a charge of
water vapour was admitted to the capacity bulb and its pressure measured on the
Macleod. Tap D was then opened so that the capacity bulb and reaction bulb were
placed in communication, and the change of pressure with time was observed. The
pressure at t = 0 was estimated from the original pressure in the capacity bulb, this
estimate being based on sharing experiments previously carried out at the appropriate
temperature when the reaction bulb contained no charcoal. The results are
given in table 2.


Table 2. Capacity bulb method, 700° C

         pressures (mm. × 10⁻³) at t (min.)              H₂O adsorbed,
run      0       1       2       6       15      30      ml. of vapour at n.t.p.
118      990     360     290     130     70      70      1.440
119      930     450     390     260     170     160     1.270
120      1170    720     640     590     510     470     1.400


In the first run the initial pressure was 990 × 10⁻³ mm. The pressure fell rapidly
however, and after 30 min. only about 70 × 10⁻³ mm. were left. The residual gas
consisted of H₂ and CO with little or no water vapour. The 5 g. of charcoal had
therefore adsorbed 1.44 ml. of water vapour (nominally calculated to n.t.p.) with
the formation of little or no gaseous product. The object of these runs was to saturate
the charcoal, since unless this was done no substantial amount of product gases could
be obtained. Two more runs were accordingly carried out by the same method, each
run being preceded by a short evacuation to remove gases left in the gas phase. At
the end of the third run the gas pressure was about six times that after the first,
showing progressive saturation of the charcoal. Finally, before proceeding to the
next series, a pressure of 16 mm. of water vapour was established in the bulb and
maintained there for 20 min.

In the next series of runs the capacity bulb was not used, the H₂O being introduced
into the Macleod alone, where its pressure was measured, before opening the
tap to the reaction vessel. The volume of Macleod + reaction bulb (the latter reduced
to room temperature) was only 200 ml. as compared with a total volume of 1140 ml.





when using the capacity-bulb method. After opening tap D the course of the reaction 
was followed as before, and the results of some runs are given in table 3. 


Table 3. Normal method, 700° C

         pressure (mm. × 10⁻³) at t (min.)
run      0       1       2       10      20      30
126      750     260     280     390     –       –
127      660     230     240     330     400     –
129      820     300     310     480     600     690


The apparatus was evacuated between runs for sufficiently long to reduce the
rate of gas evolution to a conveniently low value: 2 or 3 min. was usually sufficient.
It will be seen from the table that the pressure falls during the first minute to
approximately one-third the initial value, and then rises slowly. The initial fall
represents the very rapid adsorption of the water vapour, and the slow subsequent
rise the evolution of the products of reaction. The reaction H₂O + C = CO + H₂
demands a doubling of the initial pressure. As will be seen from the table, however,
the pressures never even regain their initial values. This appeared to be due to the
fact that although the rate of desorption was high at very low pressures, it fell off
rapidly as the pressure rose. Thus a considerable part of the gas produced in each
run was only desorbed during the evacuation of the bulb between runs.
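
The two statements about table 3 (a fall to roughly one-third in the first minute, and a final pressure well short of the doubling that the stoichiometry demands) can be checked from the readings. The short sketch below uses the pressure at t = 0, at 1 min and at the last recorded time for each run; the function name is illustrative.

```python
# Table 3 readings in units of 10^-3 mm: (p at t = 0, p at 1 min, last recorded p).
RUNS = {126: (750, 260, 390), 127: (660, 230, 400), 129: (820, 300, 690)}


def summarise(p0, p1, p_last):
    """Return (fraction left after 1 min, last reading / initial, stoichiometric target)."""
    return p1 / p0, p_last / p0, 2 * p0


for run, (p0, p1, p_last) in RUNS.items():
    left, regained, target = summarise(p0, p1, p_last)
    print(run, round(left, 2), round(regained, 2), target)
```

In every run the last reading is below the initial pressure, and so far below the doubled value that complete desorption of the products would give.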

In other runs by the same method, samples of gas were taken for analysis (see 
table 4). 

Table 4. Normal method, 700° C

         pressure     time of             analysis %
run      at t = 0     sampling (min.)     CO₂            H₂             CO      unknown
128      660          1                   12             42             44      2
130      700          ½                   13             [illegible]    36      [illegible]
127      540          21                  [illegible]    [illegible]    84½     −½
129      693          34                  4              18             78½     −½


The time of sampling appeared to have an important effect on the composition of
the sample. Early samples had a higher proportion of H₂ compared to later samples,
as will be clearly seen in the above table. The explanation of this observation has
already been given. (It will, of course, be noticed that the CO₂ content of the early
samples is higher than that of the later samples. The effect is, however, insufficiently
great to account for the much larger changes in hydrogen concentration.)

In yet another type of experiment the water container was allowed to communicate
with the reaction bulb until a high pressure of vapour had been established.
In run 199 for instance a pressure of 18 mm. of water vapour was established in the
bulb. A sample was taken after 18 min., and, after removal of water vapour, the dry
gases analyzed to: CO₂ 23 %, H₂ 24 %, CO 51 %. The bulb was then evacuated for
1 min., and a sample of gas allowed to accumulate for ½ min.; the dry gases in this




case analyzed to: CO₂ 2 %, H₂ 93 %, CO [illegible] %. The explanation of the change in
hydrogen content is the same as before: the sample taken 18 min. after the introduction
of the water vapour corresponds to the later samples of table 3, and had a
low hydrogen content owing to approach to adsorption equilibrium. The sample
taken immediately after a short evacuation consisted of nearly pure hydrogen owing
to the high rate of desorption of the latter, as compared with CO.

That this view was the correct one was confirmed by the following further experiments.
Equal volumes of CO and H₂ were admitted to the reaction bulb at 700° C
and left for over 2 hr., at the end of which time the pressure was 11 ^ mm. A sample 
of the gas phase now analyzed to: CO₂ 0 %, H₂ 8 %, CO 81 %, N₂ 11 %. The bulb
was then evacuated for 1½ min., and a further sample was allowed to collect during
2½ min., which gave on analysis: CO₂ 1 %, CO 8 %, H₂ 87 %. The 11 % of N₂ recorded
in the first analysis was attributed to the presence of a small amount of this gas as
an impurity in the CO used. Owing to its not being adsorbed the small proportion
present in the initial gas became concentrated in the gas phase as a substantial
constituent.

The difference in the rates of adsorption of CO and H₂ was shown directly in an
experiment in which a charge of CO was admitted to the bulb at 700° C after the
charcoal had been outgassed at 1000° C. The variation of pressure with time is shown
in table 5.

Table 5. CO, 700° C

t (min.)                  0       ½       2       5       15      30
pressure (mm. × 10⁻³)     3620    3500    3260    3020    2660    2040

After evacuation for several minutes, to remove all the gaseous CO, a charge of
H₂ was admitted (see table 6).

Table 6. H₂, 700° C

t (min.)                  0       ½       2       6       15
pressure (mm. × 10⁻³)     4600    1720    342     246     213

Thus H₂ appears to be adsorbed far more quickly than CO: in the first 2 min.
360 × 10⁻³ mm. of CO were adsorbed, as against 4258 × 10⁻³ mm. of H₂, and this in
spite of the fact that the adsorption was taking place on a charcoal already partly
covered with CO.
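
The two uptake figures quoted for the first 2 min. follow directly from tables 5 and 6 as differences of the pressure readings. A brief sketch of that subtraction is given here; only the readings at whole-minute times are entered, and the function name is illustrative.

```python
# Pressure readings (units of 10^-3 mm) against time in minutes, from tables 5 and 6.
CO_RUN = {0: 3620, 2: 3260, 5: 3020, 15: 2660, 30: 2040}
H2_RUN = {0: 4600, 2: 342, 6: 246, 15: 213}


def uptake(readings, t_min):
    """Pressure drop between t = 0 and t_min, taken as the amount adsorbed."""
    return readings[0] - readings[t_min]


print(uptake(CO_RUN, 2), uptake(H2_RUN, 2))  # prints: 360 4258
```

On these figures the H₂ uptake in 2 min. is more than ten times that of CO.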

The difference in the total quantities which could be adsorbed was shown in a
qualitative way in the following experiments: CO was admitted to the reaction bulb,
which was maintained at 700° C for half an hour, during which time a fall in pressure
of 2400 × 10⁻³ mm. was recorded. The bulb was then evacuated for 7 sec.; the pressure
was now found to be 270 × 10⁻³ mm., and it continued to rise slowly. Thus after
adsorbing 2400 × 10⁻³ mm. of CO the equilibrium pressure is over 270 × 10⁻³ mm. (It
may be noted that 1 mm. pressure in the Macleod + reaction bulb at 700° C is
equivalent to about 0.25 ml. of gas at n.t.p.)
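
The figure of about 0.25 ml. per mm. can be recovered from the 200 ml. volume of the Macleod + reaction bulb (the latter reduced to room temperature) by ideal-gas scaling to 0° C and 760 mm. In the sketch below the room temperature of 20° C is an assumption, since the paper does not state it, and the function name is illustrative.

```python
def ml_at_ntp(pressure_mm, volume_ml, temperature_c):
    """Volume the gas would occupy at 0 deg C and 760 mm (ideal-gas scaling)."""
    return volume_ml * (pressure_mm / 760.0) * (273.15 / (273.15 + temperature_c))


# 200 ml is the Macleod + reaction bulb volume reduced to room temperature;
# 20 deg C for room temperature is an assumption, not a figure from the paper.
print(round(ml_at_ntp(1.0, 200.0, 20.0), 3))
```

On this basis a recorded fall of 2400 × 10⁻³ mm. corresponds to roughly 0.6 ml. of gas at n.t.p.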





In a similar experiment with H₂, a pressure of 8600 × 10⁻³ mm. was adsorbed,
after which the equilibrium pressure was less than 200 × 10⁻³ mm. This experiment
shows that the adsorption of H₂ is at least three times that of CO, and probably
considerably more.
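
The "at least three times" is the ratio of the two amounts adsorbed, and it is a lower bound because the H₂ was held at a lower equilibrium pressure than the CO. A short sketch of the comparison, with illustrative variable names:

```python
# Amounts adsorbed and the pressures left over them, in units of 10^-3 mm.
CO = {"adsorbed": 2400, "equilibrium_pressure_over": 270}
H2 = {"adsorbed": 8600, "equilibrium_pressure_under": 200}

ratio = H2["adsorbed"] / CO["adsorbed"]
print(round(ratio, 2))

# H2 was held at a lower gas pressure than CO, so at equal pressures
# the ratio of the amounts adsorbed would be larger still.
lower_bound_only = H2["equilibrium_pressure_under"] < CO["equilibrium_pressure_over"]
print(lower_bound_only)
```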

Experiments were made to show that the adsorption of water vapour is reversible.
At 400° C a pressure of 18 mm. of water vapour was established in the bulb and
maintained for 15 min. The bulb was then evacuated for 5 min., a time sufficient to
remove very completely any gas originally present in the gas phase. During the
following 30 min. a pressure of about 0.5 mm. of gas accumulated in the bulb, and
this was shown to consist of over 95 % water vapour.

A very similar run at 600° C gave a gas sample containing over 40 % of water
vapour. At this temperature the rate of reaction, although much slower than at
700° C, was nevertheless beginning to be appreciable. At 700° C, owing to the much
greater rate of reaction, it was more difficult to obtain evidence, but even at this
temperature a minimum of 9 % of water vapour was found in a sample of gas which
was allowed to collect after 2 min. evacuation following a saturation of the charcoal
as before.

The presence of certain amounts of CO₂ has been disclosed in the various analyses
quoted. In table 4 the two early samples had a ratio of CO₂ to CO of 1 to 3.7, whereas
the two late samples had a ratio of less than 1 in 20. In run 199 the proportion of
CO₂ to CO was 1 to 2.2. No special study was made of the conditions of formation
of CO₂, nor was the adsorption measured. It is most probably formed as the result
of the secondary reaction CO + H₂O = CO₂ + H₂. The amounts formed were in no
case sufficient to obscure the observations that were being made on the main
reaction.


Discussion of results 

A general explanation of the low-pressure reaction in terms of the measured
adsorption data has already been given at the beginning of the paragraph headed
'Results' and need not be repeated here.

In what follows, the word adsorption will be used for a strictly reversible process,
which will be clearly distinguished from an irreversible sorption. It has been shown
that both the products, H₂ and CO, and the reactant, water vapour, are reversibly
adsorbed up to at least 700° C. The fact that the sorption of water vapour is reversible
is of interest from the point of view of the mechanism of sorption at these relatively
high temperatures. Sorption at these temperatures is often considered to involve
dissociation of the molecule into atoms: if, however, the sorption of H₂O is reversible
such a dissociation would in this case seem to be most unlikely in view of the known
very high rates of reaction of such gases as O₂ and N₂O with charcoal at these
temperatures. It would appear, in fact, that forces must exist which hold the
complete molecules H₂O, CO and H₂ on the surface in such a manner that they can
easily be desorbed again into the gas phase.





In considering the bearing of the present results on the steam-carbon reaction at
higher pressures reference is made to the comprehensive work of Gadsby, Sykes &
Hinshelwood (1946). In this work, which was carried out at steam pressures of
from 100 to 760 mm., it is shown that the rate of reaction is given by an expression
of the form of


$$\text{rate} = \frac{a\,p_1}{1 + b\,p_2 + c\,p_1} \quad (a,\ b,\ c \text{ constants}), \tag{1}$$


where $p_1$ = partial pressure of H₂O, and $p_2$ = partial pressure of H₂. This formula
corresponds to a reaction that at high steam pressures approaches zero order with
regard to steam, and is retarded by hydrogen.

Now such a reaction formula can easily be accounted for on the basis of the
present results. One of the principal points established here is the slowness of the
desorption of the CO, compared with other stages of the reaction. Thus at 700° C
the desorption of CO was found to take many minutes, while the adsorption of
H₂O was largely completed in a fraction of a minute. Qualitatively, therefore, the
reaction may be expected to be controlled by the slow desorption of CO and to
approach zero order at higher pressures. The strong adsorption of H₂, which was
demonstrated, would account for the retardation by H₂, although to complete the
evidence for this it would be desirable to show that the adsorption of H₂ reduces
the amount of H₂O which can be adsorbed.

Quantitatively it is possible to deduce a formula for reaction rate on the basis of
such a mechanism, assuming, that is to say, reversible adsorption of H₂O and H₂,
reaction of adsorbed H₂O to adsorbed CO, followed by slow desorption of CO
(owing to its weak adsorption the reverse reaction consisting of the adsorption of
gaseous CO can be neglected). Let

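                                       H₂O      H₂       CO
fractions of surface covered be        $s_1$    $s_2$    $s_3$
partial pressures of gases be          $p_1$    $p_2$    –

Then rate of desorption of CO = $k_6 s_3$, and in the steady state this must equal the
rate of formation of CO from adsorbed H₂O, which equals $k_5 s_1$ ($k_1$, $k_2$, etc., being
appropriate constants).

Therefore
$$k_6 s_3 = k_5 s_1. \tag{2}$$

For H₂O, rate of adsorption = rate of desorption + rate of reaction to CO:
$$k_1 p_1 (1 - s_1 - s_2 - s_3) = k_2 s_1 + k_5 s_1. \tag{3}$$

For H₂ adsorption equilibrium
$$k_3 p_2 (1 - s_1 - s_2 - s_3) = k_4 s_2. \tag{4}$$

Equations (2), (3) and (4) can be solved to give
$$\text{rate of reaction} = k_5 s_1 = \frac{\dfrac{k_1 k_5}{k_2 + k_5}\, p_1}{1 + \dfrac{k_3}{k_4}\, p_2 + \dfrac{k_1}{k_2 + k_5}\left(1 + \dfrac{k_5}{k_6}\right) p_1}. \tag{5}$$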
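
As a check on the algebra, the steady state can be worked out numerically. The sketch below assumes Langmuir-type balances on the bare fraction of surface, $1 - s_1 - s_2 - s_3$, with $k_1$, $k_2$ the adsorption and desorption constants of H₂O, $k_3$, $k_4$ those of H₂, $k_5$ the constant for reaction of adsorbed H₂O and $k_6$ that for desorption of CO; this labelling is inferred from the later discussion of the constants. The numerical values are arbitrary, and the function names are illustrative.

```python
def steady_state(k1, k2, k3, k4, k5, k6, p1, p2):
    """Return (s1, s2, s3, rate) for the assumed mechanism."""
    a = k1 * p1 / (k2 + k5)      # s1 / (bare fraction), from the H2O balance
    b = k3 * p2 / k4             # s2 / (bare fraction), from H2 equilibrium
    bare = 1.0 / (1.0 + b + a * (1.0 + k5 / k6))
    s1, s2 = a * bare, b * bare
    s3 = k5 * s1 / k6            # CO balance: k6 * s3 = k5 * s1
    return s1, s2, s3, k5 * s1


def rate_formula(k1, k2, k3, k4, k5, k6, p1, p2):
    """Closed form for the rate, of the form of equation (5)."""
    c = k1 / (k2 + k5)
    return c * k5 * p1 / (1.0 + (k3 / k4) * p2 + c * (1.0 + k5 / k6) * p1)


# Arbitrary illustrative constants, not values from the paper.
ks = dict(k1=2.0, k2=0.5, k3=4.0, k4=1.0, k5=0.3, k6=0.02)
s1, s2, s3, rate = steady_state(p1=10.0, p2=1.0, **ks)
bare = 1.0 - s1 - s2 - s3
print(rate, rate_formula(p1=10.0, p2=1.0, **ks))
```

The coverages returned by `steady_state` satisfy the three balances, and the resulting rate agrees with the closed form, for any positive choice of constants and pressures.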





This equation is of the same form as (1). It is also nearly the same as a formula
deduced by Gadsby et al. (1946), which does not, however, distinguish the separate
steps of the reaction of adsorbed H₂O and the desorption of adsorbed CO. This
formula gives the rate of reaction as


$$\text{rate} = \frac{\dfrac{k_1 k_5}{k_2 + k_5}\, p_1}{1 + \dfrac{k_3}{k_4}\, p_2 + \dfrac{k_1}{k_2 + k_5}\, p_1}. \tag{6}$$


Equations (5) and (6) differ only in the last term of the denominator where an
additional factor $(1 + k_5/k_6)$ appears in (5). This whole term is the one which is
responsible for the reaction tending to zero order at high steam pressures: and if
this effect is due to a relatively slow desorption of CO, then $k_5/k_6$ will be large
compared with unity, and will determine the importance of the term. If, on the
other hand, the rate of desorption of CO were fast, then $k_5/k_6$ would be small and
the size of the term would be determined by the ratio of $k_1$ to $k_2 + k_5$; in this case the
zero order at high pressures would be conditioned by a slow rate of desorption of
H₂O ($k_2$ small), together with a slow rate of reaction of adsorbed H₂O ($k_5$ small), or
in other words by the surface being largely covered by adsorbed H₂O.

It is considered that the present results favour the view that the slow desorption
of CO is the controlling factor, and that the mechanism will therefore be represented
by equation (5) with a value of $k_5/k_6$ that is large compared with unity.
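
The approach to zero order can be illustrated numerically. The sketch below evaluates a rate expression of the form of equation (5), under the same Langmuir-type assumptions and labelling of constants as in the discussion above ($k_5$ for reaction of adsorbed H₂O, $k_6$ for desorption of CO), and compares it with its limit at high steam pressure, $k_5 k_6/(k_5 + k_6)$. All numerical values are illustrative and the function names are hypothetical.

```python
def rate(p1, p2, k1, k2, k3, k4, k5, k6):
    """Rate from the steady-state expression of the form of equation (5)."""
    c = k1 / (k2 + k5)
    return c * k5 * p1 / (1.0 + (k3 / k4) * p2 + c * (1.0 + k5 / k6) * p1)


def high_pressure_limit(k5, k6):
    """Limit of the rate as p1 grows without bound: k5 * k6 / (k5 + k6)."""
    return k5 * k6 / (k5 + k6)


# Illustrative constants only; k5/k6 = 50, i.e. CO desorption is the slow step.
ks = dict(k1=2.0, k2=0.5, k3=4.0, k4=1.0, k5=1.0, k6=0.02)
for p1 in (1.0, 10.0, 100.0, 1000.0):
    print(p1, round(rate(p1, 0.0, **ks), 5))
print(round(high_pressure_limit(ks["k5"], ks["k6"]), 5))
```

When $k_5/k_6$ is large the limiting rate is close to $k_6$ itself, so that under these assumptions the desorption of CO sets the ceiling on the rate at high steam pressures.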


The writer wishes to express his special thanks to Professor C. N. Hinshelwood for 
his advice and help throughout the work described in this paper. The work was 
carried out as part of the programme of the Fuel Research Board of the Department 
of Scientific and Industrial Research, and is published by permission of the 
department. 

References

Frank 1929 J. Phys. Chem. 33, 970.
Gadsby, Sykes & Hinshelwood 1946 Proc. Roy. Soc. A, 187, 129.
Strickland-Constable 1938 Trans. Faraday Soc. 34, 1375.
Strickland-Constable 1944 Trans. Faraday Soc. 40, 333.



On the dimension of normal spaces 

By P. Alexandroff

Mathematical Institute of the Academy of Sciences, Moscow

(Communicated by M. H. A. Newman, F.R.S. Received 24 April 1946)

The main purpose of this paper is to generalize for the case of normal spaces the 
fundamental theorem of the homological dimension theory and the existence 
theorem on Cantor manifolds in n-dimensional spaces. 

This generalization is based on 

(1) The generalization† of the same theorems, and many others, for bicompacta
(i.e. bicompact Hausdorff spaces), §§ 4, 5.

(2) The study of certain 'maximal' subspectra of the spectrum of a bicompactum, § 3.

(3) A new approach to the maximal, or Čech, extension of a normal space, § 6.

(4) A suitable definition of the generalized notion of Cantor manifold, which is
combinatorial in character and, when applied to compacta, corresponds to the
classical definition given by Urysohn, § 7.

In §§ 1, 2 I recall well-known definitions regarding homological invariants of
bicompacta, and their coverings and spectra (see, for example, my papers (1941a,
1943) and the fundamental work of Lefschetz (1942, chapter VI)).

Notations. $\cup$ denotes set-theoretical sums, $\cap$ intersections of sets; $\Lambda$ stands for
the void set.

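1. Simplicial spectra

A simplicial spectrum (as defined in Alexandroff 1943, § 28) is a directed set
(Lefschetz 1942, p. 4) $C = \{C_\alpha, \varpi_\alpha^\beta\}$ of finite simplicial complexes (or 'nerves'; see
Alexandroff 1943, §§ 28, 30) $C_\alpha$, together with certain simplicial mappings
('projections') $\varpi_\alpha^\beta$ under the following conditions:

(1) If $C_\beta$ follows $C_\alpha$ in the directed set $C$ ($\beta > \alpha$ is written for this), then there is
given a finite number $\geq 1$ of projections $\varpi_\alpha^\beta$ of $C_\beta$ in $C_\alpha$.

(2) If $\varpi_\alpha'^{\beta}$ and $\varpi_\alpha''^{\beta}$ are two projections (of $C_\beta$ in $C_\alpha$), then for each simplex
$t_\beta \in C_\beta$ the simplices $t'_\alpha = \varpi_\alpha'^{\beta} t_\beta$ and $t''_\alpha = \varpi_\alpha''^{\beta} t_\beta$ are faces of a certain simplex
$t_\alpha \in C_\alpha$ ($t'_\alpha \leq t_\alpha$, $t''_\alpha \leq t_\alpha$ is written for this, thus considering each simplex as its
own 'improper' face).

(3) If $\gamma > \beta > \alpha$ and $\varpi_\beta^\gamma$, $\varpi_\alpha^\beta$ are projections, then the mapping $\varpi_\alpha^\beta \varpi_\beta^\gamma$ of $C_\gamma$ in $C_\alpha$
is a projection.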

† These generalizations could be obtained entirely on the basis of the methods developed
in Alexandroff (1932, 1940, 1941a) and Alexandroff, Hopf & Pontrjagin (1935); but a shorter
and much more elegant way is given by combination of these methods with those used by
Hurewicz & Wallman (1941) in their excellent book. It is, however, necessary to recall,
since no reference is made by Hurewicz & Wallman, that the main theorems of the
homological theory of dimension were established in the paper by Alexandroff (1932).

The oriented simplices of $C_\alpha$ will still be denoted by $t_\alpha^r$, the group of all
r-dimensional chains of $C_\alpha$ (with respect to a fixed coefficient-group) and the Betti groups
(the homology, or $\Delta$-, and the 'cohomology', or $\nabla$-, groups) will be denoted by
$L_\alpha^r$ and $\Delta_\alpha^r$, $\nabla_\alpha^r$, respectively.

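Then any projection $\varpi_\alpha^\beta$ generates the homomorphism $\varpi_\alpha^\beta$ of $L_\beta^r$ into $L_\alpha^r$ and
the homomorphism $\pi_\beta^\alpha$ of $L_\alpha^r$ into $L_\beta^r$, determined by

$$\varpi_\alpha^\beta\, t_{\beta j}^r = \sum_i \epsilon_{ij}\, t_{\alpha i}^r, \qquad \pi_\beta^\alpha\, t_{\alpha i}^r = \sum_j \epsilon_{ij}\, t_{\beta j}^r,$$

with $\epsilon_{ij} = 1, -1, 0$ according to $\varpi_\alpha^\beta t_{\beta j}^r = t_{\alpha i}^r$, $= -t_{\alpha i}^r$, $\neq \pm t_{\alpha i}^r$
(see e.g. Alexandroff 1941a, §2). The operators $\varpi_\alpha^\beta$ and $\pi_\beta^\alpha$ are conjugate
homomorphisms and commute with the boundary operators $\Delta$ and $\nabla$ respectively; they
produce therefore homomorphisms $\varpi_\alpha^\beta$, $\pi_\beta^\alpha$ of the Betti groups, namely,
the homomorphism $\varpi_\alpha^\beta$ of $\Delta_\beta^r$ into $\Delta_\alpha^r$ and the homomorphism $\pi_\beta^\alpha$ of $\nabla_\alpha^r$ into $\nabla_\beta^r$.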


The condition (2) in the definition of a simplicial spectrum ensures that these
homomorphisms do not depend on the special choice of the projection of $C_\beta$ in $C_\alpha$.
Thus we have the inverse group spectrum $\{\Delta_\alpha^r, \varpi_\alpha^\beta\}$ and the direct group
spectrum $\{\nabla_\alpha^r, \pi_\beta^\alpha\}$
(see Alexandroff 1941a, §§3, 4; Alexandroff 1943, §§22, 29; or Lefschetz 1942,
chapter vi). The corresponding limit groups are the r-dimensional Betti groups
$\Delta_C^r$, $\nabla_C^r$ of the simplicial spectrum $C$ with respect to the given coefficient-group
(discrete for $\nabla$ and bicompact for $\Delta$).

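The simplest direct definition of the group $\nabla_C^r$ is the following:

Take the set of all r-dimensional $\nabla$-cycles of all $C_\alpha \in C$; the $\nabla$-cycles $z_\alpha^r$ and $z_\beta^r$
belong by definition to the same (r-dimensional) $\nabla$-class $\mathfrak{z}^r$, if there exists a $C_\gamma$
following in $C$ on both $C_\alpha$ and $C_\beta$, such that

$$\pi_\gamma^\alpha z_\alpha^r \sim \pi_\gamma^\beta z_\beta^r \quad \text{in } C_\gamma.$$

The r-dimensional $\nabla$-classes are by definition the elements of $\nabla_C^r$. To get the sum of
two $\nabla$-classes $\mathfrak{z}'^r$ and $\mathfrak{z}''^r$ take arbitrarily $z'^r_\alpha \in \mathfrak{z}'^r$, $z''^r_\beta \in \mathfrak{z}''^r$ and a $C_\gamma$ following in $C$ on
both $C_\alpha$ and $C_\beta$; the $\nabla$-class $\mathfrak{z}^r$ containing

$$\pi_\gamma^\alpha z'^r_\alpha + \pi_\gamma^\beta z''^r_\beta$$

is by definition $\mathfrak{z}'^r + \mathfrak{z}''^r$.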
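
The direct definition just given is the usual description of a direct limit of groups. As a compact restatement (a sketch only: the symbols $\varinjlim$ and $\bigsqcup$ are standard modern notation and are not used in the original, and $\nabla^r_\alpha$ is taken to mean the r-dimensional cohomology group of the complex $C_\alpha$, with classes compared after transport by the homomorphisms $\pi$), one may write:

```latex
\nabla^r_C \;=\; \varinjlim_{\alpha}\,\bigl(\nabla^r_\alpha,\ \pi^\alpha_\beta\bigr)
\;=\; \Bigl(\,\bigsqcup_{\alpha} \nabla^r_\alpha\Bigr)\Big/\!\sim ,
\qquad
\mathfrak{z}_\alpha \sim \mathfrak{z}_\beta
\;\iff\;
\exists\,\gamma \text{ following } \alpha \text{ and } \beta:\quad
\pi^\alpha_\gamma\,\mathfrak{z}_\alpha \;=\; \pi^\beta_\gamma\,\mathfrak{z}_\beta
\ \text{ in } \nabla^r_\gamma .
```

Equality of the transported classes in $\nabla^r_\gamma$ is the same condition as the homology of the transported cycles in $C_\gamma$ required in the text; addition is likewise performed after transport to a common $C_\gamma$.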

Now suppose that in each $C_\alpha$ a closed subcomplex $A_\alpha$ is given. Denote the Betti
groups of $A_\alpha$ by $\Delta^r_{\alpha 0}$, $\nabla^r_{\alpha 0}$. Suppose that for any pair $\beta > \alpha$ and any choice of the
projection $\varpi_\alpha^\beta$ it is given that $\varpi_\alpha^\beta A_\beta \subseteq A_\alpha$. Suppose, furthermore, that for $t_\beta \in A_\beta$
the condition (2) in the definition of a simplicial spectrum always holds with $t_\alpha \in A_\alpha$.
Then the $A_\alpha$ with the mappings $\varpi_\alpha^\beta$ considered on them form a spectrum, which is
called a subspectrum $A = \{A_\alpha, \varpi_\alpha^\beta\}$ of the spectrum $C = \{C_\alpha, \varpi_\alpha^\beta\}$. Denote the open
subcomplex by this subcomplex (more precisely, the system of its 

oriented simphces) is a cell-complex in the sense of Kolmogoroff and Tucker (see 
e.g. Alexandroff 1943 , §§5-7), its Betti groups will be denoted by AJi, Now 
consider any chain on with coefficients belonging to a bicompact group. 
The set of all r-chaios on which coincide with x^i on is denoted by 
and called the A-extension of over (this slight deviation from the definitions 
of Alexandroff ( 1943 ) corresponds to the real state of affairs and is thus reasonable). 
For a chain on II^ with respect to a discrete coefficient-group the trivial 
V-extension over is defined as the chain on 0^ equal to x^j^ on 

and equal to zero on A^ = 

The intersection-operator of Alexandroff ( 1943 , § 11 ) is still denoted by for any 

chain a^ on and any system of chains which are equal to each other on the 
chain J%xX^ is by definition that chain on which is equal there to x^- 
We define for any e e 

This gives a homomorphism of in A^i and therefore an inverse group spec¬ 
trum {AaijUr^i} with the bicompact limit group denoted by A^ and called the 
r-dimensional A-group of C modulo A. 

In a dual way consider for <i€Sai€V;i the V-cycle which is zero on 

Ap so that is a V-cycle on DeiSne ^ 

This gives a homomorphism of V^^l i^^^ (conjugate to the homomorphism 
—see Alexandroff 1943 , §§ 20-22 and 29). In this way the direct spectrum 
TT^J} is obtained with the discrete limit group 

V?^=lim(V;i,77^i) 

called the r-dimensional V-group of C modulo A, To repeat once more: the elements 
of V^ are V-classes, two V-cycles 2:^1 and belonging by definition to the same 
V-class if there exists a, Cy,y>^,y>a such that 

7r^lzli~ n^z}x on H^x. 

Remark 1. A simplicial spectrum is called exact if for each pair $\beta > \alpha$ there is but
one projection $\varpi_\alpha^\beta$ and this projection maps $C_\beta$ on to $C_\alpha$.

Remark 2. Let $\Omega$, $\Omega'$ be two directed sets; suppose that each element of $\Omega'$ is
an element of $\Omega$ and that $\beta > \alpha$ in $\Omega'$ implies $\beta > \alpha$ in $\Omega$. Then $\Omega'$ is called a directed
subset of $\Omega$. A directed subset $\Omega'$ of $\Omega$ is called a confinal subset of the directed set $\Omega$
if for any element $\alpha$ of $\Omega$ there is an element $\beta'$ of $\Omega'$ which follows on $\alpha$ in $\Omega$.





2. Canonical sets and spectra of a space

Throughout this paper 'space', 'mapping of a space', 'covering' mean normal
space, continuous mapping, finite covering, respectively.

Let $R$ be a space, $M$ an arbitrary point set in $R$. We denote by $R[M]$, and sometimes
by $RM$, the closure of $M$ in $R$, by $J_R(M)$ (sometimes $J_R M$, $J(M)$ or even $JM$)
the set of all interior points of $M$ with respect to $R$:

$$J_R(M) = R \setminus R(R \setminus M).$$

A closed set in $R$ is said to be canonical if it is the closure of some open set; an
open set is said to be canonical if it is the set of all interior points of a closed set.
Canonical closed and canonical open sets will be denoted by $A$ and $U$ respectively.
If $A = R(O)$, where $O$ is open in $R$, and $U$ is the set of all interior points of $A$, then
a fortiori

$$A = R(U).$$

If $U = J(E)$, where $E$ is closed in $R$, and $A$ is the closure of $U$, then

$$U = J(A).$$

Thus, for canonical $A$ and $U$, $A = R(U)$ whenever $U = J(A)$, and $U = J(A)$ whenever
$A = R(U)$.

Finally, it is easy to see that canonical open (closed) sets may be defined as
complements to the canonical closed (open) sets.
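
The definitions of closure, interior and canonical set can be checked mechanically on a small example. The following is a minimal sketch, assuming a hypothetical four-point topological space (the point set, the list of open sets and all function names are illustrative and do not come from the paper; such a space is not normal in general and serves only to illustrate the set operations).

```python
# Hypothetical finite topological space, given by its open sets.
POINTS = frozenset({0, 1, 2, 3})
OPEN_SETS = [frozenset(s) for s in ([], [0], [1], [0, 1], [0, 1, 2], [0, 1, 2, 3])]
CLOSED_SETS = [POINTS - o for o in OPEN_SETS]


def closure(m):
    """Closure of m: the intersection of all closed sets containing m."""
    result = POINTS
    for c in CLOSED_SETS:
        if m <= c:
            result &= c
    return result


def interior(m):
    """Interior of m, computed by the formula J(M) = R \\ R(R \\ M)."""
    return POINTS - closure(POINTS - m)


def is_canonical_closed(a):
    """A closed set is canonical if it is the closure of its interior."""
    return a in CLOSED_SETS and a == closure(interior(a))


def is_canonical_open(u):
    """An open set is canonical if it is the interior of its closure."""
    return u in OPEN_SETS and u == interior(closure(u))


canonical_closed = [sorted(a) for a in CLOSED_SETS if is_canonical_closed(a)]
canonical_open = [sorted(u) for u in OPEN_SETS if is_canonical_open(u)]
```

In this toy space the open set {0, 1} is not canonical, since its closure is the whole space, while {0} is canonical and its closure {0, 2, 3} is a canonical closed set.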

A closed (open) covering of the space $R$ is called canonical if the elements of the
covering are canonical. An exact covering is by definition a canonical closed covering,
no two elements of which have interior points in common.


Lemma 2·1. Any closed covering

$$\epsilon = \{E_1, \ldots, E_s\}$$

of $R$ has an exact refinement with a nerve which is a subcomplex of the nerve of $\epsilon$.

Proof. Let $O_1 = R \setminus (E_2 \cup \ldots \cup E_s)$. Obviously $O_1 \subseteq J(E_1)$. Define $A_1 = R(O_1)$,
and suppose that in the closed covering

$$\epsilon_k = \{A_1, \ldots, A_k, E_{k+1}, \ldots, E_s\}$$

the elements $A_i$ are canonical, $A_i \subseteq E_i$, and that no $A_i$ has common interior points
with any other element of $\epsilon_k$. Define

$$O_{k+1} = R \setminus (A_1 \cup \ldots \cup A_k \cup E_{k+2} \cup \ldots \cup E_s) \subseteq J(E_{k+1}),$$
$$A_{k+1} = R(O_{k+1}),$$
$$\epsilon_{k+1} = \{A_1, \ldots, A_{k+1}, E_{k+2}, \ldots, E_s\}.$$

The covering $\epsilon_s$ is exact and a refinement of $\epsilon$. Since any open covering $\omega$ possesses a
closed refinement with a nerve which is a subcomplex of the nerve of $\omega$ there follows:

Corollary 1. Any open covering $\omega$ of $R$ has an exact refinement with a nerve
which is a subcomplex of the nerve of $\omega$.

Furthermore 

Lemma 2·2. Any open covering $\omega$ has a canonical open refinement with a nerve
which is a subcomplex of the nerve of $\omega$.

Proof. Let $\omega = \{o_1, \ldots, o_s\}$. Take the closed $e_i \subseteq o_i$ so as to get a covering of $R$;
then choose the open $o'_i$ under the condition

$$e_i \subseteq o'_i \subseteq R(o'_i) \subseteq o_i$$

and put $U_i = J R(o'_i)$. The covering $\{U_1, \ldots, U_s\}$
has the required properties.

If the covering $\beta$ is a refinement of the covering $\alpha$ but $\alpha$ is not a refinement of $\beta$,
then $\beta$ is said to follow $\alpha$ in the partially ordered set of all coverings of the space $R$
(which is supposed to contain an infinite number of points). In this way the sets of
all open, open canonical, closed, closed canonical, and exact coverings of $R$ become
directed sets.

We now associate with each covering $\alpha$ its nerve $C_\alpha$, and if the covering $\beta$ follows $\alpha$,
we put in correspondence with each element of $\beta$ an element of $\alpha$ containing
it as a subset. This realizes a simplicial mapping $\varpi_\alpha^\beta$ of $C_\beta$ in $C_\alpha$, and the mappings
so obtained are called projections.
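
The passage from coverings to nerves and projections can be sketched computationally. The example below is a minimal illustration, assuming coverings of a hypothetical six-point set (the sets `ALPHA` and `BETA` and all function names are illustrative assumptions, not taken from the paper): it builds the nerve of each covering, chooses one projection and verifies that it is simplicial.

```python
from itertools import combinations


def nerve(cover):
    """Nerve of a finite covering: index sets of elements with a common point."""
    simplices = set()
    for k in range(1, len(cover) + 1):
        for idx in combinations(range(len(cover)), k):
            common = set.intersection(*(set(cover[i]) for i in idx))
            if common:
                simplices.add(frozenset(idx))
    return simplices


def projection(beta, alpha):
    """Send each element of beta to the first element of alpha containing it."""
    return {
        j: next(i for i, a in enumerate(alpha) if set(b) <= set(a))
        for j, b in enumerate(beta)
    }


def is_simplicial(proj, nerve_beta, nerve_alpha):
    """Check that the vertex map carries every simplex onto a simplex."""
    return all(frozenset(proj[v] for v in s) in nerve_alpha for s in nerve_beta)


# Hypothetical coverings of the point set {0, ..., 5}; BETA refines ALPHA.
ALPHA = [{0, 1, 2, 3}, {2, 3, 4, 5}]
BETA = [{0, 1, 2}, {2, 3}, {3, 4, 5}]

C_ALPHA = nerve(ALPHA)
C_BETA = nerve(BETA)
PROJ = projection(BETA, ALPHA)
```

Here the element {2, 3} of `BETA` lies in both elements of `ALPHA`, so two projections are possible; the images of any simplex under the two choices are faces of a common simplex of the nerve of `ALPHA`, in agreement with condition (2) of the definition of a simplicial spectrum.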

In this way we obtain:

(1) The spectrum of all open coverings of $R$.

(2) The spectrum of all open canonical coverings of $R$.

(3) The spectrum of all closed coverings of $R$.

(4) The spectrum of all closed canonical coverings of $R$.

(5) The spectrum of all exact coverings of $R$.

The spectrum of the exact coverings of $R$ is exact and is called the spectrum of $R$.

Among the five spectra just mentioned (2) is a confinal part of (1); and (4), (5) are
confinal parts of (3). It follows that (1), (2) on the one hand and (3), (4), (5) on the
other have the same $\Delta$- and $\nabla$-groups. Furthermore, it follows from the normality
of $R$ that (1) and (3) have the same $\Delta$- and $\nabla$-groups (see e.g. Lefschetz 1942, chapter
vii). Therefore:

All of the spectra (1)-(5) have the same $\Delta$- and the same $\nabla$-groups; these are called
the $\Delta$- and $\nabla$-groups of the space $R$.

Let $E$ be a closed set in $R$. For any covering $\alpha$ of $R$ the covering $E\alpha$ of $E$ is defined
as the set whose elements are the non-empty intersections of $E$ with the elements
of $\alpha$. The nerve of $E\alpha$ is a subcomplex of the nerve $C_\alpha$ of $\alpha$, and this subcomplex is
called $EC_\alpha$. The subcomplexes $EC_\alpha$ form a subspectrum $EC$ of the spectrum $C$. This is
of course true if we take for the spectrum $C$ any of the spectra (1)-(5). In particular,
the spectrum $EC$ is exact if $C$ is the spectrum (5). In all of these cases the $\Delta$- and
$\nabla$-groups of $C$ mod $EC$ are the same; they are called the $\Delta$- and $\nabla$-groups of $R$ mod $E$.

Let $f$ be a mapping of the space $R$ into the space $R^*$. This mapping determines a
homomorphism $H_f^*$ (or $H^*$) of $\nabla^r_{R^*}$ in $\nabla^r_R$ and a homomorphism $H_f$ (or $H$) of $\Delta^r_R$ in
$\Delta^r_{R^*}$, these homomorphisms being conjugate:

$$(Hx \cdot x^*) = (x \cdot H^*x^*) \quad \text{for } x^* \in \nabla^r_{R^*},\ x \in \Delta^r_R \qquad (1)$$

(the scalar product is taken in the sense of the Pontrjagin character theory; see
e.g. Alexandroff 1943, §1).

The homomorphisms $H^*$, $H$ can be defined on the basis of any one of the five
spectra (1)-(5) and this without influence on the final result. Let us use, for instance,
the spectrum of the open coverings. Take any $\nabla$-class $\mathfrak{z}^* \in \nabla^r_{R^*}$ and any $z_{\alpha^*} \in \mathfrak{z}^*$. Let

$$\alpha^* = \{O_1^*, \ldots, O_s^*\}, \qquad \alpha = \{O_1, \ldots, O_s\}$$

(among the inverse images $O_i = f^{-1}O_i^*$ of course only those are taken which are not
empty). The correspondence $O_i \to O_i^*$ realizes a simplicial mapping $h$ of the nerve $C_\alpha$
of $\alpha$ into the nerve $C_{\alpha^*}$ of $\alpha^*$, and this mapping transforms the $\nabla$-cycle $z_{\alpha^*}$ into the
$\nabla$-cycle $z_\alpha$ defined by

$$z_\alpha(t_\alpha) = z_{\alpha^*}(h t_\alpha)$$

for any oriented simplex $t_\alpha$ of $C_\alpha$. The $\nabla$-class containing $z_\alpha$ is by definition the
element $H_f^* \mathfrak{z}^*$ of $\nabla^r_R$.

In what follows the homomorphism $H_f$ will not be needed; as a matter of fact it
is completely determined by (1).


Two important special cases of the homomorphism $H_f^*$

(1) Let $R$ be a closed set in the space $R^*$ and $f$ the identical mapping of $R$ into $R^*$.
The corresponding homomorphisms $H_f^*$ and $H_f$ are nothing but the intersection-
and the extension-homomorphisms (operating respectively on $\nabla^r_{R^*}$
and $\Delta^r_R$) of my previous paper (Alexandroff 1943).

An element $\mathfrak{z}_R$ of $\nabla^r_R$ is called extendable (over $R^*$) if it is the image under the
homomorphism $H_f^*$ of some element $\mathfrak{z}_{R^*}$ of $\nabla^r_{R^*}$; each of these $\mathfrak{z}_{R^*}$ is called an
extension of the extendable element $\mathfrak{z}_R$.

(2) Let $f$ be a mapping of the space $R$ into the n-dimensional sphere $S^n$. As the
group $\nabla^n_{S^n}$ (with the group of all integers as coefficient-group) is infinite cyclic, a
generator of this group may be chosen (one of the two possible); this generator $\mathfrak{z}^n$
is called an orientation of the sphere $S^n$; thus $f$ becomes a mapping of $R$ into the
oriented sphere $S^n$. The image of $\mathfrak{z}^n$ under the corresponding homomorphism
$H_f^*$ is called the degree of $f$. In § 5 the following two theorems will be proved:

Theorem 2·1 (the Hopf extension theorem). Let $R$ be a bicompactum of dimension
$\leq n+1$ and $f_0$ a mapping of a closed set $E \subseteq R$ into the oriented n-sphere $S^n$. In
order that the mapping $f_0$ be extendable over $R$ it is necessary and sufficient that
its degree $\mathfrak{d}_0^n$ be so extendable. Moreover, for any extension $\mathfrak{d}^n$ of $\mathfrak{d}_0^n$ there exists an
extension $f$ of $f_0$ with the degree $\mathfrak{d}^n$.

Theorem 2·2 (the Hopf classification theorem). Let $R$ be a bicompactum of
dimension $\leq n$. The homotopy classes of mappings of $R$ into the oriented n-sphere $S^n$
are in (1:1)-correspondence with the elements of $\nabla^n_R$: to each element $\mathfrak{z}^n \in \nabla^n_R$
there correspond mappings of $R$ in $S^n$ with the degree $\mathfrak{z}^n$, and two mappings are
homotopic to each other if and only if they have the same degree.

Our proof of these theorems will be practically identical with the proof by
Hurewicz & Wallman (1941) for the metric case of the same theorems. The most
elementary part of theorem 2·2 will be proved in this section, namely:

2·20. Two homotopic mappings of a bicompactum in a sphere have the same
degree.

This theorem is a special case of

Theorem 2·3. Let $f$ be a mapping of the bicompactum $R$ into the compactum $R^*$.
Then for any element $\mathfrak{z}^*$ of $\nabla^r_{R^*}$ there is a positive number $\delta$ with the property:
for every mapping $g$ of $R$ in $R^*$ whose distance from $f$ is $< \delta$

$$H_f^* \mathfrak{z}^* = H_g^* \mathfrak{z}^*.$$

Theorem 2·3 admits a further generalization which will be given after an auxiliary
definition.

Suppose the spaces $R$, $R^*$ are given and let $\omega^*$ be an open covering of $R^*$. Two
mappings $f$, $g$ of $R$ into $R^*$ are said to be $\omega^*$-modifications of each other if for
every point $x \in R$ the two points $f(x)$ and $g(x)$ belong to some element of $\omega^*$. Now
the following theorem will be proved (which implies 2·3 and a fortiori 2·20):

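Theorem 2·4. Let $f$ be a mapping of the bicompactum $R$ into the bicompactum
$R^*$. Then for any element $\mathfrak{z}^*$ of $\nabla^r_{R^*}$ there exists an open covering

$$\omega^* = \{o_1^*, \ldots, o_u^*\}$$

of $R^*$ with the following property: if $g$ is any $\omega^*$-modification of $f$ then $H_f^* \mathfrak{z}^* = H_g^* \mathfrak{z}^*$.

Proof. Let $z_{\alpha^*} \in \mathfrak{z}^*$ with $\alpha^* = \{O_1^*, \ldots, O_s^*\}$. Take the covering

$$\alpha_f = \{O_1^f, \ldots, O_s^f\}, \quad \text{where } O_i^f = f^{-1}O_i^* \neq \Lambda.$$

There exists a covering

$$\beta = \{O_{\beta 1}, \ldots, O_{\beta s}\}, \qquad R(O_{\beta i}) \subseteq O_i^f,$$

and a covering $\omega^* = \{o_1^*, \ldots, o_u^*\}$
with the property: if $f(R(O_{\beta i})) \cap o_j^* \neq \Lambda$ then $o_j^* \subseteq O_i^*$.

Now let $g$ be an $\omega^*$-modification of $f$. If $x \in R(O_{\beta i})$ then both $f(x)$ and $g(x)$ are points
of an $o_j^*$ and $f(x) \in f(R(O_{\beta i})) \cap o_j^*$; thus

$$o_j^* \subseteq O_i^*, \qquad g(x) \in O_i^*.$$

This implies $O_{\beta i} \subseteq O_i^g = g^{-1}O_i^*$. Thus $\beta$ is a refinement of both $\alpha_f$ and $\alpha_g = \{O_i^g\}$.

It will be shown that $z_{\alpha^*}$ is carried by both $\pi_\beta^{\alpha_f} H_f^*$ and $\pi_\beta^{\alpha_g} H_g^*$ into the same $z_\beta$. In
fact, let $t_\beta = (O_{\beta i_0}, \ldots, O_{\beta i_r})$ be a simplex of the nerve $C_\beta$ of $\beta$. Then

$$t_f = (O_{i_0}^f, \ldots, O_{i_r}^f) \quad \text{and} \quad t_g = (O_{i_0}^g, \ldots, O_{i_r}^g)$$

are simplices of $C_{\alpha_f}$ and $C_{\alpha_g}$ respectively, and the values of $\pi_\beta^{\alpha_f} H_f^* z_{\alpha^*}$ and of $\pi_\beta^{\alpha_g} H_g^* z_{\alpha^*}$
on $t_\beta$ are equal to the corresponding values of $H_f^* z_{\alpha^*}$ on $t_f$ and of $H_g^* z_{\alpha^*}$ on $t_g$;
the latter ones are equal to the value of $z_{\alpha^*}$ on $t^* = (O_{i_0}^*, \ldots, O_{i_r}^*)$. Thus the $\nabla$-class
of $R$ containing $z_\beta$ is at the same time $H_f^* \mathfrak{z}^*$ and $H_g^* \mathfrak{z}^*$, q.e.d.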


3. The bicompact case

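Let $C = \{C_\alpha, \varpi_\alpha^\beta\}$ be an exact spectrum; if in each $C_\alpha \in C$ an element $t_\alpha \in C_\alpha$ is taken
in such a way that $\varpi_\alpha^\beta t_\beta = t_\alpha$ whenever $\beta > \alpha$, then the system $\xi = \{t_\alpha\}$ so obtained
is called a thread of the spectrum $C$; the thread $\xi' = \{t'_\alpha\}$ is called a part (or a
sub-thread) of the thread $\xi$ if $t'_\alpha \leq t_\alpha$ for all $\alpha$. A thread $\xi$ is called maximal if $\xi$ is
not a part of any thread different from it.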
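
Threads and maximal threads can be enumerated directly when the spectrum is finite. The following is a minimal sketch, assuming a hypothetical spectrum with only two complexes and a single projection between them (the complexes, the vertex map and the function names are illustrative assumptions; a genuine spectrum of a bicompactum has infinitely many members, so this toy case shows the definitions only, not the correspondence with points).

```python
# Hypothetical two-member spectrum: C_BETA follows C_ALPHA, with one projection.
C_ALPHA = {frozenset({0}), frozenset({1}), frozenset({0, 1})}
C_BETA = {frozenset({0}), frozenset({1}), frozenset({2}),
          frozenset({0, 1}), frozenset({1, 2})}
VERTEX_MAP = {0: 0, 1: 0, 2: 1}  # vertex map of the projection of C_BETA on to C_ALPHA


def project(simplex):
    """Image of a simplex of C_BETA under the projection."""
    return frozenset(VERTEX_MAP[v] for v in simplex)


def threads():
    """All pairs (t_alpha, t_beta) with t_alpha the projection of t_beta."""
    return [(project(t), t) for t in C_BETA]


def is_part(xi_small, xi_big):
    """xi_small is a part of xi_big if each simplex is a face of the other's."""
    return all(a <= b for a, b in zip(xi_small, xi_big))


def maximal_threads():
    """Threads that are not a part of any thread different from them."""
    all_threads = threads()
    return [x for x in all_threads
            if not any(x != y and is_part(x, y) for y in all_threads)]
```

In this example there are five threads, of which two are maximal; every other thread is a part of one of them.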

An exact spectrum $C = \{C_\alpha, \varpi_\alpha^\beta\}$ defines a bicompact space $R_C$ in the following
way (Kurosch 1935). The maximal threads of $C$ are the points of $R_C$; the
neighbourhood $O_\alpha(\xi)$ of $\xi = \{t_\alpha\}$ is defined as the set of all points $\xi' = \{t'_\alpha\}$ such that
$t'_\alpha \leq t_\alpha$. The attention of the reader is called to the following fundamental theorem:

Theorem 3·0 (Kurosch 1935). Let $R$ be a bicompactum and $C = \{C_\alpha, \varpi_\alpha^\beta\}$ its
spectrum. To any maximal thread $\xi = \{t_\alpha\}$ of $C$ there corresponds the point

$$f(\xi) = \bigcap_\alpha M(t_\alpha),$$

where $M(t_\alpha)$ is the intersection of the elements of $\alpha$ corresponding to the vertices
of $t_\alpha$. The function $f$ establishes a homeomorphism between the spaces $R_C$ and $R$.

Thus the maximal threads of $C$ are in (1:1)-correspondence with the points of $R$.
Now a (1:1)-correspondence will be constructed between the closed subsets of $R$
and certain subspectra of $C$ called maximal. In this construction $C = \{C_\alpha, \varpi_\alpha^\beta\}$ still
denotes the spectrum of the bicompactum $R$.

Lemma 3·1. Let $t_\alpha$ belong to $C_\alpha$, $\alpha$ being fixed. There always exists a thread
containing $t_\alpha$ as an element.

Proof. For any $\beta > \alpha$ take all simplices $t_\beta$ belonging to the inverse image of $t_\alpha$ under
$\varpi_\alpha^\beta$ (i.e. all $t_\beta$ satisfying $\varpi_\alpha^\beta t_\beta = t_\alpha$). Now take all projections of the simplices just
obtained. Denote by $S$ the resulting set of simplices. I assert that the only element
of S belonging to G^ is In fact, if e S’, then by definition where either 

= ^a. thus — ta, OT — vj^ty with y > yff and rnlty = Then y> and 
^ct = The sct S has, moreover, the property that the 

projection of any element of S still belongs to S. Finally it can be shown that for 
any tjjcS and y there exists a i^cS with w}ty = This is evident for ^ = a. 




Let y? 4= a. Then where w^tY = Take y" > 7 , 7 ' and ty>^ with wlf^ ty^ = ty. 

Then w'![ty>> = wXwlf4y» = thus ty-eS and wlfty^eS. Set m'fty. = ty. Then 

= m^pwlyty^ = wrfwpty = vf^fty = 

Thus $S$ satisfies the conditions of Kurosch's lemma of his paper (1935, p. 472),
and therefore contains a thread; as the only element of $S$ belonging to $C_\alpha$ is $t_\alpha$,
this $t_\alpha$ necessarily belongs to the thread, q.e.d.

Let us call the closed subcomplex $A'_\alpha$ an extension of the subcomplex $A_\alpha$ of $C_\alpha$
if $A_\alpha \subseteq A'_\alpha$ and each maximal simplex of $A'_\alpha$ belongs to the open star of some simplex
of $A_\alpha$. The subcomplex $A_\alpha$ itself is by definition its own trivial extension.

The subspectrum $A' = \{A'_\alpha, \varpi_\alpha^\beta\}$ of the spectrum $C = \{C_\alpha, \varpi_\alpha^\beta\}$ is called an extension
of the subspectrum $A = \{A_\alpha, \varpi_\alpha^\beta\}$ of $C$ if every $A'_\alpha$ is an extension of $A_\alpha$ (as subcomplex
of $C_\alpha$). Again, the subspectrum $A$ is its own trivial extension. The subspectrum $A$ of
$C$ is called maximal if it has no extensions except the trivial one.

The purpose of the present section is to prove the following three assertions:

(1) Any maximal subspectrum $A = \{A_\alpha, \varpi_\alpha^\beta\}$ of the spectrum $C = \{C_\alpha, \varpi_\alpha^\beta\}$
satisfies the condition (S):

Every thread of $A$ which is maximal in $A$ is maximal in $C$.

(2) Every subspectrum $A$ of $C$ satisfying the condition (S) can be put in the form

$$A = EC,$$

where $E$ is a closed subset of the bicompactum $R$.

(3) Every subspectrum $A$ of $C$ of the form $A = EC$ (with a closed $E \subseteq R$) is maximal.

These three assertions are obviously equivalent to the following two theorems:

Theorem 3·1. The condition (S) is necessary and sufficient in order that the
subspectrum $A$ be maximal.

Theorem 3·2. In order that a subspectrum $A$ of the spectrum $C$ be of the form
$A = EC$ it is necessary and sufficient that $A$ be maximal.

Before proceeding to the proof of these theorems we note that to different
sets $E$ there correspond different subspectra $EC$. In fact, if $x$ is a point which
belongs, say, to $E'$ but not to $E''$, let an (exact) covering $\alpha$ be chosen with an
element $e_\alpha$ containing the point $x$ but no point of $E''$. Then the vertex $e_\alpha$ belongs to
$E'C_\alpha = A'_\alpha$ and does not belong to $E''C_\alpha = A''_\alpha$. Thus the subspectra $A' = E'C$ and
$A'' = E''C$ are different. Therefore theorem 3·2 shows that the operator $EC$ (with
variable $E$) realizes a (1:1)-correspondence between the system of the closed subsets
of $R$ and that of the maximal subspectra of $C$.

The proof of (1) is rather obvious: let $\xi = \{t_\alpha\}$ be a thread which is maximal in $A$
but not in $C$. Let $\xi' = \{t'_\alpha\}$ be a non-trivial extension of this thread; add to each $A_\alpha$
the simplex $t'_\alpha$ with all its faces. The complexes $A'_\alpha$ so obtained form a non-trivial
extension of $A$.

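Proof of (2). Let $A = \{A_\alpha, \varpi_\alpha^\beta\}$ be a subspectrum of the spectrum $C = \{C_\alpha, \varpi_\alpha^\beta\}$.
Suppose that $A$ satisfies the condition (S). Then to each maximal thread of the
spectrum $A$ there corresponds a point of $R$; the point set $E$ of $R$ so obtained, being
obviously homeomorphic to the space defined by $A$, is thus bicompact and therefore
closed in $R$. It remains to show that $A = EC$. To this end take any

$$t_\alpha = (e_{\alpha 0} \ldots e_{\alpha r}) \in EC_\alpha$$

and a point $x \in E \cap E_{\alpha 0} \cap \ldots \cap E_{\alpha r}$. By the definition of $E$ the point $x$ may be identified
with a maximal thread $\xi = \{t'_\alpha\}$ of $A$; the simplex $t_\alpha$ is a face of the simplex $t'_\alpha$ of $A_\alpha$;
hence $t_\alpha \in A_\alpha$. Since this is proved for an arbitrary $t_\alpha \in EC_\alpha$, $EC_\alpha \subseteq A_\alpha$. To prove the
inclusion $A_\alpha \subseteq EC_\alpha$, consider any $t_\alpha \in A_\alpha$ and take a maximal thread $\xi = \{t'_\beta\}$ of $A$
with $t'_\alpha \geq t_\alpha$ (such a thread exists by lemma 3·1). The point $x \in E$ corresponding to $\xi$ is
contained in all elements of $\alpha$ corresponding to the vertices of $t'_\alpha$, in particular, to
the vertices of $t_\alpha$; therefore $t_\alpha \in EC_\alpha$, q.e.d.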

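Proof of (3). Suppose that the subspectrum $A = EC = \{A_\alpha, \varpi_\alpha^\beta\}$ is not maximal.
Let $A' = \{A'_\alpha, \varpi_\alpha^\beta\}$ be a non-trivial extension of $A$; in particular, let for a fixed $\alpha$
the subcomplex $A'_\alpha$ be a non-trivial extension of $A_\alpha$. Finally, let

$$t_\alpha = (e_{\alpha 0} \ldots e_{\alpha r}) \in A_\alpha,$$
$$t'_\alpha = (e_{\alpha 0} \ldots e_{\alpha r} \ldots e_{\alpha m}) \in A'_\alpha \setminus A_\alpha.$$

The vertices $e_{\alpha 0}, \ldots, e_{\alpha r}$ correspond both to the elements $E_{\alpha 0}, \ldots, E_{\alpha r}$ of $\alpha$ and to the
elements $E \cap E_{\alpha 0}, \ldots, E \cap E_{\alpha r}$ of $E\alpha$. According to lemma 3·1 take a thread
$\xi' = \{t'_\beta\}$ of the spectrum $A'$ containing $t'_\alpha$. Let, for any $\beta > \alpha$,

$$t'_\beta = (e_{\beta 0} \ldots e_{\beta p}).$$

The spectrum $A'$ being an extension of $A$, there is certainly a simplex $t''_\beta$ in $C_\beta$
satisfying both

$$t''_\beta \geq t'_\beta \quad \text{and} \quad t''_\beta \geq t_\beta \ \text{for some } t_\beta \in A_\beta.$$

In other words, any element of $\beta$ corresponding to a vertex of $t'_\beta$ (even to a vertex
of $t''_\beta$) has common points with an element of $\beta$ intersecting $E$. Now take a maximal
extension $\bar\xi = (\ldots \bar t_\beta \ldots)$ (in $C$) of $\xi'$. The thread $\bar\xi$ may be identified with a point
$x$ of $R$ which for any $\beta > \alpha$ belongs to an element of $\beta$ having common points with
some element intersecting $E$. As this is true for every $\beta > \alpha$, it is seen that $x \in E$. On
the other hand, $x$ belongs to all elements of $\alpha$ corresponding to vertices of $\bar t_\alpha$ and, in
particular, to those of $t'_\alpha$. Hence it follows that $t'_\alpha \in EC_\alpha = A_\alpha$, contrary to our hypothesis.

Thus the assertion (3), and with it theorems 3·1 and 3·2, are completely proved.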

Remark. Since the spectrum $C$ (being the exact spectrum of the bicompactum $R$)
is a Hausdorff spectrum, any thread of $C$ is contained in only one maximal thread.
Hence

Corollary 3·1. If $A$ is a maximal subspectrum of $C$, then $A$ contains, together
with any thread, its maximal extension (in $C$).



Consider in the spectrum $C = \{C_\alpha, \varpi_\alpha^\beta\}$ a system of closed subcomplexes $A_\alpha \subseteq C_\alpha$
(for all $\alpha$) such that for every $\beta > \alpha$ the projection $\varpi_\alpha^\beta$ maps $A_\beta$ into $A_\alpha$ (not
necessarily on to $A_\alpha$). Such systems (with the corresponding projections) will be called
for a moment 'incomplete' subspectra of $C$. In order that an incomplete
subspectrum $A = \{A_\alpha, \varpi_\alpha^\beta\}$ be a subspectrum it is necessary and sufficient that every
$t_\alpha \in A_\alpha \in A$ can be included in a thread $\xi = (\ldots t_\alpha \ldots)$ (of $A$). Two incomplete
subspectra are called equivalent if they have the same threads and the incomplete
subspectrum $A' = \{A'_\alpha, \varpi_\alpha^\beta\}$ is called a reduction of $A = \{A_\alpha, \varpi_\alpha^\beta\}$ if $A$ and $A'$ are
equivalent and $A'_\alpha \subseteq A_\alpha$ for all $\alpha$. Applying the reduction process transfinitely for
the incomplete subspectrum $A = \{A_\alpha, \varpi_\alpha^\beta\}$ a reduction $A^0 = \{A^0_\alpha, \varpi_\alpha^\beta\}$ is obtained
which is a subspectrum of $C$. It is easily seen that each $A^0_\alpha$ has as elements all those
and only those $t_\alpha \in A_\alpha$ which are contained in the threads $(\ldots t_\alpha \ldots)$ of $A$.
Thus, the complete reduction $A^0$ of the incomplete subspectrum $A$ is uniquely
determined.

Now let two subspectra $A' = \{A'_\alpha, \varpi_\alpha^\beta\}$ and $A'' = \{A''_\alpha, \varpi_\alpha^\beta\}$ of the spectrum $C$ be
given. Then the complexes $A'''_\alpha = A'_\alpha \cap A''_\alpha$ (with the projections $\varpi_\alpha^\beta$) form an
incomplete subspectrum $A''' = \{A'''_\alpha, \varpi_\alpha^\beta\}$. Its complete reduction is called the
intersection of the subspectra $A'$ and $A''$ and is denoted by $A' \cap A''$.

Now, from corollary 3·1 there follows

Corollary 3·2. The intersection of two maximal subspectra of $C$ is a maximal
subspectrum.

Moreover:

Corollary 3·3. If $A' = E'C$, $A'' = E''C$, and $A = A' \cap A''$, $E = E' \cap E''$, then
$A = EC$.

Proof of Corollaries 3·2 and 3·3. Any maximal thread $\xi$ of $A$ is a thread of both $A'$
and $A''$; these subspectra being maximal, the maximal extension of $\xi$ (with respect
to $C$) is contained in $A'$ as well as in $A''$ and thus in $A$; therefore $\xi$ is a maximal
thread in $C$, which proves the maximality of the subspectrum $A$. Furthermore, as
has just been shown, the point $x \in R$ corresponding to the given maximal thread $\xi$
of $A$ belongs to both $E'$ and $E''$. On the other hand, if $x \in E$ then the maximal thread
corresponding to $x$ is a thread of both $A'$ and $A''$, and thus a thread of $A$. In other
words, the maximal threads of $A$ are just the threads corresponding to the points
of $E$. This, together with the argument used in the proof of the assertion (3),
completes the proof of corollary 3·3.

Now we say that the spectrum $C = \{C_\alpha, \varpi_\alpha^\beta\}$ is the
sum of its maximal subspectra $C' = \{C'_\alpha, \varpi_\alpha^\beta\}$ and $C'' = \{C''_\alpha, \varpi_\alpha^\beta\}$ if $C_\alpha = C'_\alpha \cup C''_\alpha$ for
each $\alpha$.

With these definitions, the (1:1)-correspondence between the maximal subspectra
of the spectrum $C$ (of the bicompactum $R$) and the closed subsets of $R$ is an isomorphism
with respect to both addition and intersection.





4. Dimension 

The dimension of the nerve of a covering is called the dimension of the covering. 

Let $R$ be a normal space. If each open covering $\omega$ of $R$ has an n-dimensional
(open) refinement $\epsilon$, $R$ is said to be at most n-dimensional. In this definition we
may suppose $\epsilon$ to be closed instead of open, or to be open canonical, closed
canonical, exact. If $R$ is at most n-dimensional but not at most (n - 1)-dimensional,
the dimension of $R$ is said to be $n$:

$$\dim R = n.$$
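
The dimension of a covering is a purely combinatorial quantity and is easy to compute. The sketch below assumes coverings of a hypothetical six-point set (the sets `OMEGA` and `EPSILON` and the function names are illustrative assumptions); it shows only how the dimension of a covering and the refinement relation are evaluated, and makes no claim about the dimension of any particular space.

```python
def covering_dimension(cover):
    """Dimension of the nerve: (largest number of elements sharing a point) - 1."""
    points = set().union(*cover)
    return max(sum(1 for element in cover if p in element) for p in points) - 1


def is_refinement(fine, coarse):
    """Every element of `fine` is contained in some element of `coarse`."""
    return all(any(f <= c for c in coarse) for f in fine)


# Hypothetical coverings of the point set {0, ..., 5}.
OMEGA = [frozenset({0, 1, 2, 3, 4}), frozenset({1, 2, 3, 4, 5}), frozenset({0, 1, 4, 5})]
EPSILON = [frozenset({0, 1, 2}), frozenset({2, 3}), frozenset({3, 4, 5})]

dim_omega = covering_dimension(OMEGA)      # three elements share the point 1
dim_epsilon = covering_dimension(EPSILON)  # at most two elements share a point
```

Here `EPSILON` is a 1-dimensional refinement of the 2-dimensional covering `OMEGA`; the definition of dimension asks for such a refinement of every open covering.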

It is one of the main purposes of this paper to prove 

Theorem 4·1. If the dimension of the normal space $R$ is $n$, then the exact spectrum
$C$ of $R$ contains a maximal subspectrum $C_0$ such that†

$$\nabla^n(C \bmod C_0, J) \neq 0 \quad (\text{dually: } \Delta^n(C \bmod C_0, \mathfrak{J}) \neq 0),$$

while for $r > n$, for any maximal subspectrum $C_0$ and for any coefficient-group, the
r-dimensional Betti groups of $C$ mod $C_0$ are zero.

Theorem 4·2. If the dimension of the normal space $R$ is $n$, then $R$ contains a
closed (and even closed canonical) set $E$ such that

$$\nabla^n(R \bmod E, J) \neq 0 \quad (\text{dually: } \Delta^n(R \bmod E, \mathfrak{J}) \neq 0),$$

while for $r > n$, for any closed $E \subseteq R$ and any coefficient-group the r-dimensional
Betti groups of $R$ mod $E$ are zero.

Owing to theorem 3·2 the theorems 4·1 and 4·2 are immediately equivalent if $R$
is bicompact. In this special case both theorems will be proved in § 5; the general
case will be solved in § 6. In the present section some important but more elementary
theorems will be proved. In the proofs given I often follow my paper (1940).

Preliminary definition. A closed $E$ in the normal $R$ is called correct if there exists
a continuous real-valued function $f$ on $R$ such that $f(x) = 0$ if and only if $x \in E$. One
can easily see that the correct closed sets are nothing but those closed sets which are
intersections of an enumerable system of open sets. The complements $R \setminus E$ of
correct closed sets $E \subseteq R$ are called correct open sets.

Lemma 4·1. The closed set $E$ and its neighbourhood $O(E)$ are arbitrarily chosen
in the normal space $R$. Then there exists a correct closed set $A$ which contains $E$
and is contained in $O(E)$.

Proof. By the classical Urysohn lemma there exists a continuous function $f$
defined on the whole space $R$ and such that

$$f(x) = 0 \text{ on } E, \qquad f(x) = 1 \text{ on } R \setminus O(E).$$

Define $A$ as the set of all $x \in R$ with $f(x) = 0$.

† The additive group of all integers is denoted by $J$, its character-group (i.e. the continuous
group of all real numbers reduced mod 1) is denoted by $\mathfrak{J}$.




It will be convenient for us to bring lemma 4·1 into the following form:

Lemma 4·2. For any open set $O$ in the normal space $R$ and for an arbitrary closed
$E \subseteq O$ there exists a correct open set $U$ such that $E \subseteq U \subseteq O$.

Proof. Take the correct closed set $A$ under the condition $R \setminus O \subseteq A \subseteq R \setminus E$; then
$U = R \setminus A$ is a correct open set, and $E \subseteq U \subseteq O$.

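An open (closed) covering whose elements are correct is itself called correct.
From lemmas 4·2 and 4·1 follow respectively:

Lemma 4·3. To any open (closed) covering

$$\epsilon = \{\Xi_1, \ldots, \Xi_s\}$$

there corresponds a correct open (closed) covering

$$\epsilon' = \{\Xi'_1, \ldots, \Xi'_s\}$$

such that in the open case $\Xi'_i \subseteq \Xi_i$ while in the closed case $\Xi_i \subseteq \Xi'_i$; and the coverings
$\epsilon$ and $\epsilon'$ have the same nerve. More precisely: if the closed set $E_i \subseteq \Xi_i$ (the open set
$O_i \supseteq \Xi_i$) are given for $i = 1, 2, \ldots, s$, then the $\Xi'_i$ can be chosen to satisfy the
conditions $E_i \subseteq \Xi'_i \subseteq \Xi_i$ and $\Xi_i \subseteq \Xi'_i \subseteq O_i$ in the open and the closed case respectively.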

Let $\omega$ be an open covering of the normal space $R$ and $f$ a continuous mapping
of $R$ in a space $R'$. This mapping is called an $\omega$-mapping if every point $x' \in R'$
has a neighbourhood whose inverse image under $f$ is contained in some element
of $\omega$.

Remark. In the case of a closed mapping $f$ it is sufficient to suppose that the
inverse image of every point $x' \in R'$ is contained in some $o \in \omega$. In fact, if $f^{-1}(x') \subseteq o \in \omega$
the set $f(R \setminus o)$ is closed in $R'$ and $R' \setminus f(R \setminus o)$ is a neighbourhood of $x'$ with an
inverse image lying in $o$.

Let $R' = C$ be a polytope given in a definite simplicial decomposition $C$; among
the $\omega$-mappings of the normal space $R$ into $R'$ of special interest to us are those
which satisfy the condition:

The elements of $\omega = \{o_1, \ldots, o_s\}$ are in (1:1)-correspondence with the vertices
$e_i$ of $C$, and each $o_i$ is the inverse image of the open star of the corresponding vertex
of $C$ (it follows that $C$ is the nerve of $\omega$).

These $\omega$-mappings will be called $(\omega, C)$-mappings;† they are characterized by the
following construction:

Suppose that for $i = 1, \ldots, s$ the continuous non-negative functions $\mu_i(x)$ are
defined on $R$ by the condition that $\mu_i(x) = 0$ if and only if $x \in R \setminus o_i$ (this implies
that $\omega$ is correct). Define for any $x \in R$ the point $f(x)$ as the centre of the masses
$\mu_i(x)$ placed respectively at the points $e_i$. This centre lies in the simplex of $C$ spanned
by the vertices corresponding to the elements of $\omega$ containing $x$. The mapping $f$ so
defined is an $(\omega, C)$-mapping; conversely, every $(\omega, C)$-mapping can be obtained in
this way: to get the $\mu_i(x)$ take the (open) simplex $(e_{i_0} \ldots e_{i_r})$ to which $f(x)$ belongs
and set $\mu_i(x) = 0$ if $i$ differs from all $i_0, \ldots, i_r$, while for $i = i_k$ the value $\mu_i(x)$ is defined
as the barycentric coordinate of $f(x)$ corresponding to the vertex $e_{i_k}$ in $(e_{i_0} \ldots e_{i_r})$.

† After Kuratowski I studied these mappings in 1940; Hurewicz & Wallman call them
'barycentric' $\omega$-mappings.

This same construction leads to

Lemma 4·4. Let $\omega$ be a correct open covering of the normal space $R$; let $C$ be
the nerve of $\omega$ and $R' = C$ a polytope with the simplicial decomposition $C$. Then
there exists an $(\omega, C)$-mapping of $R$ into $R'$.
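
The centre-of-masses construction behind this lemma can be carried out numerically. The following is a minimal sketch, assuming the segment [0, 1] with a hypothetical covering by three open intervals and a hypothetical placement of the vertices in the plane (`COVER`, `VERTICES` and the function names are illustrative assumptions); the mass attached to an interval is taken to be the distance of the point from its complement, which vanishes exactly outside the interval.

```python
# Hypothetical open covering of the segment [0, 1] by three intervals (a, b),
# and a hypothetical realization of the vertices of its nerve in the plane.
COVER = [(-0.5, 0.4), (0.2, 0.8), (0.6, 1.5)]
VERTICES = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0)]


def mu(interval, x):
    """Distance from x to the complement of the open interval; 0 iff x is outside."""
    a, b = interval
    return max(0.0, min(x - a, b - x))


def barycentric_weights(x):
    """Masses mu_i(x), normalized to sum to 1 (x is assumed to lie in [0, 1])."""
    masses = [mu(o, x) for o in COVER]
    total = sum(masses)  # positive, because the intervals cover [0, 1]
    return [m / total for m in masses]


def f(x):
    """Centre of the masses mu_i(x) placed at the vertices e_i."""
    w = barycentric_weights(x)
    return tuple(sum(wi * v[k] for wi, v in zip(w, VERTICES)) for k in range(2))
```

The i-th barycentric coordinate of `f(x)` is positive exactly when `x` lies in the i-th interval, so each interval is the inverse image of the open star of its vertex, as the definition of an $(\omega, C)$-mapping requires.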

This result, together with lemma 4·3, implies

Lemma 4·5. To any open covering $\omega$ of an n-dimensional normal space correspond
$\omega$-mappings into n-dimensional polytopes.

Using the Wallman-Hurewicz procedure of their p. 73 the mappings into in
Lemma 4·5 may be replaced by mappings on to.

On the other hand:

Lemma 4·6. For every n-dimensional $R$ there exists such an open covering $\omega$
that no $\omega$-mapping of $R$ into a bicompactum of dimension $\leq n - 1$ is possible.

Proof. Suppose that the contrary is true and that to any open covering $\omega$ of the
given $R$ there corresponds a bicompactum $R'_\omega$ of dimension $\leq n - 1$ and an
$\omega$-mapping $f_\omega$ of $R$ into $R'_\omega$. Take for any point $x' \in R'_\omega$ a neighbourhood $O_{x'}$ of $x'$ such
that its inverse image lies in some element of $\omega$. Choose a finite covering
$\{o'_1, \ldots, o'_s\} = \omega'$ of $R'_\omega$ composed of some of the above $O_{x'}$. Let $\omega'' = \{o''_1, \ldots, o''_u\}$ be an
(n - 1)-dimensional refinement of $\omega'$. The inverse images $o_i = f_\omega^{-1} o''_i$ form an
(n - 1)-dimensional refinement of $\omega$. Since $\omega$ was arbitrary, this contradicts the
assumption that $R$ is n-dimensional.

The following fundamental theorem has just been proved:

Theorem 4·3. The dimension of the normal space $R$ is the least among the
integers $n$ with the property: for an arbitrary open covering $\omega$ of $R$ there exists an
$\omega$-mapping of $R$ on to an n-dimensional polytope. In this formulation the words
'on to' may be replaced by 'into'.

Let $f$ be a mapping of the normal space $R$ into $R'$, where $R'$ is the n-dimensional
Euclidean or spherical space, or Hilbert space. Suppose that there is given a positive
number $\epsilon$ and a correct open covering $\omega$ of $R$ which is 'fine enough', in the sense that
$f(x)$ and $f(y)$ are sufficiently close (in terms of $\epsilon$) whenever $x$ and $y$ belong to the
same element of $\omega$ (Hurewicz & Wallman 1941, p. 63). Choose in each element
$o_i \in \omega$ a point $x_i$ and take the vertices $e_i$
of the nerve $C$ at or near the points $f(x_i)$. Define the functions $\mu_i(x)$ as above, and
denote by $f_\epsilon(x)$ the centre of the masses $\mu_i(x)$ placed in $e_i$. In this way a continuous
mapping $f_\epsilon$ of $R$ into the polytope $C$ is obtained (which is a geometric realization
of the nerve $C$ in the space $R'$). It is easily seen that for an arbitrary $x \in R$

$$\rho(f(x), f_\epsilon(x)) < \epsilon. \qquad (1)$$

Thus

Thus 

Approximation Theorem 4·4. To any $\varepsilon > 0$ and to any continuous mapping $f$ of
the $r$-dimensional normal space $R$ into the spherical or Euclidean or Hilbert space
$R'$ there exists a continuous mapping $f_\varepsilon$ of $R$ into an $r$-dimensional polytope imbedded
in $R'$, the mapping $f_\varepsilon$ being an $\varepsilon$-approximation of $f$ in the sense that (1) holds for
all $x \in R$. The polytope $\bar{C}$ may be chosen as the geometric realization of the nerve $C$
of any sufficiently fine correct open covering of $R$.
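
The construction behind the approximation theorem (vertices placed at the points $f(x_i)$, weights $\mu_i(x)$, centre of masses) can be tried numerically. The sketch below is a hedged illustration only: the space $R = [0, 1]$, the mapping `f` onto a circle in the plane, the covering by intervals of half-width `h`, and the choice of $\mu_i(x)$ as the normalised distance from $x$ to the complement of $o_i$ are all assumptions made for the example, not the paper's data.

```python
import math

EPSILON = 0.1

def f(x):
    """Hypothetical mapping of R = [0, 1] into the Euclidean plane (a circle)."""
    return (math.cos(2 * math.pi * x), math.sin(2 * math.pi * x))

def make_approximation(num_intervals):
    h = 1.0 / num_intervals
    centres = [i * h for i in range(num_intervals + 1)]   # chosen points x_i
    vertices = [f(c) for c in centres]                    # vertices e_i = f(x_i)

    def f_eps(x):
        # mu_i(x): distance from x to the complement of o_i = (x_i - h, x_i + h);
        # dividing by the total makes the weights sum to 1 (an assumed choice).
        weights = [max(0.0, h - abs(x - c)) for c in centres]
        total = sum(weights)
        px = sum(w * v[0] for w, v in zip(weights, vertices)) / total
        py = sum(w * v[1] for w, v in zip(weights, vertices)) / total
        return (px, py)

    return f_eps

# f is Lipschitz with constant 2*pi, and x lies within h of every x_i that
# carries positive weight, so the error is below 2*pi*h = 0.0628... < EPSILON.
f_eps = make_approximation(100)
max_error = max(
    math.hypot(f(k / 1000)[0] - f_eps(k / 1000)[0],
               f(k / 1000)[1] - f_eps(k / 1000)[1])
    for k in range(1001)
)
print(max_error < EPSILON)
```

The image of `f_eps` lies on the polygon with vertices $f(x_i)$, a 1-dimensional polytope, in agreement with $r = 1$ for this choice of $R$.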

From this approximation theorem there follows immediately:

Lemma 4·7. For $r < n$ any continuous mapping of an $r$-dimensional normal space
into the $n$-dimensional sphere $S^n$ is inessential; thus, any two such mappings are
homotopic to each other.

Furthermore, we shall prove by the methods of Alexandroff & Hopf (1935,
pp. 374-375):

Lemma 4·8. If $f$ is an essential mapping of the normal space $R$ on the closed
simplex $T$, then there exists an $\varepsilon > 0$ such that any mapping of $R$ into $T$ differing
from $f$ by less than $\varepsilon$ covers a fixed simplex concentric with $T$. Thus for $r < n$ there
is no essential mapping of an $r$-dimensional space on an $n$-dimensional closed simplex.

Proof of Lemma 4·8. Let $f$ be an essential mapping of the normal $R$ on the closed
spherical region $E^n$ of radius 1 and centre $o$. Denote by $e^n$, $E'^n$ the spherical regions
of radii 1/3, 2/3, respectively, and the same centre $o$; the boundaries of $E^n$, $E'^n$, $e^n$
are the spheres $S$, $S'$, $s$. Denote finally by $H$ the closed region between the spheres
$S$ and $S'$, and set $F = f^{-1}(H)$.

Let $f_1$ be any mapping of $R$ in $E^n$ differing from $f$ by less than 1/3. Then for $x \in F$ the
rectilinear segment $f(x)f_1(x)$ has no points in common with $e^n$.

Now define the mapping $f'$ as follows. For $x \in F$ draw the line $of(x)$ and take the
portion $\delta$ of this line lying between $S$ and $S'$; the segment $\delta$ is considered as directed
from $S$ to $S'$ (i.e. towards the centre $o$). Now $f'(x)$ is defined as the point which divides
the segment $f(x)f_1(x)$ in the same ratio as $f(x)$ divides the segment $\delta$.

For $x \in R \setminus F$ one simply sets $f'(x) = f_1(x)$. This definition implies that for $x \in f^{-1}(S)$

$f'(x) = f(x)$,

and hence, since $f$ is essential, $f'(R) = E^n$.

But if $f'(x) \in e^n$, then certainly $x \in R \setminus F$, and therefore $f_1(x) = f'(x)$. Consequently
$f_1(R) \supseteq e^n$, q.e.d.

We now return to lemma 4·4 and consider an $(\omega, C)$-mapping of $R$ in the nerve $C$
of $\omega$. With Hurewicz & Wallman (1941, p. 73) we find a mapping $f$ of $R$ on to a
certain $\bar{C}'$, $C' \subseteq C$, which is quasi-barycentric (i.e. the inverse image of any open star
of $C'$ is contained in some element of $\omega$) and minimal in the sense that there is no
quasi-barycentric mapping of $R$ into a $\bar{C}''$ with $C'' \subset C'$. This implies (by virtue of
my 'sweeping out' reasoning (Alexandroff & Hopf 1935, p. 365)) that every $n$-dimensional
simplex of $C'$ is essentially covered by the image of $R$ under $f$. Hence

Theorem 4·5. The dimension of a normal space $R$ is, if finite, the greatest among
those integers $n$ which allow an essential mapping of $R$ on to an $n$-dimensional
closed simplex.





Remark. There are sets $M$ lying in a normal (and even bicompact) $R$ and having a
dimension greater than that of $R$; if $M$ is closed in $R$, then necessarily $\dim M \le \dim R$.
A closed $M$ in a normal $R$ being always normal, we are led to the following problem:

Problem. Let $M \subset R$ be normal (if taken with the 'natural' topology induced by
the topology of $R$). Does then the inequality $\dim M \le \dim R$ necessarily hold?

In any case the following definition will be useful to us:

Definition. Let $M$ be a subset of the normal $R$. The greatest among all numbers
$\dim E$, where $E \subseteq M$ is closed in $R$, is called the relative dimension of $M$ with respect
to $R$ and is denoted by $\mathrm{rd}_R M$.

Remark. If $R$ is bicompact then $\mathrm{rd}_R M$ is equal to the greatest among all $\dim E$,
where $E$ is any bicompactum lying in $M$. Thus in this case $\mathrm{rd}_R M$ is an absolute
invariant of $M$ and may be denoted simply by $\mathrm{rd}\, M$.

With the notion of the relative dimension a theorem of Hurewicz & Wallman (1941,
their corollary on p. 84) may be generalized as follows:

Theorem 4·6. Let $E$ be closed in the normal $R$. If $\mathrm{rd}_R (R \setminus E) \le n$, then every
mapping of $E$ into the $n$-dimensional sphere $S^n$ can be extended over $R$.

Proof. Let $f$ be the given mapping of $E$ into $S^n$. By the Brouwer-Urysohn extension
theorem, $f$ can be extended over a neighbourhood $U$ of $E$ to a certain mapping
$f'$. Take neighbourhoods $U_1$ and $U_2$ of $E$ satisfying the conditions

$E \subseteq U_1 \subseteq \overline{U}_1 \subseteq U_2 \subseteq \overline{U}_2 \subseteq U$

(the bar denoting closure in $R$), and consider the function $f'$ on $\overline{U}_2 \cap (R \setminus U_1)$. The latter set is closed in $R \setminus U_1$,
which is closed in $R$, lies in $R \setminus E$, and is therefore at most $n$-dimensional. Thus, by theorem 4·5,†
the mapping $f'$ of $\overline{U}_2 \cap (R \setminus U_1)$ into $S^n$ is extendable to a mapping $f''$ of $R \setminus U_1$. Setting

$f'''(x) = f'(x)$ on $\overline{U}_2$,

$f'''(x) = f''(x)$ on $R \setminus U_1$,
the desired extension of $f$ is found.

The theorem just proved will play an important role in the last section of this
paper.

To finish the present section the following proposition will be proved, which is
important for many deformation-theorems and in particular for the Hopf classification
theorem as well as for the theorem on Cantor manifolds:

Theorem 4·7. The topological product $[R, I]$ of an $n$-dimensional bicompactum
$R$ and the rectilinear segment $I = \langle 0; 1 \rangle = \{0 \le t \le 1\}$ is of dimension $\le n + 1$.

Before proceeding to the proof of this theorem I shall call the attention of the
reader to two questions which naturally arise in connexion with it:

(1) Is it possible to strengthen the inequality $\dim [R, I] \le n + 1$ to the equality
$\dim [R, I] = n + 1$?

† Here theorem 4·5 is used in its following form which is immediately equivalent to the
previous one:

Theorem 4·5⁰. $\dim R \le n$ holds if, and only if, every mapping of every closed $E \subset R$ into
$S^n$ can be extended over $R$.




(2) Does theorem 4·7 remain true for an arbitrary normal space $R$?

Proof of Theorem 4·7. Take an open covering $\Omega = \{O_j\}$ of the bicompactum
$[R, I]$. It will be proved that $\Omega$ has a closed refinement of dimension $\le n + 1$.

To this end, choose for any point $(x, t) \in [R, I]$ an element $O_{x,t}$ of $\Omega$ containing
$(x, t)$, and take an open set $o_{x,t} \subseteq R$ and an interval $\gamma_{x,t} \subseteq I$ in such a way that

$(x, t) \in [o_{x,t}, \gamma_{x,t}] \subseteq O_{x,t}$.

The point $x$ being considered as fixed, choose a finite number of such products
$[o_{x,t}, \gamma_{x,t}]$ covering the rectilinear segment $[x, I] \subseteq [R, I]$. Let these be

$[o_{x,t_1}, \gamma_{x,t_1}], \dots, [o_{x,t_r}, \gamma_{x,t_r}]$.

Set $o_{x,t_1} \cap \dots \cap o_{x,t_r} = o_x$.

Then for any $t$, $0 \le t \le 1$, $[o_x, t]$ is contained in some element of $\Omega$.

Now, considering $x \in R$ as variable, choose a finite number of the $o_x$ just defined
so as to obtain a covering

$\omega = \{o_1, \dots, o_s\}$

of $R$. This covering still has the property that for any $o_i$ and any $t$, $0 \le t \le 1$, $[o_i, t]$ is
contained in some element of $\Omega$. Let

$\varepsilon = \{e_1, \dots, e_s\}$

be a closed $n$-dimensional refinement of $\omega$. As before, each of the $[e_i, t]$ lies in some
$O_j \in \Omega$. A positive $\delta$ can be found in such a way that for any interval $\eta \subset I$ of length
$< \delta$ and an arbitrary $e_i \in \varepsilon$ the point set $[e_i, \eta]$ in $[R, I]$ is contained in some $O_j \in \Omega$.
Now denote by

$\sigma = \{0, b_1, \dots, b_l, 1\}$

a subdivision of the segment $I = \langle 0, 1 \rangle$ into consecutive segments

$\langle 0, b_1 \rangle, \langle b_1, b_2 \rangle, \dots, \langle b_l, 1 \rangle$

of length $< \delta$. Take $s$ such subdivisions

$\sigma_i = \{0, b^i_1, \dots, b^i_l, 1\}$

under the sole condition that for $i \ne j$ and arbitrary $h$, $k$, the numbers $b^i_h$ and $b^j_k$ are always unequal.
Denote by $\eta^i_0$, $\eta^i_k$ (with $k = 1, \dots, l - 1$), $\eta^i_l$ the segments $\langle 0, b^i_1 \rangle$, $\langle b^i_k, b^i_{k+1} \rangle$, $\langle b^i_l, 1 \rangle$
respectively and set

$E_{ik} = [e_i, \eta^i_k]$, $i = 1, \dots, s$; $k = 0, 1, \dots, l$.

These $E_{ik}$ form a closed covering $E$ of $[R, I]$ which is a refinement of $\Omega$. It will
now be proved that the dimension of the covering $E$ is $\le n + 1$. To do this, first
note that

(1) $E_{ih} \cap E_{ik} \ne \Lambda$ only in the case $|h - k| \le 1$.

(2) If $(x, t) \in E_{ih} \cap E_{i,h+1}$, then $t = b^i_{h+1}$.

(3) If $i_1, \dots, i_{n+2}$ are all distinct, then for an arbitrary choice of $h_1, \dots, h_{n+2}$,

$E_{i_1 h_1} \cap \dots \cap E_{i_{n+2} h_{n+2}} = \Lambda$.




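Now let $(x, t) \in E_{i_1 h_1} \cap \dots \cap E_{i_{n+3} h_{n+3}}$, the $n + 3$ sets being distinct. Then at least two among the $i$'s, say
$i_1$, $i_2$, are equal, $i_1 = i_2 = i$ and $t = b^i_{h+1}$, where $h = h_1$, $h_2 = h + 1$, and

$(x, t) = (x, b^i_{h+1}) \in E_{ih} \cap E_{i,h+1} \cap E_{i_3 h_3} \cap \dots \cap E_{i_{n+3} h_{n+3}}$.

But $b^i_{h+1}$ is different from all the $b^{i_3}_k, \dots, b^{i_{n+3}}_k$, and therefore the numbers $i_2, \dots, i_{n+3}$
are all distinct. Thus the $n + 2$ point-sets

$E_{i_2 h_2}, \dots, E_{i_{n+3} h_{n+3}}$

have a non-empty intersection, in contradiction to assertion (3). This completes
the proof of theorem 4·7.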
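
The role of the condition that the points of division of different subdivisions $\sigma_i$ are unequal can be checked on a small example. The Python sketch below is an illustration under assumptions that are not in the paper: $R = [0, 1]$ (so $n = 1$) with a closed covering by `s` segments of order 2, and a particular arithmetic choice of the division points; the requirement that the segments have length $< \delta$ is not modelled. It counts the largest number of sets $E_{ik}$ with a common point.

```python
from fractions import Fraction
from itertools import product

def closed_cover(s):
    """Closed covering of R = [0, 1] by s segments; at most 2 of them share a point (n = 1)."""
    return [(Fraction(i, s), Fraction(i + 1, s)) for i in range(s)]

def subdivision(i, s, l, staggered):
    """Subdivision {0, b_1, ..., b_l, 1} of I attached to the i-th element e_i."""
    # With staggering, different i give different fractional shifts, so no
    # interior division point is shared by two subdivisions.
    shift = Fraction(i + 1, s + 1) if staggered else Fraction(0)
    return [Fraction(0)] + [(k + shift) / (l + 1) for k in range(1, l + 1)] + [Fraction(1)]

def prism_cover(s, l, staggered):
    """The sets E_ik = [e_i, eta_k^i], stored as pairs of closed intervals."""
    boxes = []
    for i, e in enumerate(closed_cover(s)):
        cuts = subdivision(i, s, l, staggered)
        boxes.extend((e, eta) for eta in zip(cuts, cuts[1:]))
    return boxes

def order(boxes):
    """Largest number of boxes with a common point.

    If some boxes meet, they all contain the point whose coordinates are the
    largest left end-points among them, so testing such points is enough.
    """
    xs = {e[0] for e, _ in boxes}
    ts = {eta[0] for _, eta in boxes}
    return max(
        sum(e[0] <= x <= e[1] and eta[0] <= t <= eta[1] for e, eta in boxes)
        for x, t in product(xs, ts)
    )

staggered_order = order(prism_cover(3, 3, staggered=True))
common_order = order(prism_cover(3, 3, staggered=False))
print(staggered_order, common_order)
```

With unequal division points at most $n + 2 = 3$ sets meet, so the covering has dimension $n + 1 = 2$; with one common subdivision four sets meet at a corner, which is why the proof insists on distinct $b^i_h$.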

Corollary. Let $R$ be a bicompactum and $M$ a point-set in $R$ with $\mathrm{rd}\, M \le m$.
Then, for $R' = [R, I]$ and $M' = [M, I] \subseteq R'$,

$\mathrm{rd}\, M' \le m + 1$.

Proof. Let $E' \subseteq M'$ be closed in $R'$ (and therefore a bicompactum). Denote by $E$
the projection of $E'$ in $R$. Then, since $E'$ is closed in $[E, I]$,

$\dim E' \le \dim [E, I] \le m + 1$, q.e.d.

5. PROOF OF THEOREMS 2·1, 2·2, 4·2 (FOR BICOMPACTA)

First some preliminary remarks. 

Lemma 5·1. Let $E$ be closed in the space $R$. Let $f_0$ and $g_0$ be two homotopic
mappings of $E$ into the $n$-sphere $S^n$. Then, if there exists an extension $f$ of $f_0$ over $R$,
there also exists an extension $g$ of $g_0$ over $R$ with $f$ and $g$ homotopic.

This is 'Borsuk's theorem' or 'Theorem VI. 5' of the book of Hurewicz & Wallman;
their proof, given for the metric case, is literally applicable to our general case.

Theorems belonging to the topology of polytopes so far as they are needed in this
paper are supposed to be known. Among these the attention of the reader is called
to the invariance theorem concerning the Betti groups. The interpretation of this
theorem for $\nabla$-groups is as follows. Let $C$ be a simplicial decomposition of the polytope
$\bar{C}$. Then $C$ is the nerve of the open covering $\sigma$ of $\bar{C}$ whose elements are the open
stars of $C$. Therefore, any $\nabla$-cycle on $C$ is an element of a $\nabla$-class of $\bar{C}$. Conversely
(and this is the meaning of the invariance theorem) each $\nabla$-class of $\bar{C}$ contains
$\nabla$-cycles of the complex $C$, and two $\nabla$-cycles of $C$ are homologous on $C$ to each other
if and only if they belong to the same $\nabla$-class of $\bar{C}$. Thus there is a fixed (1:1)-correspondence
between the elements of $\nabla^r_C$ and those of $\nabla^r_{\bar{C}}$, this correspondence
being an isomorphism.

Now let C be the nerve of a correct open covering w of the bicompactum B. Let 
C be realized geometrically as a simplicial decomposition of a polytope G and 
consider any («, 0)-mapping f oi B into G. Let us consider what is the homo- 
morphismt H* of V^ = VJ, in Vjj. To this end let ^ be an element of Wg, and sfef 
a V-oycle on C. 

t The homomorphisms Hf, J% etc., were defined in §2. 




Since the complex C is at the same time the nerve of cr and the nerve of w, and/is 
an {(x), C')-mapping, the elements of co are the inverse images under / of the-elements 
of cr\ therefore by the definition of the homomorphism H*, H* f is nothing but the 
V-class of it to which belongs as element. Thus the following has been proved; 

Lemma 5-2. Let/be an {(o, <7)-mapping of the bicompactum B into the polytope 
O, whose simphcial decomposition C is the nerve of w. To obtain the homomorphism 
JET/ of in Vjj one has to take in any V-class g*" e Vg a V-cycle z*" of (7; the V-class 
of B containing z*" is the element H* 

Proof of the Hopf extension theorem. (1) Necessity. Let $f$ be an extension (over the
bicompactum $R$) of the mapping $f_0$ of the closed set $E \subseteq R$ into the bicompactum $R'$.
Then on $E$

$f_0(x) = f(J(x))$,

where $J$ is the identical mapping of $E$ into $R$. This implies that for any element

H^z* = 

In other words: each element of V^ which is an image under (of an element z* 
of VJp) is an image under (of the element Hf z* e A^), i.e. is extendable. Hence 
it follows, for B = S'^, that the degree of/o is extendable (to the degree of/). 

(2) Sufficiency. Let dim B<,n+1, E‘=^R and let / be a mapping of E into 8^ with 
degree d e suppose d extendable to g e 

d ~ 

One can easily see that there exists an open coveidng 

a* = {0f,...,0*} ofjR 

with the following properties: 

(а) There is a g„. e V”. in g, i.e. g^. £ g. 

(б) There is for any 0^ at least one vertex e of (the fixed simplicial decomposition 
of) such that/(jE/ n OJ") is contained in the open star Oe of e. Denote by a fixed 
one among these vertices e. 

(c) The covering a,* is correct and of dimension < -f 1. Denote by a the covering 
of E whose elements are the (non-empty) intersections EnO* (remember that 
these are the inverse images of the 0* imder the identical mapping 1 of E into B). 

Let be the simplicial mapping of into the complex S”’ determined by cor¬ 
relating the vertex e^ to {E c Of). Let e VJf be the degree of and Zq the V-class 
of E containing g* (as subset). I say that go = d. To prove this take the funda¬ 
mental V-cycle x'^ of 8”-, define on (7„ by its values on the elements tfeC,^: if the 
vertices of 4” correspond to {E n Of), ...,{E r\ 0*,), we put 

{z^.t^) = x^{ei^...ei^). 

Obviously z® e z„. 

Now any Eo Of is contained in ga^Oei); thus a is a refinement of cu = {gaHOCi)} 
and (E n Of)-s~g~\Oei) is a projection. Therefore z”€d and hence g^sd- 




Now 

ia* — h ia — ^’ 

and Jfj is by definition the V-class of E containing 5„». As 3 a€d = 
then (for an appropriate fi* > a* and ^ = Eft*) 

or (as Jf commutes with n^) 

Eeplacing our a by this y? and setting again a for y? it may be supposed that fronoi the 
beginning 

^a ia* ^ Sa> 

i.e. that is an extension of 

Now take the pol 3 rtopes and ^ C^* and consider as a mapping of 0^ into 
S^, As ga* is an extension of g^, one has by the Hopf extension theorem for poly¬ 
topes an extension (over C^*) with degree g^* of the continuous mapping g^. 

Let be an (a*, (7a*)“mapping of B into (7^*; then 6^* by its very construction 
maps E into and 

9{^) = 9a{W) (^€JEf), 

is a mapping of E into 

9*{^) = 9AK*{^)) 

is an extension of g over R, 

The degree of gr* is the image of the degree g^* € of g^* under the homo¬ 
morphism (of in V^) induced by But in virtue of lemma 6-2 and of 
ga*^g this image is nothing but g. Moreover, one concludes easily from the 
defimtions of g^ b^* and g^ that for any xeE both/(a?) and g{x) lie on some closed 
simplex of the complex S'^, whence it follows that/ and g are homotopic. As g admits 
an extension over i?, namely, gr*, the mapping/admits (by lemma 5-1) an extension 
/* which is homotopic to g^* and thus has the degree q.b.b. 

From this theorem there follows as an easy corollary one of the assertions of the
Hopf classification theorem, namely:

Corollary. Let the bicompactum $R$ be of dimension $\le n$. Corresponding to each
element $\mathfrak{z}^n$ of $\nabla^n_R$ there exists a mapping of $R$ in $S^n$ with the degree $\mathfrak{z}^n$.

Proof. Denote by $E$ a closed subset of $R$ containing one point only. Then the
only element of $\nabla^n_E$ is the zero element; this element is of course extendable, and
any element of $\nabla^n_R$, in particular the element $\mathfrak{z}^n$, is an extension of zero. Therefore,
by the Hopf extension theorem, a mapping of $E$ into $S^n$ (which correlates to the
unique point of $E$ an arbitrary (but fixed) point of $S^n$) can be extended to a mapping
$f$ of $R$ into $S^n$ with the degree $\mathfrak{z}^n$.

The second part of the Hopf classification theorem is contained in theorem 2·2⁰.
There remains the third part:

Two mappings of the at most $n$-dimensional bicompactum $R$ into $S^n$ are homotopic
if they have the same degree.




The proof of this statement follows from the Hopf extension theorem and theorem
4·7 (see Hurewicz & Wallman 1941, pp. 149-150).

We now proceed to the proof of 

Theorem 5·1. If the dimension of the bicompactum $R$ is $n$, then $R$ contains a
canonical closed set $E$ such that there exists a $\mathfrak{z}^{n-1} \in \nabla^{n-1}_E$ which is not extendable
over $R$; on the other hand, for $r \ge n$ and an arbitrary choice of the closed set $E \subseteq R$
and of $\mathfrak{z}^r \in \nabla^r_E$, the element $\mathfrak{z}^r$ is always extendable over $R$.

In other words:

Among all finite-dimensional bicompacta the $n$-dimensional ones are completely
characterized by the property: the number $n - 1$ is the greatest among those numbers
$r$ for which there exists a closed $E \subset R$ and a $\mathfrak{z}^r \in \nabla^r_E$ non-extendable over $R$.

The proof is evident: By theorem 4·5⁰ there is a mapping $f$ of a closed $E \subseteq R$ into
$S^{n-1}$ which is not extendable over $R$; by the Brouwer-Urysohn extension theorem,
$E$ can be supposed canonical; the degree of the mapping $f$ is a non-extendable
element of $\nabla^{n-1}_E$. On the other hand, let a closed $E \subseteq R$ and $\mathfrak{z}^r \in \nabla^r_E$ ($r \ge n$) be
chosen arbitrarily. Take the confinal part of the spectrum of the open coverings of
$R$ consisting of all $n$-dimensional coverings and consider among the $\nabla$-cycles belonging to $\mathfrak{z}^r$ only
those which lie on the nerves of the elements of this confinal part. Since,
for $r > n$, all these are zero, it follows that $\mathfrak{z}^r = 0$, and $\mathfrak{z}^r$ is extendable. For
$r = n$, these cycles (being $n$-dimensional chains on $n$-dimensional complexes)
are $\nabla$-cycles of the complexes themselves, and $\mathfrak{z}^n$ is again extendable.

We shall now put theorem 5* 1 in some different forms. For this purpose notations 
introduced in Alexandroff (1943) will be used. Hence 

If R is an ?^-dimensional bicompactum, then there exists a closed E^R such that 
+i.e. or, what is the same, A|t 7^4=0, while, on the other hand, 

for any closed Ec:R and r^n V^e:r = ^ i^E:R = ^)* 

The "A-form’ of the last theorem means: 

There is, in the Ti-dimensional bicompactum i?, a closed E<=-R and a thread 
= {Qr^} € A|ri which ‘does not bound on E' (in the sense that is not the 
zero element of Ajgf^), but ‘bounds on in the sense that if the covering aoiE has 

the form a = J5a*, a* being a covering of i?, each of the cycles ^ bounds 
on a*. On the other hand, there is, for r > no e A^ which is different from zero 
and bounds on R (and, as follows from the proof of theorem 5-1, for T>n, no 
different from zero at all). 

In these theorems the Betti groups may be defined by means of any of the spectra 
(1)“(5) of § 2 or even by means of a confinal part of any of them. In particular, we 
can use for R the exact spectrum Z and E for the transitive spectrum EZ whose 
elements are the nerves of the coverings a of JS? of the form a = -Ba*, where a* is an 
exact covering of i?. This allows to speak about the true cycles on R and on E 
respectively: a true cycle on R (on E) is a thread where the cycles 

on a* (on jBa*). The cycle = {z^^ on E bounds (on E) if each bounds on Ea*; 
the cycle z^ = {zQ (on E) bounds on iJ, if each bounds on a*. These definitions 





allow us to give the following form to theorem 5·1 which exactly corresponds to
the original form of the fundamental dimension theorem as given in my paper
(1932) for compacta ('2. Hauptsatz' on p. 195 of that paper):

An $n$-dimensional bicompactum $R$ necessarily contains a closed set $E$ and on it
an $(n-1)$-dimensional true cycle which bounds on $R$ but does not bound on $E$. On
the other hand, for any $r \ge n$ and any closed $E \subseteq R$, each $r$-dimensional true cycle
on $E$ bounds on $E$, provided that it bounds on $R$.

The third pair of isomorphisms of my paper ( 1943 ) will now be applied, i.e. the 
isomorphisms 

Let B be an M.-dimensional bicompactum. According to theorem 5T choose the 
closed <= jj so as to have V|. 7 ^=|= 0. Then by the above isomorphism 
and since S ^rs.e ® fortiori 0 . Dually ^■rse’^ 0 . 

Taking a confinal part made np of n-dimensional complexes (e.g. in the spectrum 
of the open coverings of R), one sees at once that for r> 9 ^ and any choice of the 
closed set jB Cl jR the groups ^r\e zero. Thus theorem 4-2 is completely 

proved in both its (V- and A-) forms. 

Remark. In all formulations of theorem 5·1 as well as in theorem 4·2 the closed
set $E$ can be supposed canonical.


6. NORMAL SPACES. PROOF OF THEOREM 4·1

A bicompactum $bR$ is called an extension of the space $R$ if $R$ is an everywhere
dense subset of $bR$, i.e.

$R \subseteq bR$, $(bR)R = bR$

(here $(bR)R$ denotes the closure of $R$ in $bR$).

Among the bicompact extensions of $R$ there is one called the Čech extension or the
maximal extension $\beta R$. This extension is completely characterized by the property:
$\beta R$ can be mapped on to any $bR$ in such a way that every point of $R$ is a fixed point
of this mapping (and that no point of $\beta R \setminus R$ is carried into a point of $R$).

I have shown (Alexandroff 1939; see also Fomin 1943) that $\beta R$ may be obtained
from $R$ by the following very intuitive construction:

Construction of $\beta R$. A system $S$ of subsets of a given set $M$ is called centred
if any finite number of the sets has a non-empty intersection. A system $S$ of open
sets $H_\alpha$ of the space $R$ is called regular if to each $H_\alpha \in S$ an $H_\beta \in S$ may be found in
such a way that the closure of $H_\beta$ in $R$ is contained in $H_\alpha$. A centred regular system $S$ of open sets $H_\alpha \subseteq R$ is called
an end (of the space $R$) if there is no centred regular system (of open sets of $R$)
different from $S$ and containing $S$ as a subsystem. The ends $x = \{H_\alpha\}$ of $R$ are by
definition the points of $\beta R$; to get the inclusion $R \subset \beta R$ any point $x \in R$ will be identified
with the end composed of all open sets of $R$ containing the point $x$. The open
sets $H_\alpha$ which are elements of an end $x$ are called the coordinates of the point $x \in \beta R$.
To each open $H \subseteq R$ there corresponds the set $O_H \subseteq \beta R$ of all points $x \in \beta R$ such
that $H \in x$ (in other words: $O_H$ is the set of all points $x \in \beta R$ having $H$ among their
coordinates). Open sets in $\beta R$ are defined as the sets $O_H$, together with all their
sums. In this way $\beta R$ is topologized. The same topology may obviously be obtained
by defining $\beta R$ as a 'neighbourhood space', where a neighbourhood of the point
$x = \{H_\alpha\}$ is any set of the form $O_{H_\alpha}$.

The space $\beta R$ can also be defined by the following construction due to Wallman
(1938). Points of $\beta R$ are defined as maximal centred systems of closed sets $\varphi_\alpha \subseteq R$,

$x = \{\varphi_\alpha\}$

('maximal' means again that $x$ is not a proper subsystem of a centred system of
closed sets of $R$). A point $x \in R$ is in this definition identified with the system of all
closed $\varphi$ containing the point $x$. In this way again $R \subset \beta R$. If $\varphi$ is any closed
set in $R$ then $E_\varphi$ is by definition the set of all $x \in \beta R$ with $\varphi \in x$. Obviously $\varphi \subseteq E_\varphi$.
The topology in $\beta R$ is now introduced by the definition of closed sets in $\beta R$ as the
sets of the form $E_\varphi$ (with $\varphi$ closed in $R$), together with all intersections of such sets.
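
For a finite space the definition of points as maximal centred systems of closed sets can be checked by brute force. The following Python sketch is a toy illustration only: the three-point discrete space `POINTS` and the function names are assumptions introduced here, and the finite case says nothing about the new points that appear for non-bicompact $R$. It enumerates all maximal centred systems and confirms that each is the system of all closed sets containing one point.

```python
from itertools import combinations

POINTS = (0, 1, 2)  # hypothetical three-point discrete space: every subset is closed

closed_sets = [frozenset(c) for r in range(len(POINTS) + 1)
               for c in combinations(POINTS, r)]

def is_centred(system):
    """Every finite subsystem has a non-empty intersection.

    The systems considered here are finite, so it suffices to intersect
    all members at once.
    """
    if not system:
        return True
    return bool(frozenset.intersection(*system))

def maximal_centred_systems():
    systems = []
    for r in range(1, len(closed_sets) + 1):
        for members in combinations(closed_sets, r):
            system = frozenset(members)
            if not is_centred(system):
                continue
            # Maximal: adding any further closed set destroys centredness.
            if all(not is_centred(system | {extra})
                   for extra in closed_sets if extra not in system):
                systems.append(system)
    return systems

systems = maximal_centred_systems()
point_systems = {frozenset(a for a in closed_sets if p in a) for p in POINTS}
print(len(systems), set(systems) == point_systems)
```

In this finite example the construction returns exactly the points of the space, as one expects for a space that is already bicompact.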

Theorem 6·1⁰. If we put in correspondence with each canonical closed set $A$
of the space $R$ its closure $(\beta R)A$ in $\beta R$, a (1:1)-correspondence between the
system of all canonical closed sets of $R$ and the system of all canonical closed sets
of $\beta R$ is obtained; this (1:1)-correspondence is an intersection-isomorphism, i.e.

(1) $A_1 \cap \dots \cap A_r = \Lambda$ implies $(\beta R)A_1 \cap \dots \cap (\beta R)A_r = \Lambda$, and

(2) $(\beta R)A_1 \subseteq (\beta R)A_2$ implies $A_1 \subseteq A_2$.

Proof. We first prove (1). Let $E_i = E_{A_i}$. Since $E_i \supseteq A_i$ is closed, then
$A_i \subseteq (\beta R)A_i \subseteq E_i$; thus (1) is contained in

(3) $A_1 \cap \dots \cap A_r = \Lambda$ implies $E_1 \cap \dots \cap E_r = \Lambda$.

Now $x \in E_1 \cap \dots \cap E_r$ means that $A_1, \dots, A_r$ are among the coordinates of $x$, thus
$A_1 \cap \dots \cap A_r \ne \Lambda$.

To prove (2) suppose that $A_1 \setminus A_2$ contains a point $a_1$. The closed sets $a_1$ and $A_2$
(in $R$) are disjoint; thus the same is true for $a_1 = (\beta R)a_1$ and $(\beta R)A_2$, i.e. the point
$a_1 \in (\beta R)A_1$ does not belong to $(\beta R)A_2$, and (2) is established. By the same
argument, to any distinct $A_1$, $A_2$ there correspond distinct $(\beta R)A_1$, $(\beta R)A_2$.

The hypothesis that the sets $A$ are canonical has not yet been used: we have
simply proved that the operator $(\beta R)$ maps isomorphically the system of all closed
sets of $R$ into the system of all closed sets of $\beta R$. It remains to show that:

(1) The operator $(\beta R)$ transforms canonical closed sets of $R$ into canonical
closed sets of $\beta R$.

(2) Any canonical closed set of $\beta R$ is the image under $(\beta R)$ of a canonical closed
set of $R$.

Proof of (1). Lemma. Every open $H \subseteq R$ is dense in the corresponding $O_H$.

In fact, let $x = \{H_\alpha\} \in O_H$. Take any neighbourhood $O_{H'}$ of $x$. From $H' \in x$, $H \in x$
it follows that $H \cap H' \ne \Lambda$; the points of $H \cap H'$ being points of $H$ belonging to $O_{H'}$,
the lemma is proved.



Now let $A$ be canonical closed in $R$; this means that $A = RU$ for an open $U \subseteq R$, where $RU$ denotes the closure of $U$ in $R$. Since $U$ is dense
in $O_U$,

$(\beta R)U = (\beta R)O_U$.

But $(\beta R)U = (\beta R)(RU) = (\beta R)A$; thus $(\beta R)A = (\beta R)O_U$ is canonical.

Proof of (2). Let $A = (\beta R)U$ be canonical closed in $\beta R$; suppose that $U$ is open
canonical in $\beta R$. Let

$u = U \cap R$.

Since $u$ is dense in $U$, then

$A = (\beta R)U = (\beta R)u = (\beta R)(Ru)$;

as $Ru$ is canonical in $R$, the assertion (2) and thus theorem 6·1⁰ are completely
proved.

Theorem 6·1⁰ may be formulated as follows:

Theorem 6·1. The normal space $R$ and its maximal extension $\beta R$ have the same
spectrum.

Since the spectrum of a bicompactum is a Hausdorff spectrum (Kurosch 1935),
there follows

Corollary 6·1. The spectrum of a normal space is a Hausdorff spectrum.

The space defined by the spectrum of a bicompactum is (as shown by Kurosch
1935) this bicompactum; this, together with theorem 6·1, gives a new and important
construction for the maximal extension of a space:

Theorem 6·2. The maximal extension $\beta R$ of a space $R$ is the bicompactum defined
by the spectrum of the space $R$.

Theorem 6·3. The operator $O_U$ applied to the canonical open sets $U$ of the space
$R$ realizes a (1:1)-correspondence between the system of all canonical open sets
of $R$ and the system of all canonical open sets of $\beta R$; this correspondence is an
intersection isomorphism:

(4) $O_{U_1} \cap \dots \cap O_{U_r} = \Lambda$ if and only if $U_1 \cap \dots \cap U_r = \Lambda$.

(5) $O_{U_1} \subseteq O_{U_2}$ if and only if $U_1 \subseteq U_2$.

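Proof. Lemma 2. For any canonical open set $U \subseteq R$:

(6) $O_U = \beta R \setminus (\beta R)(R \setminus RU)$.

Proof of Lemma 2. Let $x \in O_U$. Then $U \in x$. Now $(\beta R)(R \setminus RU)$ is the set of all
$x' = \{H_\alpha\}$ whose coordinates all intersect $R \setminus RU$; therefore $x \in \beta R \setminus (\beta R)(R \setminus RU)$.
Thus, the inclusion $O_U \subseteq \beta R \setminus (\beta R)(R \setminus RU)$ is proved. Let $x \in \beta R \setminus (\beta R)(R \setminus RU)$;
this implies the existence of an $H \in x$ not intersecting $R \setminus RU$, i.e. $H \subseteq U$,
since $U$ is canonical and $H$ is open. From $H \subseteq U$, $H \in x$ it follows that $U \in x$, $x \in O_U$, which proves
the inclusion $\beta R \setminus (\beta R)(R \setminus RU) \subseteq O_U$ and with it lemma 2.

Remark. For canonical $U$ the set $R \setminus RU$ is dense in $R \setminus U$, therefore

$(\beta R)(R \setminus RU) = (\beta R)(R \setminus U)$ and $O_U = \beta R \setminus (\beta R)(R \setminus U)$.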




Since the canonical open and the canonical closed sets are complementary to each
other, it follows from lemma 2 together with theorem 6·1⁰ that the operator $O_U$
realizes a (1:1)-correspondence between the canonical open sets of $R$ and those of
$\beta R$. In order to prove the isomorphic character of this correspondence we first
recall (see Alexandroff 1939) that

$O_{U_1} \cap \dots \cap O_{U_r} = O_{U_1 \cap \dots \cap U_r}$.

Furthermore, from $O_{U_1} \subseteq O_{U_2}$,

there follow $\beta R \setminus (\beta R)(R \setminus RU_1) \subseteq \beta R \setminus (\beta R)(R \setminus RU_2)$

and $(\beta R)(R \setminus RU_1) \supseteq (\beta R)(R \setminus RU_2)$,

or (by the last Remark) $(\beta R)(R \setminus U_1) \supseteq (\beta R)(R \setminus U_2)$.

Thus, since $R \setminus U_1$, $R \setminus U_2$ are canonical closed,

$R \setminus U_1 \supseteq R \setminus U_2$, i.e. $U_1 \subseteq U_2$.

This completes the proof of theorem 6·3.

As an immediate consequence of theorem 6·3 there follows

Corollary 6·3. $\dim \beta R = \dim R$.


(This result is known; see Wallman (1938), footnote 11 on p. 119.)

Since for spaces with the first countability axiom homeomorphism between $\beta R_1$
and $\beta R_2$ implies homeomorphism between $R_1$ and $R_2$ we deduce from theorem 6·1⁰:


Theorem 6·11. Any normal space $R$ satisfying the first countability axiom is
completely determined by the combinatorial structure of the system of its canonical
closed sets.

More precisely: Suppose that there is a multiplicative isomorphism between the
system of all canonical closed sets of $R_1$ and the system of all canonical closed sets
of $R_2$; then the spaces $R_1$ and $R_2$ are homeomorphic.

I do not know whether the analogous theorem is true for canonical open sets.
Moreover, it is still unknown whether theorem 6·11 remains valid without the
assumption of the first countability axiom.

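The following Remark is evident. Let $S = \{H_\alpha\}$ and $S' = \{H'_\alpha\}$ be systems of
open sets of the spaces $R$ and $R'$ respectively; a (1:1)-correspondence between $S$
and $S'$ is called for the moment a regularity-isomorphism, if it preserves relations
of the form $H_1 \cap \dots \cap H_r = \Lambda$ and of the form $RH_\alpha \subseteq H_\beta$ (the closure of $H_\alpha$ in $R$ lies in $H_\beta$).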


Now suppose that there exists a regularity-isomorphism between the system of all
open sets of $R$ and the system of all open sets of $R'$. Since a regularity-isomorphism
transforms in a (1:1)-manner the ends of the one space into the ends of the other, a
(1:1)-mapping of $\beta R$ on to $\beta R'$ is induced. Furthermore, this mapping transforms
sets of the form $O_H \subseteq \beta R$ into sets of the form $O_{H'} \subseteq \beta R'$ and vice versa, and is
therefore a homeomorphism between $\beta R$ and $\beta R'$. If $R$ and $R'$ satisfy the first
countability axiom, a homeomorphism between $\beta R$ and $\beta R'$ implies a homeomorphism
between $R$ and $R'$. Moreover, it is easily seen that any end is completely
determined by its canonical elements, and that open sets of the form $O_U$ ($U$
canonical) are sufficient for the definition of the topology in $\beta R$. Thus one obtains:

Theorem 6·4. Let the spaces $R$, $R'$ satisfy the first countability axiom. Denote
by $O$, $U$, $O'$, $U'$ the system of all open sets and of all open canonical sets of each of
these spaces. If there exists a regularity-isomorphism between $O$ and $O'$ or between
$U$ and $U'$, the spaces $R$ and $R'$ are homeomorphic.

Proof of Theorem 4·1. Let $\dim R = n$. Then $\dim \beta R = n$; thus by theorem 4·2
for bicompacta the spectrum of $\beta R$, i.e. the spectrum of $R$, has the structure described
in theorem 4·1, q.e.d.

Proof of Theorem 4·2 (for a normal $R$). Let $\dim R = \dim \beta R = n$. Then there
exists in $\beta R$ a canonical closed set $A$ such that $\nabla^n(\beta R \bmod A)$ is different from zero.
If we define the last group by means of the spectrum of the open canonical coverings
of $\beta R$ and write (in the notations of §§ 1 and 2)

$\nabla^n(\beta R \bmod A) = \lim (\nabla^n_\alpha, \pi^\beta_\alpha)$

with $\nabla^n_\alpha = \nabla^n(C_\alpha \bmod A_\alpha)$, $A_\alpha = AC_\alpha$, it is seen that the $C_\alpha$ may be regarded as
nerves of the open canonical coverings of $R$ as well as of $\beta R$. Let furthermore
$A = (\beta R)U$, $R \cap U = U'$, $A' = RU'$. Then $A_\alpha = A'C_\alpha$ (in the spectrum $C$ of the open
canonical coverings of $R$). Therefore the above group $\lim (\nabla^n_\alpha, \pi^\beta_\alpha)$ is at the same time
the group $\nabla^n(R \bmod A')$. Since $\lim (\nabla^n_\alpha, \pi^\beta_\alpha) = \nabla^n(R \bmod A')$ is seen to be different
from zero, theorem 4·2 is proved in its full generality.


7. CANTOR MANIFOLDS

Let $C = \{C_\alpha\}$ be a Hausdorff spectrum. This spectrum is called $n$-dimensional
if it satisfies the following condition:

There exists a maximal subspectrum $A$ of $C$ such that the $n$-dimensional Betti
group of $C \bmod A$ is different from zero, while for $r > n$ and for any choice of the
maximal subspectrum $A$ of $C$ the $r$-dimensional Betti group of $C \bmod A$ is the
zero group.

In this definition the infinite cyclic group $J$ is taken as the coefficient-group for
the $\nabla$-case. For the $\Delta$-case one has, of course, to take as coefficient-group the group
of all real numbers reduced mod 1; one can define in the same manner the dimension
of a spectrum with respect to any coefficient-group.

A maximal subspectrum $A$ of the $n$-dimensional spectrum $C$ is called an (abstract)
Cantor manifold lying on $C$, if, whatever be the representation of $A$ as the sum of
two maximal subspectra (no one of which coincides with $A$), the intersection of
the two subspectra is at least $(n-1)$-dimensional. If, in particular, $C$ is the spectrum
of the normal space $R$ then a Cantor manifold lying on $C$ is called an abstract Cantor
manifold lying on the $n$-dimensional space $R$.




Remark 1. It is unknown if a Cantor manifold lying on an Wr-dimensional spectrum 
(in particular on an 7i-dimensional space) is itself an n-dimensional spectrum. This 
seems to me to be an outstanding problem in the general dimension theory of 
spectra. 

In conformity with the well-known definition given by Urysohn more than twenty
years ago, let us now define a classical Cantor manifold lying on an n-dimensional
bicompactum R as a closed set $E \subseteq R$ having the property that for any representation
of E as the sum of two closed proper subsets, the intersection of these subsets is at
least (n-1)-dimensional.

Remark 2. It is again unknown whether a classical Cantor manifold lying on an
n-dimensional bicompactum R is itself an n-dimensional bicompactum: in the
metric case the dimension of R (defined as in § 4 by means of coverings) is equal to
the inductive dimension ind R (defined by the well-known Urysohn-Menger process
of induction beginning with dim A = -1 for the empty set A). Now in the case of
a bicompactum R it is only known that

$$\operatorname{ind} R \ge \dim R$$

(as I proved in 1941b), while it still remains unknown whether the inequality
$\operatorname{ind} R \le \dim R$ holds. Here again is an important problem of general dimension
theory.

It follows immediately from what was said at the end of § 3 that for an n-dimensional
bicompactum there is a (1:1)-correspondence between the abstract Cantor
manifolds lying on R and the classical ones. The purpose of this section is to prove
the following:

Theorem 7·1. On any n-dimensional normal space (n being finite), there lie
abstract Cantor manifolds.

This theorem is first proved for a bicompact R. The proof will closely follow the
proof given by Hurewicz & Wallman for the metric case.

Lemma 7·1. Let f and g be two mappings of the bicompactum R into the sphere
$S^{n-1}$. Suppose that the points in which $g(x) \ne f(x)$ form a subset $M \subseteq R$ of relative
dimension $\le n-2$. Then f and g are homotopic.

Proof. Following Hurewicz & Wallman, we define the closed set E in [R, I]
consisting of all points (x, 0) and (x, 1) with $x \in R$, and all points (x, t) with $x \in R \setminus M$
and $0 \le t \le 1$. (The set E is closed because M is obviously open.) The mapping
f(x, t) of E into $S^{n-1}$ is defined as follows:

$$f(x,t) = f(x) = g(x) \quad \text{for } x \in R \setminus M,$$
$$f(x,0) = f(x), \qquad f(x,1) = g(x).$$

The set $[R, I] \setminus E$ is contained in [M, I]. Thus by the corollary to theorem 4·7:

$$\operatorname{rd}([R,I] \setminus E) \le \operatorname{rd}[M,I] \le (n-2)+1 = n-1.$$

Therefore, by theorem 4·6, the mapping f(x, t) defined on E can be extended over
[R, I], and this proves the homotopy of g and f.





Lemma 7·2. Suppose that the bicompactum R is the sum of two closed subsets
$E_1$ and $E_2$. Suppose that $f_1$, $f_2$ are mappings into $S^{n-1}$ of $E_1$, $E_2$ respectively, and that
the points (in $E_1 \cap E_2$) in which $f_1(x) \ne f_2(x)$ form a set of relative dimension $\le n-2$.
Then $f_1$ can be extended over R.

Proof. The mappings $f_1$ and $f_2$ considered as mappings of the bicompactum
$E_1 \cap E_2$ are homotopic by lemma 7·1. Since $f_2$ (as a mapping of $E_1 \cap E_2$ into $S^{n-1}$)
is extendable over $E_2$ (to the mapping $f_2$) it follows from the Borsuk lemma 5·1 that
there is an extension $f_{12}$ of $f_1$ (considered on $E_1 \cap E_2$) over $E_2$. Defining

$$f(x) = f_1(x) \text{ on } E_1, \qquad f(x) = f_{12}(x) \text{ on } E_2,$$

we obtain the desired extension of $f_1$ over R.

Lemma 7·3. Let there be given a bicompactum R, a closed set E and a mapping
f of E into a sphere S. Suppose, furthermore, that there is in R a decreasing
well-ordered system of closed sets $E_\lambda$ (the $\lambda$ are finite or transfinite ordinals).
Suppose that f admits no extension over a set of the form $E \cup E_\lambda$; then f admits no
extension over $E \cup E'$, where $E' = \bigcap_\lambda E_\lambda$.

The easy proof of this lemma is as follows. Suppose that f is extendable over
$E \cup E'$. Then by the Brouwer-Urysohn extension theorem f can be extended over
a certain neighbourhood U of $E \cup E'$ in R. Since R is bicompact, all $E_\lambda$ with a
sufficiently great $\lambda$ are contained in U, thus f would be extendable over some
$E \cup E_\lambda$, contradicting our hypothesis.

From lemma 7·3 follows:

Lemma 7·4. Given a bicompactum R, a closed set $E \subseteq R$ and a mapping f of E in
a sphere S which cannot be extended over R, there is a closed $E' \subseteq R$ such that
f cannot be extended over $E \cup E'$ but, for any $E'' \subset E'$, admits an extension over
$E \cup E''$.

It is now easy to prove theorem 7·1 for a bicompactum R.

Let dim R = n. Then there exists a closed $E \subseteq R$ and a mapping f of E into $S^{n-1}$
which is non-extendable over R. By lemma 7·4 there is in R a closed $E'$ such that
f is extendable over $E \cup E''$ if $E'' \subset E'$, but is not extendable over $E \cup E'$. I say with
Hurewicz & Wallman that this $E'$ is a classical Cantor manifold on the bicompactum
R. Otherwise, let

$$E' = E'_1 \cup E'_2, \qquad E'_i \ne E', \qquad \dim(E'_1 \cap E'_2) \le n-2.$$

By the definition of $E'$ the mapping f can be extended over $E_i = E \cup E'_i$ (i = 1, 2),
to a mapping $f_i$, and the points in which $f_1$ possibly differs from $f_2$ are all in $E'_1 \cap E'_2$
and form therefore a point set in $E_1 \cap E_2$ of relative dimension not exceeding
n - 2 (with respect to the bicompactum $E \cup E' = E_1 \cup E_2$). It follows by lemma
7·2 that $f_1$, and thus f, admits an extension over $E \cup E'$, contradicting the choice
of $E'$.




Now let R be an arbitrary n-dimensional normal space. Its spectrum, being at
the same time the spectrum of an n-dimensional bicompactum, namely, of $\beta R$,
contains a Cantor manifold which by its definition lies on R. Thus theorem 7·1 is
proved in its full generality.


References

Alexandroff, P. 1932 Math. Ann. 106, 161-238.
Alexandroff, P. 1939 Math. Sbornik, 5, 403-423.
Alexandroff, P. 1940 C.R. Acad. Sci. U.R.S.S. 26, 619-622.
Alexandroff, P. 1941a Trans. Amer. Math. Soc. 49,
Alexandroff, P. 1941b Mitt. Akad. Wiss. Georgischen S.S.R. 2, 1-6.
Alexandroff, P. 1943 Trans. Amer. Math. Soc. 54, 286-339.
Alexandroff, P. & Hopf, H. 1935 Topologie. Berlin: Springer; also New York: Dover Publ.
Alexandroff, P., Hopf, H. & Pontrjagin, L. 1935 Comp. math. 4, 239-255.
Fomin, S. 1943 Ann. Math. 44, 471-480.
Hurewicz, W. & Wallman, H. 1941 Dimension theory. Princeton Univ. Press.
Kurosch, A. 1935 Comp. math. 2, 471-476.
Lefschetz, S. 1942 Algebraic topology. New York: Amer. Math. Soc.
Wallman, H. 1938 Ann. Math. 39, 112-126.


The crystal structures of two polyamides ('nylons')

By C. W. Bunn and E. V. Garner
Research Department, I.C.I. Ltd., Alkali Division, Northwich, Cheshire

(Communicated by Sir Lawrence Bragg, F.R.S. Received 6 May 1946.
Read 20 March 1947)

[Plates 1 and 2] 


Detailed interpretations of the X-ray diffraction patterns of fibres and sheets of 66 and
6.10 polyamides (polyhexamethylene adipamide and sebacamide respectively) are proposed.
The crystal structures of the two substances are completely analogous.

Fibres of these two polyamides usually contain two different crystalline forms, α and β,
which are different packings of geometrically similar molecules; most fibres consist chiefly of
the α form. In the case of the 66 polymer, fibres have been obtained in which there is no
detectable proportion of the β form.

Unit cell dimensions and the indices of reflexions for the α form were determined by trial,
using normal fibre photographs, and were checked by using doubly oriented sheets set at
different angles to the X-ray beam. The unit cell of the α form is triclinic, with a = 4·9 Å,
b = 5·4 Å, c (fibre axis) = 17·2 Å, α = 48½°, β = 77°, γ = 63½° for the 66 polymer; a = 4·95 Å,
b = 5·4 Å, c (fibre axis) = 22·4 Å, α = 49°, β = 76½°, γ = 63½° for the 6.10 polymer. One
chain molecule passes through the cell in both cases.

Atomic coordinates in α crystals were determined by interpretation of the relative
intensities of the reflexions. The chains are planar or very nearly so; the oxygen atoms appear
to lie a little off the plane of the chain. The molecules are linked by hydrogen bonds between
C=O and NH groups, to form sheets. A simple packing of these sheets of molecules gives
the α arrangement.





An alternative packing of the sheets gives a two-molecule triclinic cell, and this is the
structure proposed for the β form.

Streaks on the layer lines (but not on the equator) of fibre photographs are attributed to
an arrangement of molecular sheets in which α and β groupings occur at random.

The optical properties of fibres and doubly oriented sheets are consistent with the structure 
determined by the X-ray method. Unoriented specimens are spherulitic; optical evidence 
on the structure of the spherulites is discussed. 

Some striking similarities between the crystallography of the polyamides and that of the 
fibrous proteins are discussed. 


Polyamides of high molecular weight have attained considerable practical importance
as 'nylon' fibres. The substance most widely used in this way is the polyamide
made by the condensation of hexamethylene diamine, NH2(CH2)6NH2, and adipic
acid, HOOC(CH2)4COOH, and known as '66' polymer. (In the abbreviated nomenclature
for the polyamides made from diamines and dibasic acids, two numbers are
given, the first being the number of carbon atoms in the diamine and the second
the number of carbon atoms in the dibasic acid. Thus, the polymer made from
hexamethylene diamine and sebacic acid, HOOC(CH2)8COOH, is known as '6.10'.)
In the attempt to correlate the physical properties with molecular structure, interest 
centres on the high tensile strength of the fibres (which in this respect resemble 
natural silk) and on the high melting-points of the polyamides as compared with 
polyethylene (66 nylon melts at 250° C, polyethylene at 115° C). It is natural to
suppose that these features are due in some way to forces between the polar groups
in the molecules; one expects to find hydrogen bonds between the C=O groups of
one molecule and the NH groups of its neighbours; this expectation has been
realized in the present work, in which O-N distances characteristic of hydrogen bonds
have been found. The polyamides are also interesting on account of their relationship
to the proteins: the grouping -CO-NH- is present in both types of substances,
and any information on molecular packing gained from a detailed study of
the structures of polyamides may be useful in dealing with the far more complex 
problems presented by the fibrous proteins. The expectation of finding some 
structural features common to polyamides and fibrous proteins has, in fact, been 
strikingly justified in the present work, which was finished in 1941.

The two polymers studied here by X-ray methods are those already mentioned— 
the 66 and 6.10 polyamides. It seemed likely from the start that the crystal structures 
of these two substances were analogous, and it soon became evident that the 
detailed interpretation of the X-ray diffraction pattern of either polymer by itself
was likely to be very difficult. Both substances were, therefore, studied concur¬ 
rently; hypothetical interpretations of particular features were applied to both, 
and correlation of the results of such tests not only eased the task of interpretation, 
but led to increased confidence in the results. It is doubtful whether, if only one 
polymer had been studied, it would have been possible to get as far as a detailed 
solution of the atomic structure such as those proposed in this paper; and certainly, 
any proposed interpretation would have been regarded with far more suspicion. 
The difficulties and ambiguities of the interpretation of X-ray diffraction patterns
of high polymers are such that this procedure (examination of two or more structures
which promise to be analogous) is of great value, and should be followed
wherever possible.

Chain configuration. All the X-ray diffraction photographs used in the present
work were taken by copper Kα radiation. Those used for the detailed interpretation
in terms of atomic structure were taken on cylindrical films, in order to obtain as
wide an angular range of reflexions as possible. Drawn fibres of the 66 and 6.10
polymers give diffraction patterns showing well-marked layer lines; for structure
determination, it is necessary that the crystals in the fibres should be as large and
perfect as possible, giving sharp reflexions; and the orientation should be as perfect
as possible, so that the arcs are as short as possible. Fuller, Baker & Pape (1940)
have shown how this can be achieved by suitable thermal treatment. One of the
best photographs obtained is shown in figure 6, plate 1; those of the 6.10 polymer
were not quite so good as those of 66.

The repeat distances for the two polymers along the fibre axis, given approximately
by the layer line spacings but determined more accurately later on in another way,
are 17·2 Å for the 66, and 22·4 Å for the 6.10 polymer. It has already been pointed
out by Fuller (1940) that, if it is assumed that the long molecules are parallel to
the fibre axis, these figures suggest that the chains must be fully extended, or very
nearly so: they must be plane or very nearly plane zigzags:

[Diagram: plane zigzag chain of the 66 polymer with its repeat distance of 17·2 Å marked.]

Any considerable rotation round any of the bonds to give a non-planar chain would 
shorten the repeat distances. Thus, for these polymers there was, from the start, 
very little doubt about the chain configuration; the problem to be solved was one
of molecular packing, that is, the orientation of the zigzag planes and the registration of
neighbouring molecules along the fibre axis.

Unit cell. One dimension of the unit cell, which will be called the c axis (the
'height' of the cell), is the same as the molecular repeat distance just mentioned:
17·2 Å for the 66 polymer and 22·4 Å for the 6.10. The dimensions of the base of the
cell, projected on to a plane normal to the c axis, are obtained from the positions 
of the spots on the equators of the photographs. It is important to note that both 
the positions and the intensities of these equatorial reflexions are almost exactly 
the same for the two polymers, a fact which suggests that the molecules are packed 
side by side in almost exactly the same way in the two polymers. The simplest 
projected cell-base which fits the positions of the equatorial reflexions has edges
4·03 and 4·77 Å* with an angle of 66° 15′ between them (ABCD in figure 1); there
can be only one chain molecule passing through this cell, for the distance between
unbranched chain molecules packed side by side is always of the order of 4 Å.

* The revised X-ray wave-lengths given by Lipson & Riley (1943) have been used. Spacings
given are in Ångström units, not kX units.

The first check of unit-cell dimensions is given by a comparison of calculated with 
observed density; and it can be made at this stage (that is, before the other angles of 
the unit cell are considered), since the volume of the cell is equal to the product of 
the height and the area of the projected cell-base. The volume of the cell is, for the
66 polymer, 301 × 10⁻²⁴ c.c., the weight of one monomer unit 373 × 10⁻²⁴ g., hence
the density would be 1·24. The measured density of drawn fibres of 66 polymer is
1·14 to 1·16. The measured densities of crystalline polymers are usually lower by
5 to 10 % than the true densities of the crystalline regions, owing to the presence of
amorphous material of lower density; hence the density of the crystalline regions
of 66 polymer would be expected to be about 1·20 to 1·25; the density calculated for
the proposed unit-cell dimensions is in satisfactory agreement with this figure.
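
The density check described above can be reproduced in a few lines. What follows is a minimal sketch in Python, assuming only the figures quoted in the text (projected cell-base edges 4·03 and 4·77 Å at 66° 15′, c = 17·2 Å, and 373 × 10⁻²⁴ g for one chemical repeat unit). The function and variable names are illustrative and do not come from the paper.

```python
import math

ANGSTROM3_TO_CM3 = 1e-24  # one cubic angstrom expressed in cubic centimetres


def projected_base_area(edge1, edge2, angle_deg):
    """Area (A^2) of a parallelogram cell-base with the given edges and included angle."""
    return edge1 * edge2 * math.sin(math.radians(angle_deg))


def cell_density(base_area, height, mass_per_cell_g):
    """Density (g per c.c.) of a cell of volume base_area * height holding mass_per_cell_g."""
    volume_cm3 = base_area * height * ANGSTROM3_TO_CM3
    return mass_per_cell_g / volume_cm3


# Values quoted in the text for the 66 polymer.
base = projected_base_area(4.03, 4.77, 66 + 15 / 60)
volume = base * 17.2                      # cubic angstroms
density = cell_density(base, 17.2, 373e-24)

print(f"projected base area = {base:.2f} A^2")
print(f"cell volume         = {volume:.0f} A^3")
print(f"calculated density  = {density:.2f} g/c.c.")
```

With these rounded inputs the volume and density come out within about 1 % of the 301 × 10⁻²⁴ c.c. and 1·24 quoted above; the small difference presumably reflects rounding of the cell-base dimensions.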


Figure 1. Alternative cell-bases projected on a plane perpendicular to c (fibre axis).

To find the other angles of the cell, it is necessary to consider the positions of all 
the other spots on the photographs; and in doing this, it is necessary to take into 
account the possibility that the true projected cell-base might be larger than that 
which fits the equatorial spots by themselves; in particular, it happens that the 
two-molecule cell ABEF in figure 1 is rectangular within the limits of error of 
measurement, and this may mean that the cell is an orthorhombic or monoclinic 
two-molecule cell which is apparently centred when seen along the c axis. 

These possibilities, and many more, were tested, using reciprocal lattice rotation
diagrams (see Bernal 1926), but it was found impossible to account for the whole of
the spots by any simple orthorhombic or monoclinic cell for either of the two poly¬ 
mers; only improbably large cells having these symmetries would account for all 
the spots. Triclinic cells were then considered; and in doing this, attention was first 
confined to the 7th layer line of the 66 polymer. The reflexions on the 7th layer line
are much stronger than those on other layers (apart from the equator), and there is 
thus a subperiod of c/7; this is evidently due to the fact that there are in the repeating 
unit seven zigzags, which to a first approximation can be regarded as equivalent. 
The procedure of confining attention to the 7th layer line is equivalent to reducing
the repeat distance of the molecules to c/7 and considering the side-by-side packing
of simple zigzag chains. Moreover, the positions and relative intensities of the
reflexions on the 9th layer line of the photographs of 6.10 polymer are very similar
to those of the 7th layer line of 66 polymer, a fact which is further evidence that the
packing of the zigzags is the same in the two polymers, and which means that the
procedure just mentioned should lead to a small pseudo-unit cell valid for both
substances. It was found that the positions of these reflexions could be explained
by a one-molecule triclinic cell.
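
The sub-period argument can be checked numerically. The sketch below is illustrative only: it assumes that every carbon atom of the diamine and of the dibasic acid lies in the chain backbone together with two nitrogen atoms per repeat, and that one zigzag comprises two chain atoms. The helper name `zigzags_per_repeat` is hypothetical.

```python
def zigzags_per_repeat(diamine_carbons, diacid_carbons):
    """Number of two-atom zigzags in one chemical repeat of the chain backbone."""
    chain_atoms = diamine_carbons + diacid_carbons + 2  # two nitrogen atoms per repeat
    return chain_atoms // 2


# Carbon counts follow the nomenclature in the text; c is the fibre-axis repeat in angstroms.
polymers = {
    "66": {"diamine": 6, "diacid": 6, "c": 17.2},
    "6.10": {"diamine": 6, "diacid": 10, "c": 22.4},
}

sub_periods = {}
for name, p in polymers.items():
    n = zigzags_per_repeat(p["diamine"], p["diacid"])
    sub_periods[name] = (n, p["c"] / n)
    print(f"{name}: {n} zigzags per repeat, sub-period c/{n} = {p['c'] / n:.2f} A")
```

Both sub-periods fall close to 2·5 Å, which is why the 7th layer line of the 66 polymer and the 9th layer line of the 6.10 polymer play the same role.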

The reflexions on the remaining layer lines are chiefly due to the oxygen atoms of
the C=O groups, together with some effects due to small inequalities of the different
zigzags in the repeating units. In considering the transition from the small pseudo-unit
cell just mentioned to the true unit cell, it is necessary to take into account
the different possibilities arising from the attachment of oxygen atoms to alternative
positions on neighbouring chains; thus, in two dimensions, two possibilities
are illustrated in figure 2. In three dimensions there are various possible unit cells,
all compatible with the small pseudo-unit, but all could be ruled out except one,
and this accounted well for all the spots on the photographs of the 66 polymer,
except for a row of weak reflexions very near the meridian. (On most photographs,
these spots are joined to other spots by streaks along the layer lines, and they may
not be distinguished from such streaks; but suitably annealed specimens show them
clearly, see figure 3L in Fuller et al. (1940); and pressed specimens, suitably
oriented, show them more strongly and clearly, see our figure 9. The interpretation
of the streaks will be considered later.)



Figure 2



On considering the merits of a corresponding unit cell for the 6.10 polymer (a
cell, that is, with the same angles but different height), it was again found that all
the spots on all the layer lines fitted, except a row of weak spots near the meridian.
This seemed highly significant, for individual spots on the 6.10 photographs are 
(except for those on the 9th layer) in quite different positions from those on the 66 
photographs; but the interpretation could only be correct if it could be assumed that 
the weak spots near the meridian are due to a second crystalline phase present in the
specimens of both polymers. This second crystalline phase would be simply a different
packing of the same molecules (the repeat distance being exactly the same as in the 
main crystalline arrangement). 

This is not the first occasion on which polymorphism of this type has been suggested
to account for the X-ray diffraction patterns of long-chain polymers. Fuller &
Frosch (1939) suggest that fibres of certain polyesters contain two different crystalline
forms which can be derived from each other by simply displacing the chains
parallel to each other (see figure 3). This is essentially the same as Schoon's suggestion
(1938) to account for the various polymorphic forms of the shorter-chained hydrocarbons
and acids (20 to 40 carbon atoms). The mode of side-by-side packing of the
zigzag chains is the same if the displacements (d in figure 3) are exact multiples of
the zigzag period 2·5 Å, and displacements of this magnitude do account for the
known unit cells. Various forms of cell (orthorhombic, monoclinic and triclinic)
are obtainable by suitable displacements of the chains.



[Diagram: chain displacements d giving monoclinic, orthorhombic, monoclinic (2nd type) and triclinic cells.]

Figure 3. Polymorphism of chain compounds.


The hypothesis of polymorphism was accepted provisionally, and evidence in 
favour of it accumulated gradually, as the interpretation became more detailed. 
But we shall anticipate events by saying that, after the complete interpretation
presented here had been worked out, a specimen of the 66 polymer was obtained
which showed no trace of the spots ascribed to the second crystalline form (see
figure 10, plate 1). The specimen was prepared by immersing a taut fibre (0·6 mm.
diameter) for 10 min. in a 7 % solution of phenol in water, then for 2 hr. in boiling
water, the phenol being removed by drying in air at 66° C for 3 days. This specimen
evidently consisted of a single crystalline species; had it been available earlier, it
would have simplified the process of interpretation. But its discovery strikingly
vindicated the whole scheme of interpretation which had by that time been worked 
out. 





The crystal form which is predominant in most nylon specimens will be called
the α form. The dimensions of the triclinic unit cells of the α form of both polymers
are shown in figure 4. The second crystal form, which gives the row of spots near the
meridian of the fibre photograph, will be called the β form.



Figure 4. Dimensions of unit cells of 66 and 6.10 polyamides.

The next step was made possible by the discovery that a double orientation of the 
crystals can be obtained by pressing or rolling nylon fibres. In a fibre, all the crystals 
have their c axes parallel to each other, but in other respects are disposed at random; 
but in a pressed or rolled specimen, the crystals not only have their c axes parallel, 
but also have one particular crystal plane approximately parallel to the plane of the 
sheet. In a fully drawn fibre cold-rolled in the direction of the fibre axis, the c axes 
of the crystals remain parallel to the fibre axis, but if a long undrawn fibre is pressed, 
the c axes become perpendicular to the original fibre axis. The c axes set themselves 
along the direction of greatest flow during pressing. There is some departure from 
strict parallelism, especially for the second (plane) orientation. An X-ray photograph
taken with the beam passing down the c axis (figure 7, plate 1) showed that
the plane which has been called 010 of the unit cell is parallel to the plane of the 
sheet; the same photograph confirmed the angle between the 010 and 100 planes 
of the cell. 




When a doubly-oriented specimen is set at any particular angle to the X-ray
beam, the pattern produced is similar to that which would be given by a single
crystal or a twinned crystal oscillated over a small angle, this angle corresponding
to the dispersion of crystal orientation in the specimen. For the polyamides four
different orientations of the unit cell in the specimen, all of them consistent with
the restrictions mentioned (one axis common and one plane common), are possible
(figure 5), and the specimen thus simulates, not a single crystal, but a quadruplet.
A photograph of such a specimen placed at a particular angle to the X-ray beam
shows certain reflexions only, and the intensities of them are not the same as in a
fibre photograph (see figure 8, plate 1). By observing the specimen setting at which
each reflexion is recorded at maximum intensity, it is possible to deduce the approximate
orientations of all the reflecting planes with respect to each other and to the
axes of the specimen. The graphical method which was used for interpreting these
photographs is essentially Bernal's reciprocal lattice method (see Bernal 1926)
suitably extended to deal with the fourfold twinning in the specimens.

All the reflexions appeared at the expected angles for both polymers. It must
be emphasized that this test of the proposed unit cell is a severe one, and the result
of it leaves very little doubt that the proposed triclinic unit cell for the α form is
correct; from these photographs it was also possible to distinguish between alternative
indices for certain spots. The photographs also indicated the orientations of
the principal reflecting planes of the β form: the reflexions are strongest when the
X-ray beam is parallel to the plane of the sheet, as in figure 9, plate 1.

[...] proposed polymer structure is
[...] positions of the atoms in the cell
[...] trial in an attempt to obtain agreement between calculated and
observed intensities. (Direct or semi-direct Fourier series methods of finding [...],
[which] are used whenever possible for single-crystal
photographs, are not practicable for most polymers.)

In the triclinic system, only two space-group symmetries are possible: P1 with
[...] symmetry. Molecules of 66 and 6.10
[...] of symmetry, and assuming that [...]
[...] arrangement (in this one-molecule cell, no way of avoiding
its utilization is apparent), the space-group symmetry is P1̄.

Ignoring hydrogens, whose positions cannot be found by X-ray methods, there
[...] approximate form of the molecule
[...] also lying in or near this
[...] The problem of [...] atomic coordinates therefore resolves itself into



Figure 5. Four orientations of cell in doubly oriented specimens.
Optical vibration directions also shown.

[...] sharp, detailed photographs of single crystals are available. The limitations of
polymer photographs are such that we must be content with accepting, for the most
part, interatomic distances derived from other work, and moving and distorting the
molecules without departing greatly from the accepted interatomic distances. This
limitation in the interpretation of polymer photographs is not really a serious one,
inasmuch as most interatomic distances, such as C-C, O-C=O, are well
established and are not likely to vary appreciably; the only circumstances in which
interatomic distances are in doubt are those in which bond character is in doubt.
Thus, in the polyamides, the carbon-carbon links are undoubtedly single, and their
length can be assumed confidently to be 1·53 Å; but the group usually written
-CO-NH- (amide form) may actually exist as -C(OH)=N- (imidol form). The
carbon-oxygen and carbon-nitrogen distances are on this account uncertain; C-O is
usually 1·43, C=O 1·25, C-N (in amides and amino-acids) 1·39, C=N 1·28. The
procedure adopted was to keep the C-C bonds at 1·53 ± 0·03, but to treat the
nitrogen and oxygen atoms semi-independently, moving them about in the cell
regardless of interatomic distances within the wide limits between single- and
double-bond distances. The diffracting powers of C, CH2, NH and O were assumed
to be in the ratios of the numbers of electrons 6 : 8 : 8 : 8, and the decrease of diffracting
power with angle of diffraction to be as given for carbon by Cox & Goodwin (1936).

The approximate orientation of the molecule in the unit cell is suggested by the
cell dimensions. The distance between chains A and D in figure 1 (4·77 Å) is greater
than that between A and B (4·03 Å), and therefore the plane of the zigzag chain is
expected to be in or near the (010) face of the cell. Furthermore, the angle between
the a and c axes is 77°, and this is just the angle to be expected if the molecules are
lying with their planes in or near (010) and with the oxygen atom of one molecule
on the same level as the NH of the next, as it would be if there is a hydrogen bond
between them (see figure 2a).
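
The relation between the triclinic cell and the projected cell-base of figure 1 can be illustrated by projecting the a and b edges on a plane normal to c. This is a sketch under stated assumptions: the usual crystallographic convention (α between b and c, β between a and c, γ between a and b) is assumed, and the cell dimensions are those quoted for the α form of the 66 polymer. The function name is hypothetical.

```python
import math


def project_onto_base(a, b, alpha_deg, beta_deg, gamma_deg):
    """Project the a and b edges of a triclinic cell on a plane normal to c.

    Returns (a_perp, b_perp, projected_angle_deg, a_shift, b_shift), where the
    shifts are the components of a and b along the c direction.
    """
    al, be, ga = (math.radians(x) for x in (alpha_deg, beta_deg, gamma_deg))
    a_perp = a * math.sin(be)   # beta is the angle between a and c
    b_perp = b * math.sin(al)   # alpha is the angle between b and c
    cos_proj = (math.cos(ga) - math.cos(al) * math.cos(be)) / (
        math.sin(al) * math.sin(be)
    )
    angle = math.degrees(math.acos(cos_proj))
    return a_perp, b_perp, angle, a * math.cos(be), b * math.cos(al)


a_perp, b_perp, angle, a_shift, b_shift = project_onto_base(4.9, 5.4, 48.5, 77.0, 63.5)
print(f"projected edges: {a_perp:.2f} A and {b_perp:.2f} A at {angle:.1f} deg")
print(f"shift along c between neighbours: {a_shift:.2f} A (a), {b_shift:.2f} A (b)")
```

The projected edges agree closely with the 4·77 and 4·03 Å of the cell-base, and the projected angle lies within a few tenths of a degree of 66° 15′, as far as the rounded cell dimensions allow.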

Calculations of the relative intensities of the equatorial (hk0) reflexions and
subsequently of the whole of the reflexions confirmed this, and defined the orienta¬ 
tion more closely. In the calculations, the objective was to obtain relative intensities 
of the same order of magnitude as those observed on the photographs, the latter 
being estimated by eye and classified as strong, medium, weak and so on; for the 
interpretation of polymer photographs it is, generally speaking, scarcely justifiable 
to attempt more than this. The limitations are well brought out by experience in the 
present work, for it must be admitted that entirely satisfactory agreement between 
observed and calculated intensities, even within this modest scope, could not be 
attained, and it appeared that no adjustments of atomic coordinates could possibly
remove all the discrepancies. This may mean, of course, that the whole scheme of
interpretation is wrong; but against this it must be urged that, apart from the
difficulty of finding the unit-cell dimensions (and the results here can now scarcely
be doubted in view of the checks provided by the doubly-oriented specimens), the
structure is a simple one, and in a one-molecule cell with only eight atoms to place 
and all of them closely limited in position by accepted interatomic distances and 
angles, the possibilities of adjustment are limited, and it can be said with more 
confidence than usual that the discrepancies cannot be removed by any adjustments 
of atomic positions. We shall return to these discrepancies later, and merely observe
here that they are systematic, that they are of the same type in both 66 and 6.10
polymers, and that reasonable explanations have been found for them.

Accepting these systematic discrepancies, the best agreement between calculated 
and observed intensities was obtained by placing the plane of the chain nearly 
parallel to the 120 plane of the crystal, but with the oxygen atom pushed very 
nearly back to the 010 plane (see figure 12), that is, appreciably out of the plane of
the chain. In view of the uncertainties of interpretation, it may be doubted whether 
this feature is genuine; but in its favour it must be emphasized, first, that any other 
configuration gives a poorer agreement between calculated and observed intensities 
(involving non-systematic discrepancies), and secondly, that the determination of 
the chain orientation is largely independent of that of the position of the oxygen 
atom: the intensities of reflexions on the equator and on the 7th layer for 66 polymer
(the 9th layer for 6.10) are determined mainly by the positions of the chain atoms,
while the intensities of reflexions on all the other layer lines are determined mainly 
by the positions of the oxygen atoms. 



Figure 12. Structure of 66 and 6.10 polyamides, c projection. Arrows indicate
movements of oxygen atoms when molecules rotate round their long axes. 

Very similar results were obtained for the two polymers with which this work is 
concerned. Thus the structures are completely analogous; they are completely 
defined by the following figures. (The parameters given for the 66 polymer imply 
small distortions of the chain from the strictly planar form; but it is doubtful 
whether these are significant. The 6.10 parameters are those of a strictly planar 
chain.) 

66 polyamide, α crystals

Unit cell   a = 4·9 Å     α = 48½°    Space group P1̄.
            b = 5·4 Å     β = 77°
            c = 17·2 Å    γ = 63½°

Atomic coordinates in fractions of unit cell edges:

              x          y          z
CH2 (1)    -0·083     +0·041     +0·037
CH2 (2)    +0·120     -0·040     +0·115
C          -0·088     +0·010     +0·186
O          -0·371     +0·010     +0·201
N          +0·058     -0·019     +0·253
CH2 (3)    -0·131     +0·095     +0·313
CH2 (4)    +0·089     -0·039     +0·392
CH2 (5)    -0·106     +0·066     +0·458



6.10 polyamide, α crystals 

Unit cell   a =  4.95 Å   α = 49°     Space group P1̄. 
            b =  5.4 Å    β = 76½° 
            c = 22.4 Å    γ = 63½° 

Atomic coordinates: 

              x         y         z 
CH2 (1)    +0.102    -0.046    +0.029 
CH2 (2)    -0.102    +0.046    +0.080 
CH2 (3)    +0.102    -0.046    +0.138 
N          -0.083    +0.037    +0.187 
C          +0.102    -0.046    +0.242 
O          +0.358    -0.022    +0.228 
CH2 (4)    -0.102    +0.046    +0.296 
CH2 (5)    +0.102    -0.046    +0.368 
CH2 (6)    -0.102    +0.046    +0.411 
CH2 (7)    +0.102    -0.046    +0.473 






The best values for the lengths of the c axes were obtained from photographs of 
specimens rotated about a direction perpendicular to the c axis (figure 18, plate 2). 
For each polymer a strong reflexion was produced from a plane almost exactly 
perpendicular to the fibre axis (1.3.14 for the 66 polymer, 1.3.18 for 6.10). The 
spacing of this plane is equal to the length of the c axis. 
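
The statement about the 1.3.14 and 1.3.18 planes can be checked from the cell constants alone. The sketch below is an illustration, not the authors' procedure: it builds the triclinic metric tensor, inverts it to obtain the reciprocal metric, and returns the spacing of a plane together with the angle between its normal and the c axis. The function names are hypothetical, and the half-degree angles are entered as decimals.

```python
import math

# Cell constants as quoted in the text: a, b, c in Å; alpha, beta, gamma in degrees.
CELL_66 = (4.9, 5.4, 17.2, 48.5, 77.0, 63.5)
CELL_610 = (4.95, 5.4, 22.4, 49.0, 76.5, 63.5)


def metric_tensor(a, b, c, alpha, beta, gamma):
    """Direct-space metric tensor (Å^2) of a triclinic cell."""
    ca, cb, cg = (math.cos(math.radians(x)) for x in (alpha, beta, gamma))
    return [
        [a * a, a * b * cg, a * c * cb],
        [a * b * cg, b * b, b * c * ca],
        [a * c * cb, b * c * ca, c * c],
    ]


def invert_3x3(m):
    """Inverse of a 3x3 matrix by cofactors (here: the reciprocal metric tensor)."""
    (a, b, c), (d, e, f), (g, h, i) = m
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    return [
        [(e * i - f * h) / det, (c * h - b * i) / det, (b * f - c * e) / det],
        [(f * g - d * i) / det, (a * i - c * g) / det, (c * d - a * f) / det],
        [(d * h - e * g) / det, (b * g - a * h) / det, (a * e - b * d) / det],
    ]


def spacing_and_tilt(cell, hkl):
    """Return (d_hkl in Å, angle in degrees between the plane normal and c)."""
    g_star = invert_3x3(metric_tensor(*cell))
    inv_d_sq = sum(hkl[i] * g_star[i][j] * hkl[j] for i in range(3) for j in range(3))
    d = 1.0 / math.sqrt(inv_d_sq)
    # The reciprocal vector H satisfies H . c = l, so cos(tilt) = l * d / c.
    cos_tilt = hkl[2] * d / cell[2]
    tilt = math.degrees(math.acos(max(-1.0, min(1.0, cos_tilt))))
    return d, tilt


for name, cell, hkl in (("66", CELL_66, (1, 3, 14)), ("6.10", CELL_610, (1, 3, 18))):
    d, tilt = spacing_and_tilt(cell, hkl)
    print(f"{name}: d = {d:.3f} Å, l*d = {hkl[2] * d:.2f} Å, tilt from c = {tilt:.1f} deg")
```

On this reading, the spacing multiplied by its order l reproduces the c axis to within the small tilt of the plane normal, which is presumably the sense in which the spacing is said to give the length of the c axis.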

The complete structure of one unit cell of the 66 polymer is shown in figure 13, 
and a view along the c axis in figure 12. 

Thermal vibrations and crystal distortions. The observed and calculated intensities 
are compared in an appendix, and a graphical comparison for the 66 polymer is 
shown in figure 14. It will be seen that there are some discrepancies. Taking the 
average correlation of calculated and observed intensities as standard, it is noticeable 
that for reflexions from the 010 plane and from all planes making a small angle with 
010 (that is, all planes in which k has a high value in comparison with h, such as 020, 
120, 122, 124, 126 in the case of 66 polymer), the calculated intensities are much 
larger than the observed, while the opposite is true for planes approximately 
perpendicular to 010 (such as 310, 212). No adjustments of atomic parameters can 
remove these discrepancies. The same is true for the 6.10 polymer. The fact that the 
discrepancies are for the most part systematic and of the same type for both polymers 
suggests that they are not due to incorrect parameters, but to some general feature 
of the structure. 

Two possible explanations of these discrepancies have been considered, and it is 
likely that both play some part in producing the observed effects. One is that thermal 
vibrations of the molecules are strongly anisotropic. Vibrations of a triclinic structure 
are certain to be anisotropic, and in the case of the present proposed α-polyamide 
structure, in which the molecules are held together much more firmly in the 010 
plane (by the hydrogen bonds; see below) than in other directions, it is likely that 
thermal vibrations will be greatest in the direction normal to the 010 plane and least 
in this plane; as a consequence, the intensities of reflexions from 010 and all planes 
making a small angle with 010 will be reduced in comparison with those from other 
planes. Many different types of thermal vibrations are, of course, possible in a 
structure of this type: sinuous movements of the chain molecules, torsional oscillations 
of the molecules about their axes, as well as vibrations of individual atoms. These 
different types of movement will affect the intensities in somewhat different ways, 
but in general the effects will be those already stated, which correspond with those 
actually observed. 



Torsional oscillations of the molecules about their axes may be responsible for the 
fact that the distances between carbon atoms in neighbouring molecules are not all 
the same. Those between atoms in molecules 2 and 4 in figure 12 (4.3 Å) are 
perceptibly greater than those between atoms in molecules 1 and 2, or molecules 2 and 
3 (3.9 to 4.0 Å). The sweep of the oxygen atoms of molecule 2 would tend to push 
molecule 4 away, without affecting the distances between 1 and 2 or between 2 and 3. 



Figure 14. Comparison of observed and calculated intensities. 66 polyamide, α form. 
Where photographs of doubly oriented specimens distinguish between alternative 
indices (for example 212 and 122), the verdict is marked on the figure. 

The second possible explanation of the discrepancies in the intensities is that the 
crystals are likely to be more distorted in the direction normal to the b axis than in 
other directions. This again is a consequence of the layer formation of the crystal 
structure. Each layer of molecules would be flexible in directions perpendicular 
to the layer (figure 15), but molecules would be less easily distorted in the plane of 
the layer, owing to the much stronger forces between them. The effect on the 
intensities would be similar to that of the anisotropic thermal vibrations, and it does 
not seem possible to decide to what extent each of the effects is responsible for 
modification of the X-ray intensities. 

Since the discrepancies can be explained in a reasonable way, it is considered that 
the structures proposed are acceptable interpretations of the X-ray data. In 
comparing the discrepancies with those encountered in an earlier case in which 
such effects have been discussed (Bunn 1939), it should be noted that the difference 
is that whereas in polyethylene all the hk0 reflexions are weakened to the same extent 
(because the lateral forces are much the same in all directions), in nylon the 0k0 
reflexions and those from nearby planes are weakened much more than h00 and 
nearby reflexions. Whether (and to what extent) h00 reflexions of nylon are weakened 
in comparison with those from planes normal to c (an effect which would be expected 
owing to the shape of the electron cloud of the CH2 group; Bunn 1939) we do not 
know. It is difficult to make a reliable comparison of these reflexions on a fibre 
photograph, and on a powder photograph there is too much overlapping of other 
reflexions. 



Figure 15. Distortion of crystals. 


Structure of β form. The amount of information available for deducing the 
molecular arrangement in the β form is meagre; nevertheless, it has proved possible to 
draw certain definite conclusions about it. 

The single set of spots available is a set of successive orders of reflexion by one 
particular plane, which is nearly at right angles to the molecular length (the c axis); 
the spots are evidently 001, 002, etc., of the β crystals. The spacings of these spots 
show that the molecular identity-period (the c axis) is indistinguishable from that 
of α crystals; the chain structure in β crystals is thus the same as in the α form. The 
difference is in the positions of the molecules along the c axis. The state of affairs is 
similar to that in certain polyesters (see figure 3), and contrasts with that in 
guttapercha, which can be obtained in two different crystal forms consisting of 
geometrically different molecules (Bunn 1942). 

The intensities of the β spots on the set of photographs of doubly-oriented specimens 
make it possible to define approximately the orientation of the β 001 plane. This 
orientation is such as to suggest a cell having an ac angle of about 77° (as in the 
α form) and a bc angle about 90° (contrasting with 48½° for the α form). The angle of 
77° suggests that in the β form there are sheets of molecules joined by hydrogen 
bonds in the 010 plane, just as in the α form; the only difference between the two 
crystal forms would thus be in the relative positions of successive sheets of molecules. 
In the α form, if we take one particular sheet as the standard of reference, with 
the c axis vertical as usual, the next sheet is not on the same level as the first, but 
is displaced upwards (in our setting) by 3.55 Å, this displacement defining the bc 
angle as 48½°. The third sheet is displaced in the same direction by the same amount, 
and so on (see figure 16). Since in the postulated β cell the bc angle is about 90°, it 
appears either that all the layers are on the same level, or else that there is a scheme 
of successive displacements, some up and some down, the net result being an eventual 
return to the same level. The simplest assumption is that all the layers are on the 
same level, in which case the unit cell would be a triclinic one-molecule cell. The 
relative intensities of the 00l reflexions are, however, such that this arrangement 
is out of the question; if all the molecules were on the same level, the relative 00l 
intensities would be practically the same for the β form as they are for the α form. 
In fact they are not; for 66 nylon, α 002 is very much stronger than α 001, whereas 
β 002 has about the same intensity as β 001 (see figure 9, plate 1). 
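
The link between the layer displacement and the bc angle can be verified in a line or two. The sketch below assumes, as the text implies, that successive sheets are related by the b translation, so that the displacement along the chain direction is the component of b along c, namely b cos α. The function name is hypothetical.

```python
import math


def layer_shift_along_c(b, alpha_deg):
    """Component of the b translation along the c axis (same units as b)."""
    return b * math.cos(math.radians(alpha_deg))


shift_66 = layer_shift_along_c(5.4, 48.5)     # alpha form, 66 polymer
shift_610 = layer_shift_along_c(5.4, 49.0)    # alpha form, 6.10 polymer
shift_beta = layer_shift_along_c(8.0, 90.0)   # postulated beta cell, bc angle 90 degrees

print(f"66 alpha: {shift_66:.2f} Å, 6.10 alpha: {shift_610:.2f} Å, beta: {shift_beta:.2f} Å")
```

Both α cells give a shift close to the 3.55 Å quoted, while a bc angle of 90° gives no net shift per cell translation, which is why the β cell requires either level layers or displacements that cancel.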


Figure 16. Arrangement of sheets of molecules in α and β crystals. 
Lines represent chain molecules, circles oxygen atoms. 


The next possible structure (in order of simplicity) is one in which the second 
layer is displaced upwards with regard to the first, and the third is on the same level 
as the first (figure 16 β). This arrangement has a triclinic two-molecule cell. It was 
found that such an arrangement does give correct relative intensities for the 00l 
reflexions, if the layer displacement is about 3.0 to 3.5 Å (alternately up and down); 
moreover, the intensities of the β spots of the 6.10 polymer demand this same layer 
displacement (for figures, see Appendix). Note especially the figure for the layer 
displacement (3.0 to 3.5 Å), which is about the same as in the α form (3.55 Å); it 
appears, in fact, that the only difference between α and β crystals is that in the α form 
successive layers are displaced always in the same direction by a fixed distance 
3.55 Å, whereas in the β form successive layers are displaced alternately up and down 
by the same distance. This correspondence between the suggested α and β structures 
(the relation being the same in both 66 and 6.10 polyamides, and thoroughly in line 
with the polymorphism of other chain compounds) is one more significant point in 
favour of the whole scheme of interpretation of nylon photographs proposed in this 
paper. 

The structures of β crystals of the two polymers are defined by the following 
figures. Only the coordinates of oxygen atoms are given; the amount of information 
available is insufficient for the determination of the coordinates of chain atoms. 
The structure of the chains (and, indeed, that of any one 010 layer of molecules) 
must, however, be very similar to that found in α crystals. 

66 polyamide, β crystals 

Unit cell   a =  4.9 Å    α = 90°    Space group P1̄. 
            b =  8.0 Å    β = 77° 
            c = 17.2 Å    γ = 67° 

Chains lie along the lines 0.0a, 0.25b and 0.0a, -0.25b. 

Oxygen atoms of reference molecule at -0.371a, -0.250b, +0.110c 
                                  and +0.371a, -0.250b, +0.690c 

6.10 polyamide, β crystals 

Unit cell   a =  4.9 Å    α = 90°    Space group P1̄. 
            b =  8.0 Å    β = 77° 
            c = 22.4 Å    γ = 67° 

Chains lie along the lines 0.0a, 0.25b and 0.0a, -0.25b. 

Oxygen atoms of reference molecule at -0.370a, -0.250b, +0.195c 
                                  and +0.370a, -0.250b, +0.645c 

The reason why only one set of β reflexions appears on the photographs is that the 
proportion of β crystals in the specimens is small. The 00l reflexions appear because 
they fall very near the meridian of the photographs, where geometrical factors 
make for enhanced intensities (see Cox & Shaw 1930). Reflexions from hk0 and 
hk7 planes (hk9 for 6.10 polymer) should have appreciable intensities, but they 
necessarily coincide with reflexions from α crystals, on account of the close relation 
between the two arrangements. 

Layer line streaks. All the spots on the X-ray photographs of 66 and 6.10 
polyamides have now been accounted for. In addition to the spots, there is one unusual 
feature in the photographs of most specimens. On each of the lower layer lines of 
spots, near the meridian, is a streak connecting the spots. The most intense streak 
is on the second layer line, where it connects the strong α 002 spot with the weak 
β 002 spot. The intensities of the streaks vary in photographs of different specimens. 

Such streaks, though unusual, have been seen before in X-ray photographs of 
several different types of substances: the fibrous mineral chrysotile (Warren & 
Bragg 1930), the fibrous protein keratin after stretching in steam (Astbury & Street 
1931; Astbury & Woods 1933), and certain organic substances, such as p-azoxyanisol, 
which on heating are capable of forming a liquid crystal phase before complete melting 
occurs (Bernal & Crowfoot 1933). The explanation suggested by Warren & Bragg 
in the case of chrysotile is that one of the crystal axes (the b axis) is variable in 
length; the a and c axes are fixed by the nature of the structure, but it is possible 
to imagine alternative arrangements which would alter the b axis. For keratin, 
Astbury & Woods suggest a similar explanation, that one of the crystal axes is 
variable owing to a hydrolytic breakdown among the side-chain linkages. For the 
polyamides it is scarcely possible to entertain an explanation of this type, since the 
molecules are made from well-defined monomers of a high degree of purity, and the 
structure is such that variation of the side-spacings seems unlikely. 

The test of such an explanation is whether or not streaks are formed on the 
equator as well as on the upper and lower layer lines. The position is usually confused 
by the presence of 'white' radiation in the X-ray beam, which produces equatorial 
streaks with quite normal crystals. A photograph of 66 polyamide was therefore 
taken with copper Kα radiation monochromatized by reflexion from the 002 plane 
of a pentaerythritol crystal. On this photograph (figure 11, plate 1) streaks on 
upper and lower layer lines remain, but there are no streaks on the equator; hence 
there is no variation of the side-spacings. 

The layer-line streaks found in X-ray photographs of substances like 
p-azoxyanisol have been interpreted by Bernal & Crowfoot (1933) as evidence that the 
longish molecules have a certain degree of freedom of position with respect to each 
other; they may slide along their major axes to a limited extent, thus causing the 
unit cell to be slightly variable in shape, this giving rise to reflexions drawn out 
along the layer lines. (At higher temperatures the molecules gain complete freedom 
of movement in some but not all respects, this type of phase being a typical liquid 
crystal; at still higher temperatures the substance becomes a true liquid in which all 
movements are possible.) 

For the polyamides we suggest an explanation derived originally from the 
conception of two crystal forms differing from each other only in the relative 
positions of successive sheets of molecules. A large number of layers arranged as in 
figure 16 (left) constitutes an α crystal; a large number arranged as in figure 16 
(right) constitutes a β crystal; but in some regions there may be parallel growths of 
β on α crystals (this is quite likely because the sheet structures are identical), and 
further, there may be some indiscriminate placing of layers as in figure 17, in which 
small sections of α and β groupings follow each other at random. Such a composite 
grouping would give an X-ray reflexion at a position intermediate between those of 
corresponding α and β reflexions, and the existence in the specimen of composite 
groupings containing all possible proportions of α and β types means that X-ray 
reflexions are produced at all positions between those of true α and β spots. The 
streaks must follow the layer lines, because whatever the mode of packing of the 
molecules, the repeat distance in the molecules (which determines the layer line 
spacing) remains constant. This is what is observed in the photographs: for instance, 
in figure 11, plate 1, α 002 is connected to β 002 by a continuous streak which 
follows the 2nd layer line (on photographs taken in a cylindrical camera it is a 
straight line). It is also worth noting that in figure 11, plate 1, the streak does not 
appear to cross the meridian of the photograph; it merely joins the α to the β spot 
without going beyond the β spot. This is consistent with the idea (embodied in the 
above explanation) that there is no disturbance of the structure of individual layers 
of molecules; the ac angle 77° in each layer is maintained, and this ensures that, 
whatever the positions of neighbouring layers of molecules (within the scheme 
given), the angle made by 002 with the c axis is never greater than 77°, and this is 
the reason why the streak does not cross the meridian of the photograph. (Only 
certain 66 photographs were sufficiently well defined to show this; in the 6.10 
specimens the orientation of the crystals was not good enough.) 
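
That the layer-line position depends only on the chain repeat, and not on how the sheets are stacked, follows from the usual layer-line relation for a rotation or fibre photograph, l λ = c sin μ, with the layer line at height R tan μ on a cylindrical film of radius R. The sketch below is illustrative only: the camera radius is a hypothetical value (none is given in the text), the wavelength is the commonly quoted mean for copper Kα, and the function name is an assumption.

```python
import math

CU_K_ALPHA = 1.5418  # Å; commonly quoted mean Cu K-alpha wavelength


def layer_line_height(c_repeat, order, radius_mm, wavelength=CU_K_ALPHA):
    """Height of layer line `order` above the equator on a cylindrical film.

    Uses order * wavelength = c_repeat * sin(mu) and height = radius * tan(mu).
    Returns None when the layer line cannot be recorded (sin(mu) >= 1).
    """
    s = order * wavelength / c_repeat
    if s >= 1.0:
        return None
    return radius_mm * math.tan(math.asin(s))


RADIUS_MM = 30.0  # hypothetical camera radius, chosen only for illustration

# The cell edges a and b and the cell angles do not enter, so alpha, beta and
# any composite stacking of 66 polymer (c = 17.2 Å) share these layer lines.
heights_66 = [layer_line_height(17.2, l, RADIUS_MM) for l in range(1, 8)]
for l, y in enumerate(heights_66, start=1):
    print(f"66 polymer, layer {l}: {y:.2f} mm")
```

Because only c appears in the relation, a streak joining α 002 to β 002 is confined to the second layer line whatever the proportions of the two groupings.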

Figure 17. α, β and intermediate forms of nylon. 
Lines represent sheets of molecules seen edgewise. 

This explanation is similar to that suggested by Bernal & Crowfoot for 
p-azoxyanisol, except that in the latter, continuous variation of molecular position over a 
limited range is postulated, whereas for the polyamides we suggest an indiscriminate 
succession of alternative, quite definite layer positions. It seems likely that in the 
polyamides the layer positions would be definite, to allow the chain zigzags to fit in 
the same way as in α and β crystals. There is some evidence in favour of this; fibres 
drawn at room temperature from quenched specimens (see Fuller et al. 1940), or 
drawn from the melt into cold air (see our figure 19, plate 2), give X-ray 
photographs showing no spots, but streaks only, on all except the equatorial layer. There 
is evidently much disorder in such fibres, but it is noteworthy that the streak on the 
7th layer for 66 polymer (the 9th for 6.10) has a moderate intensity; the spacing 
for this layer is 2.46 Å, the average zigzag repeat distance, and the intensity of the 
streak, comparable (in relation to the rest of the pattern) with that of the 
corresponding crystal reflexions of more ordered specimens, means that the zigzags are 
in register just as in the crystals. For a similar reason, the fact that the streak on the 
2nd layer is strong means that the oxygen atoms are also in register as in the crystals. 
In fact, the relative intensities of the streaks on the different layer lines are consistent 
with our picture of composite α-β groupings rather than with a more complete 
disorder. 

The fact that it is possible to explain the layer-line streaks in a manner arising 
naturally from the two-crystal theory is regarded as another point in favour of the 
present scheme of interpretation. The complexity of the interpretation (two triclinic 
crystal forms together with composite groupings) is matched by the wealth of 
evidence (positions and intensities of spots, orientation of crystal planes, and the 
intensities of layer-line streaks). 

Amorphous material. The only remaining feature of polyamide fibre photographs 
is the diffuse ring passing through the first intense equatorial spot. Most X-ray 
photographs of polymers show such a ring, which indicates the presence of a certain 
proportion of molecules or sections of molecules not arranged or oriented in any 
precise way. The distances between such molecules are not constant but are likely 
to be equal or a little greater than the side-spacings in the crystals, and this is why 
the diameter of the ring is such that it passes through the inner of the two intense 
equatorial spots which show the side-spacings in the crystals. (The inner of the two 
rings corresponds to the greater spacing.) 

The possibility of the presence in the amorphous material of molecules not having 
the straight-chain structure of the bulk of the material must not be overlooked. But 
observe in the first place that in view of the spacing of the amorphous ring (4.5 Å) 
there is no room for side-chains. Secondly, the presence of significant amounts of 
large ring molecules (which would assume a long loop form with side-spacing similar 
to those of the chains; see Muller (1933)) is considered unlikely by Carothers (1940). 

All the features of X-ray diffraction photographs of 66 and 6.10 polyamides have 
now been accounted for. One is now in a position to consider the intermolecular 
forces in the crystal, and then to pass on to questions of texture and physical 
properties. 

Interatomic distances and intermolecular forces. The most striking feature of the 
α structure (figure 13) is that the oxygen of one molecule is found opposite the 
nitrogen of the next; and the distance between these two atoms is 2.8 Å, in both the 66 
and 6.10 polymers. The Van der Waals radius of NH is about 1.8 Å and that of 
O 1.35 Å (see Albrecht & Corey 1939), hence if there were no electrostatic interaction 
the N-O distance would be expected to be 1.8 + 1.35 = 3.15 Å. The shorter distance 
found, 2.8 Å, is characteristic of hydrogen bond formation: it is in good agreement 
with the N-O distances found in glycine (Albrecht & Corey 1939), acetamide 
(Senti & Barker 1940), alanine (Levy & Corey 1941) and diketopiperazine (Corey 
1938). 

The question whether the molecules have the amide or imidol structure is not 
answered by the present work: atomic parameters cannot be determined with 
sufficient accuracy. This is illustrated by the figures for the C-O distance, which in 
the 66 polymer appears to be about 1.35 Å and in the 6.10 polymer 1.28 Å; it is 
obviously not possible to decide whether the bond is single (for which the usual 
distance is 1.43 Å) or double (1.25 Å). The distance from carbonyl carbon to nitrogen 
appears to be about the same in both polymers (≈ 1.40 Å); but this must be regarded 
as fortuitous, although the figure does agree with that found in other amides. 
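
The distances discussed here follow from the fractional coordinates and the triclinic cell through the metric tensor. The sketch below is a reader's check under stated assumptions, not the authors' calculation: it uses the 66 polymer α coordinates as tabulated above, takes the coordinate triples in the order x, y, z, and assumes that the nitrogen hydrogen-bonded to a given oxygen belongs to the molecule one a translation away (N shifted by -1 in x). The function name is hypothetical.

```python
import math

# 66 polyamide, alpha form: a, b, c in Å; alpha, beta, gamma in degrees.
CELL_66 = (4.9, 5.4, 17.2, 48.5, 77.0, 63.5)

# Fractional coordinates of the amide-group atoms of the reference molecule.
C = (-0.088, 0.010, 0.186)
O = (-0.371, 0.010, 0.201)
N = (0.058, -0.019, 0.253)
# Assumption: the hydrogen-bonded neighbour is the molecule displaced by -a.
N_NEIGHBOUR = (N[0] - 1.0, N[1], N[2])

VDW_SUM_N_O = 1.8 + 1.35  # Å, sum of the radii quoted in the text


def distance(cell, p, q):
    """Distance in Å between fractional positions p and q in a triclinic cell."""
    a, b, c, alpha, beta, gamma = cell
    ca, cb, cg = (math.cos(math.radians(x)) for x in (alpha, beta, gamma))
    dx, dy, dz = (q[i] - p[i] for i in range(3))
    d_sq = (
        (a * dx) ** 2 + (b * dy) ** 2 + (c * dz) ** 2
        + 2 * a * b * cg * dx * dy
        + 2 * a * c * cb * dx * dz
        + 2 * b * c * ca * dy * dz
    )
    return math.sqrt(d_sq)


d_co = distance(CELL_66, C, O)
d_cn = distance(CELL_66, C, N)
d_on = distance(CELL_66, O, N_NEIGHBOUR)

print(f"C-O = {d_co:.2f} Å, C-N = {d_cn:.2f} Å, O...N = {d_on:.2f} Å")
print(f"O...N is shorter than the Van der Waals sum by {VDW_SUM_N_O - d_on:.2f} Å")
```

With these inputs the C-O and O...N values come out close to the 1.35 Å and 2.8 Å quoted, and the C-N value falls near, though slightly below, the figure of about 1.40 Å; differences of a few hundredths of an ångström are within the rounding of the tabulated parameters.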

Apart from the hydrogen bond distance between N and O atoms, all the other 
distances between atoms of neighbouring molecules are the normal ones 
characteristic of the Van der Waals forces. The hydrogen bonds link molecules together 
in sheets in the 010 plane; but the links in other directions are much weaker, and the 
whole structure is thus of a pronouncedly layer-like type. It is this feature which 
is responsible for the plane orientation in rolled or pressed sheets: the sheets of 
molecules linked together by hydrogen bonds are found parallel to the plane of the 
sheet of material. In a more general way, it is likely that the high tensile strength 
of nylon fibres, which enhances their practical value, owes something to the hydrogen 
bonds: when a fibre is broken it is likely that individual chains are not broken, but 
that crystals are pulled apart by shear (molecules slipping past each other parallel 
to their lengths), and this would be the more difficult, the stronger the forces holding 
the molecules side by side. 

The other features of the crystal structure we have to try to understand are: 
(1) the orientation of the plane of the zigzag about 20° to the 010 plane, and (2) the 
mode of packing of the layers. 

(1) The relative positions of any two neighbouring molecules in one layer (e.g. 1 
and 2 in figure 13) are settled primarily by the hydrogen bonds, which keep the 
oxygen atom of one molecule on the same level as the nitrogen of the next, and fix 
the ac angle of the cell at 77°. But this has the effect of placing the CH2 groups 
awkwardly. 

These groups do not interlock; the projections of one chain are on the same level 
as the projections of the next. This is presumably the reason (or one of the reasons) 
why the zigzag chains turn away from the 010 plane; it is an attempt to relieve the 
packing situation. Possibly the positions of the hydrogen nuclei have something to 
do with the actual angle assumed. The principles of packing of hydrocarbon 
molecules are not entirely clear, but it may be that the CH2 group should not be regarded 
as an unresolved packing unit (as the electron density distribution might suggest), 
but that the positions of hydrogen nuclei are important. Mack (1932) has suggested 
that in several hydrocarbons the packing is determined by this sort of interlocking 
of hydrogens, while the arrangement in polyethylene (Bunn 1939) may be regarded 
in the same way. Examination of a model of the present polyamide structure 
suggests that here also the arrangement of hydrogen nuclei may play a part in 
determining the orientation of the chains. 

(2) As mentioned above, any one layer is displaced along the c axis by 3.55 Å with 
regard to its neighbouring layer. Why the distance 3.55 Å? 

In the first place, the layers would not be expected to be on the same level, 
because such an arrangement would not interlock; this is true, whether a CH2 
group is regarded as an unresolved packing unit or whether one takes into account 
Mack’s principle of hydrogen interlocking. The side-by-side positions of the layers 
are, of course, also concerned in the packing; but here there is no alternative to the 
arrangement in figure 12, where molecule 4 fits into the hollow between 1 and 2. 
The available levels consistent with good packing* are those with displacements of 
about 

and so on. Obviously the first-mentioned position would not be favoured, because 
the polarized groups would be too close: arrangements of this sort 

    + - + - 
    H-N-C=O,     H-N-C=O 

are not likely to be stable. In the second position, with a displacement of about 
3.5 Å (that actually found in the crystals), these polarized groups would be sufficiently 
far apart to have little interaction, but it is difficult to see why this second position 
should be any better than the third (to say nothing of other positions); any 
explanation must be valid for both 66 and 6.10 polymers, and so far no satisfactory 
explanation has been found. Presumably this will become clearer as knowledge 
of similar structures accumulates. 

Optical properties. In the triclinic system, the three principal vibration directions 
(the three axes of the indicatrix) are not necessarily related in any simple way to the 
axes of the unit cell. To measure the three principal refractive indices associated 
with these vibration directions, it would be necessary to have single crystals. 
These are not available; the nearest approach is a doubly-oriented strip in which 
there are four different orientations of the unit cell. However, the structure is such 
that the refractive indices for the vibration directions defined in the doubly- 
oriented strip are likely to be near the true principal values, since the principal 
axes of the indicatrix are likely to lie near the axes of the doubly-oriented strip. 
The three indices of the strip will be called α′, β′ and γ′; α′ is the index for light 
vibrating perpendicular to the plane of the strip (perpendicular to the molecular 
sheets), β′ the index for light vibrating in the plane of the strip and at right angles 
to the molecular axes (that is, along the C=O bonds), and γ′ the index for light 
vibrating along the molecules (see figure 5). 

* It is impossible to illustrate these points by diagrams. The remarks made here are based 
on a detailed examination of a scale model. 

For the 6.10 polymer, β′ and γ′ were measured on the Abbe refractometer; the 
strip, suitably oriented, was put in optical contact with the prism by a film of 
bromonaphthalene. The remaining index α′ was obtained by making use of the 
properties of a fibre, which gives γ′ for light vibrating along the fibre and the average 
of α′ and β′ for light vibrating across the fibre (on account of the random orientation 
of the crystals in this plane). From the value of ½(α′ + β′) thus measured (1.500; 
this was done under the microscope by the immersion method), α′ was calculated. 
The three indices are α′ = 1.475, β′ = 1.525, γ′ = 1.565. 

For the 66 polymer, γ′ (= 1.580) was obtained from a doubly-oriented strip on the 
Abbe refractometer, and also from a fibre under the microscope. ½(α′ + β′) (= 1.520) 
was also obtained from the fibre (as for the 6.10 polymer). To calculate α′ and β′ 
separately, use was made of the interference figure produced in convergent polarized 
light. A doubly-oriented strip viewed normally gives a figure very similar to that of 
a biaxial single crystal, except that there are no dark hyperbolae passing through 
the 'eyes' (figure 20b, plate 2). Ignoring for a moment this abnormality, assume that 
the distance between the 'eyes' is a measure of the optic axial angle. By comparison 
with crystals of known optic axial angle, it was found that the angle 2V for the 
66 polymer is about -45°. From the approximate relation cos² V = (β′ - α′)/(γ′ - α′), 
and from the already known values of γ′ and ½(α′ + β′), the two indices α′ and β′ 
were calculated. The three indices are α′ = 1.475, β′ = 1.565, γ′ = 1.580. 
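
The separation of α′ and β′ amounts to solving two simultaneous relations: the measured mean ½(α′ + β′) and the approximate expression for cos² V. The sketch below is one way of carrying this out, with hypothetical names; it treats the quoted 2V of about -45° as a magnitude of 45° (the sign being taken here to denote the optical character, which is an assumption). The last line repeats the simpler 6.10 case, where β′ was measured directly.

```python
import math


def split_indices(mean_ab, gamma_p, two_v_deg):
    """Solve cos^2 V = (beta' - alpha')/(gamma' - alpha') with (alpha' + beta')/2 = mean_ab.

    Writing alpha' = mean - delta and beta' = mean + delta gives
    2*delta = k*(gamma' - mean + delta), where k = cos^2 V.
    """
    k = math.cos(math.radians(two_v_deg / 2.0)) ** 2
    delta = k * (gamma_p - mean_ab) / (2.0 - k)
    return mean_ab - delta, mean_ab + delta


# 66 polymer: mean index 1.520, gamma' = 1.580, |2V| about 45 degrees.
alpha_66, beta_66 = split_indices(1.520, 1.580, 45.0)
print(f"66 polymer: alpha' = {alpha_66:.3f}, beta' = {beta_66:.3f}")

# 6.10 polymer: beta' measured directly, so alpha' follows from the mean alone.
alpha_610 = 2 * 1.500 - 1.525
print(f"6.10 polymer: alpha' = {alpha_610:.3f}")
```

Rounded to three decimals, the results agree with the indices quoted for both polymers.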

The abnormality of the interference figure (the absence of dark hyperbolae 
through the 'eyes') is probably due to a variation of the optic axial angle in different 
small regions of the field of view or in layers at different distances from the surface of 
the sheet. Such a variation of the optic axial angle would be expected owing to the 
occurrence of different degrees of perfection of orientation of the crystallites; this 
would not affect the lemniscates to any great extent, but would remove the 
hyperbolae. If this explanation is correct, the positions of the 'eyes' would give a reliable 
though approximate indication of the average optic axial angle. 

Some doubly-oriented sheets showed, in some regions, much more abnormal 
effects; one example is illustrated in figure 20c, plate 2. It is possible that such effects 
are due to the presence of a small proportion of crystals having a different orientation 
from the majority, depending on the direction of flow in different parts of the 
specimen during pressing or rolling. Experiments with mica sheets showed that 
such effects can be simulated by superimposing a very thin sheet on a thicker sheet, 
the optic axial planes being at right angles. It requires very little of a secondary 
reversed orientation to produce these effects. 

The interference figure given by 6.10 specimens (figure 20a, plate 2) shows that the 
optic axial angle is greater than that of 66; this is in accordance with the measured 
values of the refractive indices, which give 2V = 90°. 
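
The comparison of the two optic axial angles can be reproduced from the quoted indices with the approximate relation cos² V = (β′ - α′)/(γ′ - α′). The sketch below is only a check on the rounded figures, and the function name is hypothetical.

```python
import math


def optic_axial_angle(alpha_p, beta_p, gamma_p):
    """2V in degrees from cos^2 V = (beta' - alpha')/(gamma' - alpha')."""
    cos_sq = (beta_p - alpha_p) / (gamma_p - alpha_p)
    return 2.0 * math.degrees(math.acos(math.sqrt(cos_sq)))


two_v_66 = optic_axial_angle(1.475, 1.565, 1.580)
two_v_610 = optic_axial_angle(1.475, 1.525, 1.565)
print(f"2V (66) = {two_v_66:.1f} deg, 2V (6.10) = {two_v_610:.1f} deg")
```

The 6.10 value is much the larger of the two, as the interference figures indicate. With indices rounded to the third decimal the relation gives a 6.10 angle somewhat below 90°; a change of 0.005 in β′ is enough to account for the difference, so the figure of 90° is best read as approximate.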

The maximum refractive index is, as would be expected, for light vibrating along 
the molecules; this is a consequence of their chain character. Of the other two 
indices, the greater is for light vibrating in the plane of the strip, that is, along the 
C=O bonds; this is in accordance with expectation, since double-bonded groups 
have a higher index for light vibrating along the bond than for light vibrating across 
the bond. And it is noteworthy that β′ is higher for the 66 than for the 6.10 polymer; 
this is due to the higher proportion of C=O groups in the former. Finally, α′ is the 
same for both polymers; this is not surprising, since the vibration direction for α′ 
is across the double bonds, which therefore would not affect the value of the index. 

Thus the optical properties of polyamide specimens are thoroughly consistent 
with the structure deduced by X-ray methods. (The optical properties of the β
crystals in the specimens would be expected to be very similar to those of α crystals.)

Texture of nylon specimens. Unoriented nylon is opaque, except in very thin sheets;
in this it resembles polyethylene, guttapercha and other crystalline polymers. 
The optical discontinuities responsible for the opacity are not boundaries of single 
crystals, for the crystals are too small to be seen microscopically and, judging from 
the diffuse character of the X-ray reflexions, are probably much smaller than 
1000 Å in diameter, that is, very much smaller than the wave-length of visible
light. The optical discontinuities are the boundaries of spherulitic aggregates of 
crystals. The spherulitic structure can be seen clearly in thin films: when examined 
between crossed Nicols, the black crosses indicative of a radial arrangement can be 
seen. The polyamides thus form additional examples of high polymers having a 
spherulitic structure, and this strengthens the impression (see Bunn & Alcock 1945) 
that this may be the normal manner of growth of crystalline high polymers generally. 

In spherulites of polyethylene and rubber, the optical properties show that the 
molecules are tangentially disposed; and this is consistent with the view that crystals 
grow outward from a point by side-by-side accretion of portions of molecules. In 
nylon spherulites, the use of the quartz wedge shows that for the 66 polymer the
refractive index for the radial vibration direction is higher than for the tangential
vibration direction; this is opposite to the state of affairs in polyethylene and rubber,
and might seem to suggest that in nylon 66 the molecules are radially disposed. But
this is not necessarily so; the optical properties are equally consistent with the view
that the crystal direction which lies along the radius of a spherulite is the a axis or
the nearby hydrogen bond direction. The refractive index for this direction (β')
is, for the 66 polymer, nearly as high as γ', and a spherulite having this structure
would have β' as its radial refractive index and ½(α' + γ') (which is less than β') as
its tangential refractive index. (Along any one radius there are crystals having all
orientations obtainable by rotating about the β' direction.) Thus the optical properties
are consistent with the radial direction being either the c axis (the molecular
axis) or the hydrogen bond direction. The latter seems more likely to be correct,
since crystal growth in polymers seems to be a matter of side-by-side accretion of 
portions of molecules; and one would expect the formation of sheets by growth 
along the hydrogen bond directions. 
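
The optical argument above can be followed with a short numerical sketch. The three principal indices used here are illustrative assumptions, chosen only so that γ' > β' > α' with β' close to γ'; they are not quoted from the text, and the function name is hypothetical. The sketch assumes, as the text does, that crystals along one radius take all orientations about the radial direction, so that the tangential index is the mean of the other two principal indices.

```python
def spherulite_indices(alpha, beta, gamma, radial_axis):
    """Return (radial, tangential) refractive indices of a spherulite.

    radial_axis names the principal vibration direction lying along the
    radius. The tangential index is taken as the mean of the other two
    principal indices (crystals rotated at random about the radius).
    """
    indices = {"alpha": alpha, "beta": beta, "gamma": gamma}
    radial = indices[radial_axis]
    others = [value for name, value in indices.items() if name != radial_axis]
    tangential = sum(others) / 2.0
    return radial, tangential


# Assumed, illustrative principal indices (gamma' > beta' > alpha').
ALPHA, BETA, GAMMA = 1.475, 1.565, 1.580

for axis in ("gamma", "beta", "alpha"):
    radial, tangential = spherulite_indices(ALPHA, BETA, GAMMA, axis)
    print(f"{axis} radial: radial index higher than tangential? {radial > tangential}")
```

With indices of this kind both the γ' (chain) direction and the β' (hydrogen bond) direction give a radial index above the tangential one, which is why the quartz wedge observation alone cannot decide between them.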

It would be expected that individual crystalline regions would be larger along 
the a axis than along the b axis. In X-ray diffraction photographs of fibres there is 
some evidence which is consistent with this: reflexions from planes like 010, 120 are
rather more diffuse than those from 100; this is consistent with a smaller dimension
along b than along a, though it does not prove it, since there are other possible 
causes of the broadening of reflexions. 
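
The link between diffuse reflexions and small crystal dimensions can be illustrated with the Scherrer relation, L = Kλ/(β cos θ). This relation is not named in the text and is brought in here only as a sketch; the shape constant K = 0.9, the wavelength 1.5418 Å (copper Kα) and the breadths in the example are assumptions, and instrumental broadening is taken as already removed.

```python
import math


def scherrer_size(breadth_deg, two_theta_deg, wavelength=1.5418, k=0.9):
    """Crystallite dimension in angstroms from L = K * lambda / (beta * cos(theta)).

    breadth_deg   : breadth of the reflexion in degrees of 2-theta (assumed
                    free of instrumental broadening)
    two_theta_deg : position of the reflexion in degrees of 2-theta
    """
    beta = math.radians(breadth_deg)
    theta = math.radians(two_theta_deg / 2.0)
    return k * wavelength / (beta * math.cos(theta))


# Hypothetical breadths: a sharper and a more diffuse reflexion at the same angle.
for breadth in (1.0, 2.0):
    print(f"breadth {breadth} deg -> about {scherrer_size(breadth, 20.0):.0f} A")
```

A reflexion twice as broad corresponds to a dimension half as large in the direction normal to the reflecting planes, which is the sense of the comparison between 010 and 100 above; as the text notes, other causes of broadening are not excluded.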




When unoriented specimens are drawn into fibres, the crystals become parallel 
to the fibre axis. Fankuchen & Mark (1944) have taken X-ray photographs at various
points on the shoulder between drawn and undrawn portions of a fibre, and have 
found evidence that the first thing that happens is that a particular plane becomes 
oriented parallel to the fibre axis; later, this ‘ribbon’ orientation is succeeded by the 
final chain orientation. The evidence is that, of the two strong rings which are the 
outstanding features of X-ray photographs of unoriented material, the outer one 
becomes contracted to equatorial arcs at an earlier stage than the inner one. The 
present work has shown that the outer one consists of reflexions from 010 and 110 
planes, the former being the stronger, while the inner one is 100. Evidently the
010 plane becomes oriented first; this is natural enough, since this is the plane in 
which molecules are held together strongly by hydrogen bonds. A specimen may be 
regarded crudely as a number of flat plates tied together by strings; on drawing, the 
first thing that happens is that the plates tend to become parallel to the direction of 
drawing. 

Crystallographic relations between polyamides and the fibrous proteins. The crystal 
structures of the fibrous proteins such as silk fibroin and keratin (hair) are not 
known in detail, but some of the principal spacings and other features are known, 
and the relations between these and the corresponding features in polyamide 
crystals are worth noting. 

In the first place, the fibrous proteins have a side-spacing of about 4.5 Å which is
remarkably constant from one protein to another (Astbury 1940); this is what
Astbury calls the ‘backbone spacing’, the distance which shows the spacing of the
main chains in one particular crystal direction. The other main side-spacing, in a
direction roughly at right angles to the backbone-spacing, varies from one protein
to another (in β-keratin, for instance, it is 9.8 Å, in silk fibroin 4.3 Å), and is apparently
due to the side-chains, the size of which varies in different proteins. It is
considered by some writers that the constancy of the spacing of 4.5 Å is an indication
that the chains are packed in a similar way in the various proteins, and that hydrogen
bonds are responsible for the type of packing:


[Diagram: neighbouring polypeptide chains (-NH-CHR-CO-) lying side by side and linked by N-H···O=C hydrogen bonds, the chains being about 4½ Å apart.]


The two side-spacings in 66 and 6.10 polyamide crystals are 4.4 and 3.7 Å. The
former of these, 4.4 Å, is the spacing of the 100 planes of the crystals and it has been
shown that the 100 planes are bridged by CO···HN hydrogen bonds. And this is
not all: in doubly-oriented strips of polyamides made by pressing or rolling, it has
been found that the sheets of molecules joined by hydrogen bonds lie in the plane 
of the strip. Astbury & Sisson (1935) have made doubly-oriented keratin strips by
compressing pieces of horn in an atmosphere of steam, and find that the crystal
plane having the spacing 4.5 Å is normal to the plane of the strip; in other words, if
hydrogen bonds bridge the 4.5 Å plane, the hydrogen bonds lie in the plane of the strip.
(The side-chains are normal to the plane of the strip.) The correspondence is complete. 
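
The spacings compared above can be turned into diffraction angles with Bragg's law, nλ = 2d sin θ. The sketch below is an illustration only: it assumes copper Kα radiation with a mean wavelength of 1.5418 Å, and the function name is hypothetical.

```python
import math

CU_K_ALPHA = 1.5418  # angstroms; assumed mean copper K-alpha wavelength


def bragg_two_theta(d_spacing, wavelength=CU_K_ALPHA, order=1):
    """Return the diffraction angle 2-theta in degrees from n*lambda = 2*d*sin(theta)."""
    s = order * wavelength / (2.0 * d_spacing)
    if not 0.0 < s <= 1.0:
        raise ValueError("no reflexion is possible for this spacing and wavelength")
    return 2.0 * math.degrees(math.asin(s))


# Side-spacings quoted in the text: 4.4 and 3.7 A (polyamides), 4.5 A (proteins).
for d in (4.4, 3.7, 4.5):
    print(f"d = {d} A -> 2-theta = {bragg_two_theta(d):.2f} degrees")
```

The 4.4 Å polyamide spacing and the 4.5 Å protein spacing fall within about half a degree of one another in 2θ for this wavelength, so the corresponding reflexions occupy nearly the same position on the photographs.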

The orientation of crystals in pressed sheets is presumably determined by the
relative dimensions of the crystals and by their cleavages. Polyamide crystals are
probably larger along the a axis than at right angles to it, and on pressing a specimen,
the larger dimension of a crystal would be expected to set itself in the plane of the
sheet; in addition, cleavage would occur most readily along 010, making the crystals
still thinner in the direction normal to the a axis, and thus the tendency for the a
axis to lie in the plane of the sheet would be increased. Keratin crystals are likewise
probably larger in the direction thought to contain the hydrogen bonds than in the
other lateral direction (in this case the side-chain direction) (see Astbury & Sisson
1935); cleavage is less likely to occur in keratin, because some of the side-chains
are thought to join main-chains by primary bonds.

Another correspondence is revealed when the whole X-ray pattern of keratin is
compared with those of the polyamides. Astbury & Street (1931) remark that all
but two of the spots in the β-keratin photograph belong to a single zone: the zone
of planes perpendicular to the plane in which the hydrogen bonds are thought to
lie. The photographs of polyamides are similar in this respect; planes perpendicular
to the hydrogen bond plane give relatively strong reflexions, while those making
small angles with the hydrogen bond plane give abnormally weak reflexions. The explanation
suggested by Astbury & Street for the non-appearance of the theoretical
number of reflexions in the case of keratin is that it is due to imperfect or variable
intermolecular junctions in the side-chain direction, variable owing to the varying
sizes of side-groups at different points along the main chains. This does seem a
reasonable explanation; but the fact that similar features are shown by the polyamides,
in which there are no side-chains, suggests that there may be an alternative
explanation not involving side-groups; and we suggest that the explanation we have
put forward for the polyamides (anisotropic thermal vibrations and crystal distortions)
may also account, at any rate partly, for the phenomenon as it occurs in
β-keratin.

Yet another correspondence is found in the layer-line streaks which occur on
X-ray photographs of both steam-stretched keratin and the polyamides. In the
case of the polyamides, such streaks, since they occur only on upper and lower layer
lines and not on the equator of the fibre photograph, cannot be due to variation of
side-spacings, and we have suggested that they are due to indiscriminate displacements
of successive layers of molecules along the fibre axis. In the case of steam-stretched
keratin, Astbury & Woods (1933) suggest that the streaks, which are
here found on the equator also, are due to the effects of a hydrolytic breakdown
among the side-chain linkages. Now this hydrolytic breakdown may have two
different crystallographic effects; one is to modify the side-spacings, the other is to
break the covalent disulphide cross-links, and thus to make possible some slipping
of hydrogen-bonded layers past one another. This last effect would give rise to the
same sort of partial disorder as that which apparently exists in nylon.

All these very striking correspondences between the X-ray patterns of the polyamides
and the fibrous proteins suggest that the crystals are built on the same
general plan. This plan is now known for two polyamides, and it is encouraging to
note that the current ideas on the interpretation of protein X-ray patterns do correspond
in a general way with what is now known of the polyamides: the conception
of protein molecules joined in sheets by hydrogen bonds corresponds with the 
known structure of the polyamides. The expectation of finding some general 
structural features common to the polyamides and the fibrous proteins has thus been 
justified; and, moreover, the interpretation of certain details in the X-ray patterns 
of the polyamides has led to suggestions of corresponding interpretations for the 
fibrous proteins. 

Appendix 


Intensities of X-ray reflexions 


The three columns give: indices, calculated intensity, observed intensity. The
figures given for calculated intensities are proportional to the square of the structure
amplitude F multiplied by Lorentz and polarization factors, and by a temperature
factor $\exp[-B\sin^2\theta/\lambda^2]$, where B = 5, this being the figure found appropriate for
the hk0 reflexions of polyethylene (Bunn 1939).
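
The way the calculated figures are built up can be sketched as follows. Several things in this sketch are assumptions rather than statements from the text: B is taken to be in Å² and λ as 1.5418 Å (copper Kα); the Lorentz-polarization factor is the common powder form (1 + cos²2θ)/(sin²θ cos θ), used only as a stand-in because the form appropriate to these photographs is not given here; and the structure amplitude passed in is a hypothetical number.

```python
import math


def temperature_factor(theta_deg, wavelength=1.5418, b=5.0):
    """exp(-B sin^2(theta) / lambda^2), with B assumed in A^2 and lambda in A."""
    s = math.sin(math.radians(theta_deg))
    return math.exp(-b * s * s / wavelength ** 2)


def lp_factor(theta_deg):
    """Assumed Lorentz-polarization factor, (1 + cos^2 2theta) / (sin^2 theta cos theta)."""
    t = math.radians(theta_deg)
    return (1.0 + math.cos(2.0 * t) ** 2) / (math.sin(t) ** 2 * math.cos(t))


def relative_intensity(f_amplitude, theta_deg, wavelength=1.5418, b=5.0):
    """Quantity proportional to F^2 x (Lorentz-polarization) x (temperature factor)."""
    return (f_amplitude ** 2
            * lp_factor(theta_deg)
            * temperature_factor(theta_deg, wavelength, b))


# Hypothetical structure amplitude at two Bragg angles: the same F gives a
# smaller calculated intensity at the higher angle.
for theta in (10.0, 20.0):
    print(theta, round(relative_intensity(10.0, theta), 1))
```

Only ratios of such figures are meaningful, since the calculated intensities are stated to be proportional to this product rather than equal to it.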


66 polyamide, α form


100 

762 

vs 

001 

9 

w 

002 

45 

m 

013 

5 

_ 

014 

2 

_ 

010 

12121 

- ws 

oil 

0^ 


012 

6 

— 

003 

10 

vw 

114 

18 

w 

110 

704J 

loi 

25 

^ m 

112 

15 

w 

113 

1 

_ 

004 

2 

- - 

IlO 

641 

1- a 

111 

iJ 


i02 

6 

— 

I03 

27 

w 

104 

3 

- 

210 

00 

1 

101 

6 

— 

102 

2 

— 

103 

0 

_ 

124 

13 

__ 

200 

23 

w 

III 

91 

ww 

I 12 

4 

— 

Il3 

9 

ww 

114 

1 

. -- 

120 

119 

w 

oil 

IJ 

il2 

1 

— 

123 

0 

- 

104 

1 


020 

1791 

ms 

Ill 

5 

— 

0i2 

8 

— 

213 

1 

_ 

024 

0 

_ 

220 

64J 

211 

5 

— 

212 

4> 

vw 

023 

2 

_ 

214 

9 

vw 

310 

2l 

ww 

121 

0 

— 

122 

7/ 

113 

10 

ww 

224 

5 

_ 

210 

oJ 

201 

0 

— 

022 

0 

— 

0l3 

2 

■ 

II4 

1 


120 



m 

1 

— 

202 

2 

— 

223 

2 

_ 

204 

1 


320 


ww 

111 

2 

— 

222 

1 

_ 

203 

0 

_ 

0l4 

1 


300 

iJ 


201 

1 

— 

212 

2 

— 

203 

0 

_ 

I24 

0 


130 

14l 

i 

f ww 

021 

0 

— 

202 

2 

— 

213 

0 

_ 

204 

4 


230 

24 J 

221 

4 

— 

. 112 

0 

— 

113 

0 

__ 

214 

0 

. 

030 

14 

— 

m 

1 

— 

212 

0 

— 

123 

0 

_ 

134 

4 


330 


1 

05l 

0 

— 

122 

0 

_ 

213 

0 

_ 

214 

0 

_ 

510 


V vvw 

211 

2 

— 

122 

0 

_ 

133 

0] 

1 

234 

0 

_ 

410 

9 J 

1 

221 

0 

— 

312 

0 

— 

323 

ol 

r ww 

324 

1 

_ 




311 

1 

— 

322 

0 

— 

313 

1 1 

ll4 

0 

_ 




I21 

0 

— 

222 

1 

— 

233 

ij 

1 

314 

1 

_ 




5Il 

0 

— 

022 

1 

— 




034 

0 








132 

2 

— 


















66 polyamide, α form (cont.)






015 large* 

m 

116 

3 

— 

017 

large’' 

s 

117 

ll 


118 

0 

_ 

115 

2 

ww 

126 

11 

— 

127 

large** 


017 

2 


228 


w 

125 

4 

— 

026 

0 

— 

117 

131\ 


207 

14 


Il8 

s] 

005 

7 

ww 

116 

7 

vw 

027 

105j 


137 

15 

^ vw 

138 

2 

— 

115 



006 

0 

— 

Il7 

601 

ms 

247 

1 

238 

2 ) 

vw 

025 

si 

vw 

226 

3\ 

vw 

227 

55 J 

227 

17 


128 

9J 

I05 

eJ 


I06 

5J 

007 

0 

— 

317 

15 


008 

1 

— 

225 

0 

_ 

216 

4 

vvw? 

137 

0 

— 

147 

U 


038 

1 

— 

105 

0 

— 

106 

0 

— 

i07 

131 


217 

7 

— 

I08 

2 

— 

215 

3 

— 

126 

3^ 


127 

43 

- m 

347 

4 

— 

218 

0 

— 

125 

1 

_: 

136 

si 

— 

217 

41 

507 

19^ 


108 

0 

— 

205 

0 

— 

236 

iJ 


237 

12 J 


117 

6 


218 

0 

— 

135 

1 

_ 

206 

0 

— 

107 

35l 


047 

12 

vw 

328 

1 

— 

II5 

1 

— 

036 

0 

— 

037 

281 

w 

517 

15 


338 

1 

— 

235 

0 

— 

316 

0 

— 

217 

261 

1 

437 

17' 





015 

3 

— 

326 

0 

— 

207 

21 

p mw 

427 

13 

— 




215 

0 

—1 

il6 

2 

— 

327 

26 J 








205 

3 

— 

0 l6 

0 

— 

337 

20 

— 







325 

4 

— 

206 












035 

1 

— 

336 

1 / 











215 

0 

— 













315 

4 

— 
















* Reflexions on meridian; cannot be calculated precisely.









6.10 polyamide, α form






100 

1330 

VS 

001 

17 

mw 

002 

49 

s 

003 

10 

vw 

014 

101 


010 

18501 

vvs 

loi 



012 

5 

ww 

013 

7' 


114 

llj 

110 

1240j 

101 

41 

mw 

112 

1 

— 

113 

13. 

i04 

14 

w 

210 

90'1 

s 

oil 

1 | 

I02 

14 

w 

103 

3 


104 

0 


IlO 

921 

111 

ej 


102 

2 

— 

103 

4 

— • 

114 

7 

vvw 

200 

33 

m 

III 

6 

vvw 

II2 

.. 2 

— 

113 

1 

— 

124 

0 

— 

120 

166 

m 

Oil 

0 

— 

012 

4 

vvw 




214 

1 


020 

1061 

J- ms 

ill 

0 

— 

Il2 

3 

VVW' 

013 

1 

] 

II4 

31 

1 

220 

59 J 

211 

5 

ww 

212 

1 

— 

213 

4 

h VVW 

024 

0 . 

\ - 

310 

11 

f- vw 

3ii 

2 

— 

122 

0 

■—^ 

123 

2 , 

J 

0l4 

V 

1 . 

^1,0 

oj 

Ill 



202 

0 

— 

023 

1 

_ 

224 

11 . 

( 

120 


1 

201 

4 

— 

212 

0 

— - 

203 

- 2 

1 




300 

2 ' 

y .vw 

121 

ij 


022 

1 

1 

223 

1 





320 

3j 

1 

201 

2 

-r- 

202 

0 

1 

213 

1 





230 

191 

1 

021 

0 

— 

112 

0 

( “ 

203 

2 

_ 




130 

22 J 

r 

221 

1 

— 

222 

1 

] 







030 

20 ' 














330. 

8 














310 

15 

- ww 













410 

17 














015 

16 

?* 

016 

5 


117 

4 


118 

6 





115 

ir 

1 . w 

116 

1 

— 

127 

21 

128 

4 

_ 




005 

7 

1 

006 

2 

— 

007 

. 3 


028 

2 





I05 

0 

— 

126 

1 

— 

Il7 

3 


118 

24 

W 




125 

3' 

\ 

I06 

91 

w 

027 

1 

_ 

008 

1 

_ 




Il5 

0 

- 

Il6 

8 | 

I07 

1 

_ 

i08 

7 

■ 




025 

0 . 

} 

026 

0 

— 

227 

1 

_ 

228 

1 





215 

3' 

1 _ 

106 

0 

— 

107 

1 

_ 

218 

0 





225 

1. 

f 

226 

2 

— 




108 

4 





Ii5 

0 

— 

216 

1 

— 













* Obscured by diffuse ring due to amorphous material.







6.10 polyamide, α form (cont.)


129 

52 

9 

§09 

241 

. vvw 

0.2.10 

0 

— 

119 

1451 


339 

18J 

1.1.10 

9 

— 

029 

74 J 


119 

21 


2.2.10 

0 

— 

119 

75l 

1 

p ms 

0I9 

3 


1.1.10 

16 

vvw 

229 

56j 

209 

28 

" vw 

0.0.10 

0 

_ 

009 

0 

— 

319 

24 

1,0.10 

2 

— 

109 

18] 


i39 

24 


2.1.10 

0 

— 

129 

46 ' 

- ms 

249 

si 





239 

38 

149 

2 

— 




219 

59. 


§19 

101 

j- ww 




139 

0 

— 

349 

9J 




109 

38 

vvw 

li9 

19^ 





§19 

301 

vw 

S09 

241 





329 

31J 

429 

20 

vw 







439 

24 








049 

18J 






66 polyamide, β form

Indices   Calc.   Obs.
001       33      s
002       28      s
003        8      w
004       25      m?*
005       53      s
006        0      -


6.10 polyamide, β form

Indices   Calc.   Obs.
001        0      -
002       49      s
003        1      -
004        7      w
005       12      ?*
006        8      m
007       10      m
008        0      -

* Obscured by diffuse ring due to amorphous material.


References 

Albrecht & Corey 1939 J. Amer. Chem. Soc. 61, 1087.

Astbury 1940 Trans. Faraday Soc. 36, 871.

Astbury & Sisson 1935 Proc. Roy. Soc. A, 150, 533.

Astbury & Street 1931 Phil. Trans. A, 230, 75.

Astbury & Woods 1933 Phil. Trans. A, 232, 333.

Bernal 1926 Proc. Roy. Soc. A, 113, 117.

Bernal & Crowfoot 1933 Trans. Faraday Soc. 29, 1032.

Bunn 1939 Trans. Faraday Soc. 35, 482.

Bunn 1942 Proc. Roy. Soc. A, 180, 40.

Bunn & Alcock 1945 Trans. Faraday Soc. 41, 317.

Carothers 1940 Collected papers on high polymeric substances.
New York: Interscience Publishers.

Corey 1938 J. Amer. Chem. Soc. 60, 1598.

Cox & Goodwin 1936 J. Chem. Soc. p. 769.

Cox & Shaw 1930 Proc. Roy. Soc. A, 127, 71.

Fankuchen & Mark 1944 J. Appl. Phys. 15, 364.

Fuller 1940 Chem. Rev. 26, 143.

Fuller, Baker & Pape 1940 J. Amer. Chem. Soc. 62, 3275.

Fuller & Frosch 1939 J. Amer. Chem. Soc. 61, 2575.

Levy & Corey 1941 J. Amer. Chem. Soc. 63, 2095.

Lipson & Riley 1943 Nature, 151, 250, 502.

Mack 1932 J. Amer. Chem. Soc. 54, 2141.

Muller 1933 Helv. Chim. Acta, 16, 155.

Schoon 1938 Z. Phys. Chem. 39B, 385.

Senti & Barker 1940 J. Amer. Chem. Soc. 62, 2008.

Warren & Bragg 1930 Z. Kristallogr. 76, 201.




Description of Plates 1 and 2
Plate 1

Figures 6 to 11. X-ray diffraction photographs (copper Kα radiation).

Figure 6. Drawn fibre.

Figure 7. Doubly-oriented strip, c axis parallel to X-ray beam. Plane of strip vertical.

Figure 8. Doubly-oriented strip, c axis vertical, plane 75° to X-ray beam. Note that many
of the reflexions of figure 1 are missing.

Figure 9. Doubly-oriented strip, c axis vertical, plane parallel to X-ray beam. Taken on
cylindrical film, axis horizontal.

Figure 10. Drawn fibre containing only α form.

Figure 11. Ordinary drawn fibre. Monochromatic radiation. Flat film. Note absence of
equatorial streaks.


Plate 2

Figure 18. X-ray photograph of drawn fibre of 66 nylon, horizontal, oscillated 90° to 45°
to X-ray beam.

Figure 19. X-ray photograph of quenched drawn fibre of 6.10 nylon.

Figure 20. Conoscopic interference figures: a, 6.10 strip; b and c, different regions of a 66
strip.









Bunn and Garner    Proc. Roy. Soc. A, volume 189, plate 2





Limitations of the Newtonian time scale in relation to 
non-equilibrium rheological states and a theory of 
quasi-properties 

By G. W. S. Blair, D.Sc. and B. C. Veinoglou, Ph.D.

In collaboration with J. E. Caffyn, B.Sc.

National Institute for Research in Dairying, University of Reading
(Communicated by E. N. da C. Andrade, F.R.S. Received 30 May 1946)

The behaviour of complex materials under stress is described in terms of entities which are
not strictly ‘physical properties’. These so-called ‘quasi-properties’ range from entities 
hardly distinguishable from dimensionally true physical properties to concepts which are 
much less clearly defined. 

Quasi-properties measure an ordered process towards equilibrium rather than a state of
equilibrium. The Newtonian definition for equality of time intervals which leads to the 
concepts velocity, acceleration, momentum and force having whole-number dimensional 
exponents, does not apply to ‘quasi-equilibrium states’. In order to keep the Newtonian 
time scale, fractional differential equations are introduced. 

The simplest fractional differential equation relating stress, strain and time integrates to a 
series equation whose first term is a simple power law (Nutting’s equation) already known to
describe the behaviour of many complex materials under constant stress. The physical 
meaning of the fractional differential is considered. 

An apparatus is described for loading test-pieces of plastics and the like under tension or 
compression at constant stress to a preselected strain, and for following the subsequent stress 
dissipation; and the results of tests on thirty-eight materials are studied statistically. 

Introducing a second term from the series equation (and, very rarely, a third term) greatly 
improves the fit for materials for which Nutting’s equation is inadequate and explains 
hitherto unaccountable anomalies when the Nutting plot is otherwise satisfactory. Constants 
derived by the equation from constant-stress and constant-strain conditions are compared. 

The form of the series equation suggests that the relative importance of the second term 
may sometimes disclose the presence of undissipated stresses in the materials. 

The accuracy of tests on individual test-pieces is high, but, on account of frequent lack of
homogeneity in the samples available, the use of the unmodified Nutting equation is often
adequate even when the addition of a second term would significantly improve individual
curves. 

Some alternative treatments are discussed, but, both theoretically and practically, the 
fractional differential approach is preferred for most of the materials tested. 


Introduction 

It has been generally believed since the time of St Augustine (Confessions, Book xi)
that, whereas space can be measured directly by the superposition of lengths, the 
measurement of time is indirect, being made either subjectively (as St Augustine 
thought, from memory engrams) or objectively, by definition, from some periodic 
physical process such as the rotation of the earth. St Augustine believed in an absolute
time scale independent of the length of the day, and suggested that, even if the
sun should be stayed in its course, time might still be measured by the rotation of 
the potter’s wheel. 



In Newtonian physics, times are implicitly defined as of equal length where either 
light or some body free from extraneous influences traverses equal (i.e. superposable)
distances in them. It is assumed that lengths are themselves invariant. Recently,
other phenomena have been found to give the same or a simply related time scale;
e.g. Dingle (1944) equates the times taken by a specified body to radiate equal
amounts of energy. 

The rotations of the earth give only approximate agreement with the Newtonian
scale and, as Dingle (1942) points out, astronomers prefer to say that the earth is
slowing down rather than to question the accuracy of the Newtonian laws. 

Bodies are said to move at constant velocity when they move through equal 
distances in equal times; but, since equal times are already defined as above, this is no 
more than the definition of an entity, velocity, being the first differential of length 
with respect to time. Acceleration is defined as the second differential, and the 
products with mass as momentum and force respectively.

In other words, equality of time units is so defined as to produce these simple 
whole-number differential concepts and, when a body does not, in fact, travel at 
constant rectilinear velocity, i.e. changes its momentum, force, which was originally 
a psycho-physical concept, is defined as the rate of change of the momentum. 
Einstein’s relativity, while correcting the Newtonian laws for the special case of 
high velocities, does not affect the Newtonian time scale itself. Milne’s relativity is 
not so easily dismissed, since no single time scale is regarded as universal and the 
measure of time takes priority over that of length. For the intervals of time with 
which we are concerned in the testing of materials, however, Milne’s theories do not 
in any way affect our present line of argument. 

From the Newtonian scale are derived all those physical properties which so 
conveniently give universal and generally integral indices of mass, length and time. 
But it has long been realized that the Newtonian time scale, in spite of its invaluable 
contribution to physics, remains an arbitrary scale. As Poincaré (1904) points out:
‘La définition implicitement adoptée par les astronomes peut se résumer ainsi: le
temps doit être défini de telle façon que les équations de la mécanique soient aussi
simples que possible.’ (‘The definition implicitly adopted by astronomers may be
summed up thus: time must be so defined that the equations of mechanics are as
simple as possible.’) The Newtonian scale is the simplest for the study of most of
the problems of physics. 

For biological phenomena, however, du Noüy (1936) suggests an alternative
time scale based on the rate of healing of aseptic wounds and points out that,
since the units of this physiological time progressively increase in relation to
the Newtonian time units, time will seem to pass more quickly as man ages.
This raises an interesting parallel with Milne’s theory, according to which the period
of a pendulum shortens as we recede in time towards the finite ‘beginning’ of the
atomic time scale.

This linking of physical, physiological and psychological time is especially interesting
in view of recent experiments on the quantitative effects of temperature
changes on the psychological time scale in man and in bees, ants, etc. (François
1928, Piéron 1945). Hoagland (1935) claims that the relationship between rate of
counting and absolute temperature of a feverish patient is governed by a law of the
Arrhenius type.

Bergson (1889) gives a wider definition of the equality of time intervals: ‘Two
intervals of time are equal when two identical bodies, in identical conditions at the
beginning of each of these intervals and subject to the same actions and influences
of every kind, have traversed the same space at the end of these intervals.’ It will
be noticed that this definition postulates the existence of bodies whose conditions
are not permanently affected by the actions and influences during the specified time.
According to Bergson’s philosophy, living matter could never qualify.

Schrödinger (1944) describes living matter as ‘“doing something”, moving ... and
so forth, and that for a much longer period than we would expect an inanimate piece
of matter to “keep going” under similar circumstances’.

But there are many inanimate colloidal systems which behave in this way and
hence do not qualify for Bergson’s definition. In studying the behaviour of such
systems one is at liberty either to use a non-Newtonian time scale, which is inconvenient
since each system or group of systems has its own time scale; or, using
the Newtonian definition, to make use of differentials of length with respect to time
which are of fractional orders.

It will be seen that these alternatives are equivalent, since, as explained above, 
the definition of velocities and accelerations as first and second derivatives is 
effected by selecting the Newtonian definition of time equality. It is therefore 
proposed to keep to the Newtonian definition but to define two new generalized
entities: motion, having dimensions and drive, with dimensions

in which z is a fraction. 
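
What a differential of fractional order means for a simple power of the time can be illustrated numerically. The sketch below uses the Riemann-Liouville rule for power functions, d^μ(t^k)/dt^μ = Γ(k+1)/Γ(k+1-μ) · t^(k-μ); this is one common definition, and the passage above does not say which definition the authors adopt, so it should be read as an assumption. The function name is hypothetical.

```python
import math


def frac_derivative_power(k, mu, t):
    """Value at time t > 0 of the mu-th order derivative of t**k (k > -1).

    Uses Gamma(k+1) / Gamma(k+1-mu) * t**(k-mu). Where Gamma has a pole,
    1/Gamma is zero, which recovers ordinary results such as d/dt(1) = 0.
    """
    arg = k + 1.0 - mu
    if arg <= 0 and float(arg).is_integer():
        return 0.0
    return math.gamma(k + 1.0) / math.gamma(arg) * t ** (k - mu)


t = 2.0
print(frac_derivative_power(1.0, 1.0, t))  # first derivative of t (a velocity)
print(frac_derivative_power(2.0, 2.0, t))  # second derivative of t**2 (an acceleration)
print(frac_derivative_power(1.0, 0.5, t))  # half-order derivative of t

# Two successive half-order differentiations of t reproduce the first derivative.
coefficient = math.gamma(2.0) / math.gamma(1.5)
print(coefficient * frac_derivative_power(0.5, 0.5, t))
```

Whole-number orders give back the familiar velocity and acceleration, while orders between them interpolate smoothly, which is the sense in which a fractional order can stand in place of a change of time scale.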

In the experiments described below, these ideas were developed in relation to two 
processes: (1) the straining (tension or compression) of cylinders or strips of various
colloidal materials (plastics and the like) under constant stress, and (2) the relaxation
of stress of the same materials when subsequently held at constant strain. It
is clear that the concepts of motion and drive need only be applied to conditions 
within the materials tested. Thus straining may be done at constant stress and not 
at constant drive per unit area, since the loads applied to the test-pieces are naturally 
to be treated in the ordinary Newtonian manner. Within the test-piece, however, 
the concept of stress must be regarded with caution. 

In order to approach this rather difficult problem it is best first briefly to review 
the real significance of the rheological equations of the simple prototypes, the 
Hookean solid and the Newtonian liquid. (To avoid confusion, the latter will be 
referred to as the ‘true liquid’.)

The simple Hookean system is generally a crystalline structure with regularly 
spaced inter-atomic potential troughs. The linear relationship between stress and 
strain within the small elastic range represents no more than an approximate 
cancellation of opposite curvatures from the power laws of attraction and repulsion. 
The most convenient entity by which to express the stress-strain relation is the 
rigidity modulus (n) ($ML^{-1}T^{-2}$), i.e. the simple ratio of (Newtonian) stress to strain.




Both the states of stress and of strain can be described by tensors made up of the 
three spatial components of the tractions and displacements (respectively) on each 
of the three faces of the unit cube. These ninefold tensors are symmetrical, for 
isotropic systems, so the only concern in this case is with six components. The 
appropriate vectors can be selected from these tensors to give the usual elastic 
constants, though in the work described in this paper volume changes can safely be 
ignored. 

At the other end of the scale are the amorphous systems which tend to fluid behaviour.
Here the reaction of the material may be regarded as uninfluenced by
previous straining, so that the simple relationships between stresses and strain rates 
are independent of the magnitude of the strain and can thus be derived by a simple 
differentiation of the elasticity equations, for pure deformations. 

The entity by which the stress-strain-time (S : σ : t) relation is best described is the
ratio of shear stress to rate of shear (viscosity, η). Just as Hooke’s law is no more than
an approximation, so Grunberg & Nissan (1945) have shown that even for true
fluids the viscosity is only constant under limited conditions.
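
The dimensions of the two prototype entities can be checked by simple bookkeeping of the exponents of mass, length and time. The sketch below is an illustration with hypothetical helper names; exponents are held as exact fractions so that an entity with a non-integral exponent of time can be written down in the same way, and the exponent 1/3 in the last line is an arbitrary example, not a value from the text.

```python
from fractions import Fraction


def dims(m=0, l=0, t=0):
    """Exponents of mass, length and time as exact fractions."""
    return (Fraction(m), Fraction(l), Fraction(t))


def multiply(a, b):
    return tuple(x + y for x, y in zip(a, b))


def divide(a, b):
    return tuple(x - y for x, y in zip(a, b))


STRESS = dims(1, -1, -2)   # force per unit area
STRAIN = dims()            # a pure number
TIME = dims(t=1)

rigidity_modulus = divide(STRESS, STRAIN)   # S / sigma
strain_rate = divide(STRAIN, TIME)          # d(sigma)/dt
viscosity = divide(STRESS, strain_rate)     # S / (d(sigma)/dt)

# Hypothetical entity of the form S * t**(1/3): its time exponent is not a whole number.
fractional_entity = multiply(STRESS, dims(t=Fraction(1, 3)))

print("modulus  ", rigidity_modulus)
print("viscosity", viscosity)
print("example  ", fractional_entity)
```

The modulus and the viscosity differ only by one power of the time; an entity whose time exponent lies between the two is of the kind the paper goes on to call a quasi-property.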

Although there are some important materials which approximate to Hookean or
to truly fluid behaviour, these are the exception rather than the rule. Between the
crystalline and amorphous materials are many others whose structure is defined to
a greater or lesser degree. Many of the so-called ‘high-elastic’ materials show approximately
Hookean behaviour over large ranges of strain. The mechanism of this is
quite different from that of ordinary elasticity, but the behaviour of the material
may be described in terms of a simple ratio of stress to strain where this is found to
be approximately constant. For really large strains, however, the strain axes do not
lie at 45° with the direction of shear, so that although shear stress is proportional to
shear strain, tensile stress is not proportional to tensile strain. Under these conditions,
Mooney (1940) proposes a second modulus (for rubber) which is called the
‘Coefficient of Asymmetry’.

Much work has been done on ‘high elasticity’, but in many otherwise valuable 
contributions to the subject it has been forgotten that the equations relating the 
tensile and shear moduli of elasticity are derived only by making certain assumptions
such as that strains are small; and that these equations are not valid for large strains
even when Poisson’s ratio is ½ (British Rheologists’ Club 1946).

There still remain many materials (plastics, bitumens, pastes, unvulcanized
rubbers, etc.) for which the inter-atomic potential troughs are very irregularly
spaced and for which neither $S/\sigma$ nor $S \div \mathrm{d}\sigma/\mathrm{d}t$ is constant. When these materials
are strained, they are not, during the straining, in an equilibrium state in the usual
sense of the term. By a system in equilibrium is generally meant one in which there
are no unbalanced forces (force being here used in accordance with the Newtonian
definition), although the term is also used less precisely to imply a balance of gains
and losses, as when it is said that living matter maintains a state of equilibrium.

It is often necessary to study rheological behaviour in systems which are not in
Newtonian equilibrium, and, as might be expected from what has already been said,
it is found that the combinations of $S$, $\sigma$ and $t$ which are found to be constant do not
prove to be physical properties, in the sense that they do not have whole-number or
even universal exponents of $M$, $L$ and $T$; nor is the process by which they are
measured reversible, in that the states through which the material passes are not equilibrium
states, so that tests, though reproducible, must be repeated on fresh samples.

As typical of such entities take ‘firmness’, which, for many complex industrial
materials is of great practical importance, can be reproducibly measured on replicate 
samples and in some cases by quite independent physical methods but which has 
dimensions which depend on the material or group of materials. 

It has often been pointed out that such entities are not to be regarded as ‘ physical 
properties’ in the strict sense of the term, and it has been suggested that, except 
for purely ad hoc industrial purposes, treatments involving them should be avoided. 
The alternatives, however, are far from satisfactory; they are, either to ignore such
materials altogether or to presume that they are mechanical mixtures of simple 
Hookean and true-fluid prototypes or at least that they may be treated as such. 

This latter assumption is occasionally justified, but although it is usually possible 
to reproduce an experimental rheological curve by fitting a number of exponential 
terms and always possible to invent a new prototype on conveniently exponential 
lines, neither process does much to further our knowledge of the behaviour of real 
materials. We propose, then, to accept the entities of the kind described above and to 
refer to them as ‘quasi-properties’. They are a measure rather of a process than of 
a state, but it is a steady process, and, though not at equilibrium in the Newtonian 
sense, we feel justified in describing it as a state of ‘quasi-equilibrium’. Thus the 
‘process’ of work-hardening of a metal under pressure from a sphere in an indentation
hardness test is measured by the Meyer exponent; there are many illustrations.

While it is true that the measurement of a ‘property’ must not be altered by the 
process of measurement (the axiom underlying Heisenberg’s principle), there would
seem to be no objection to measuring the process of change produced by the test 
provided that the methods are standardized and the results are independent of the 
observer. 

It is clear that quasi-properties will be of different ‘orders’. Some will hardly
differ from true physical properties in that the exponents of $M$, $L$ and $T$, although fractional,
will be the same (within the limits of experimental error) for quite large groups
of materials, and the quasi-properties themselves will be reproducible by widely 
different physical methods of measurements. At the other end of the scale will be 
quasi-properties of a low order which, although reproducible on replicate samples 
by some standard method, will neither be unique dimensionally for any appreciable 
range of materials nor equal quantitatively when derived from different types of 
test. Normally, one is faced with the choice of selecting a large number of true 
properties or a smaller number of quasi-properties, since it is nearly always possible 
to express rheological behaviour in terms of true properties provided enough 
constants are fitted. 

It is hoped to show that the latter course is by no means always the best. 





Mathematical treatment of quasi-properties. The principle of intermediacy

In many industries, the firmness of materials is assessed by handling by experts, 
and psycho-physical studies have been made of the thresholds (i.e. smallest 
perceptible differences) for various systems (Blair & Coppen 1939, 1940, 1942, 
1943). It has also been found that experts handling materials tend to judge firmness 
not as if the materials were mixtures of elastic and viscous components but rather 
as though the quality of firmness lay between those of elastic modulus and of 
viscosity. This principle of ranking complex entities as lying at a certain distance 
between the prototypes, which we call the ‘Principle of Intermediacy’, common in 
psychology, is rare in rheology—one of the few existing examples in the latter science 
is the ‘viscosity index’ of an oil.

It may be objected that any such treatment will preclude any simple relation 
between the stress and strain tensors and will involve an infinite series of tensors.
This is true, but, as has already been pointed out, for the very large strains which
have to be considered, and, in view of non-equilibrium conditions, the usual tensor
theory would be inapplicable in any case. Whether it will be possible to develop a
new tensor theory, relating drive to strains, we do not know. 

In the meantime, however, there are several possible applications of the Principle 
of Intermediacy which will help, bearing in mind that the materials given are in 
quasi-equilibrium and that there being no Newtonian bodies concerned within the 
material, the use of the second as the uniform time unit is sure to involve motions 
which are neither velocities nor accelerations. It is clear that in order to define a 
quasi-property it is necessary to specify (a) numbers indicating the dimensions of the 
quasi-property and (b) quantities measuring its intensity.

The first attempt was made by Blair & Coppen (1939) for constant stress 
conditions and was based on the following argument: 

The entity which is found to be constant for true fluids is given by $\eta = S\sigma^{-1}t$;
the entity which is found to be constant for elastic solids is given by $n = S\sigma^{-1}t^{0}$. It
was therefore proposed that the entity which might well be expected to be constant
for complex materials would be

$$\psi = S\sigma^{-1}t^{k}, \qquad (1)$$

where, normally, $1 > k > 0$.

This is a special case of a more general equation derived by intermediacy from the
imaginary prototypes $S^{\beta}\sigma^{-1}t$ and $S^{\beta}\sigma^{-1}t^{0}$, for which we will redefine

$$\psi = S^{\beta}\sigma^{-1}t^{k}. \qquad (2)$$

This equation had been empirically proposed by Nutting (1921a, b) many years
earlier.
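
As a minimal sketch of equation (2), the following Python functions evaluate the strain predicted by the Nutting equation and recover $\psi$ from a single observation. The function names and the numerical values are illustrative assumptions, not taken from the paper; following the paper's notation, `S` is stress and `sigma` is strain, in any consistent units.

```python
def nutting_strain(S, t, psi, beta=1.0, k=0.5):
    """Strain sigma from psi = S**beta * sigma**-1 * t**k (equation (2))."""
    return (S ** beta) * (t ** k) / psi


def nutting_psi(S, sigma, t, beta=1.0, k=0.5):
    """Quasi-property psi recovered from one (S, sigma, t) observation."""
    return (S ** beta) * (t ** k) / sigma


# Hypothetical values, chosen only to show the two prototypes as limits.
S, t, psi = 2.0e5, 40.0, 1.0e7
hookean = nutting_strain(S, t, psi, beta=1.0, k=0.0)  # no time dependence
fluid = nutting_strain(S, t, psi, beta=1.0, k=1.0)    # strain grows linearly with t
plastic = nutting_strain(S, t, psi, beta=1.0, k=0.2)  # intermediate case
print(hookean, plastic, fluid)
```

With $k = 0$ the expression reduces to the elastic prototype and with $k = 1$ to the fluid prototype; intermediate values of $k$ lie between the two.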

A more general treatment (Blair 1944) considers viscosity as defined by
the equation $\eta = S \div \mathrm{d}\sigma/\mathrm{d}t$ and shear modulus as $n = S \div \sigma$.





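A quasi-property $\chi_1$ is then defined by the Principle of Intermediacy:

$$\chi_1 = S \div \frac{\mathrm{d}^{\mu}\sigma}{\mathrm{d}t^{\mu}}, \qquad (3)$$

where $\mathrm{d}^{\mu}/\mathrm{d}t^{\mu}$ indicates a fractional differentiation. (We propose this convenient symbol
since it reminds us of both fractions and $\Gamma$-functions.)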

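In our psycho-physical work we had already made use of the fractionally differentiated
form of equation (2) for constant stress conditions:

$$\frac{\mathrm{d}^{\nu}\sigma}{\mathrm{d}t^{\nu}} = \frac{\Gamma(k+1)}{\Gamma(k-\nu+1)}\,\psi^{-1}S^{\beta}t^{k-\nu}. \qquad (4)$$

Our psycho-physical experiments did not at the time justify us in equating $\nu$ and
$k$, but Caffyn & Blair (1945) later showed that if one is prepared to equate
$k$ and $\mu$ (in equation (3)), the equations can be combined, since (4) becomes

$$\frac{\mathrm{d}^{k}\sigma}{\mathrm{d}t^{k}} = \Gamma(k+1)\,\psi^{-1}S^{\beta}, \qquad (4a)$$

from which it follows that

$$S^{\beta} \div \frac{\mathrm{d}^{k}\sigma}{\mathrm{d}t^{k}} = \frac{\psi}{\Gamma(k+1)}.$$

Hence

$$\psi = \Gamma(k+1)\,\chi_1. \qquad (5)$$

Since $\Gamma(k+1)$ is a number, in this special case when the Nutting equation holds,
$\psi$ and $\chi_1$ measure the intensity of the same quasi-property.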
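
The role of the gamma function here can be illustrated with the standard rule for the fractional derivative of a power of $t$: the derivative of order $\nu$ of $t^{k}$ is $\Gamma(k+1)/\Gamma(k-\nu+1)\,t^{k-\nu}$. The sketch below is an illustration under that rule only; the function name `frac_deriv_power` is hypothetical and the values are arbitrary.

```python
import math


def frac_deriv_power(t, k, nu, coeff=1.0):
    """Order-nu derivative of coeff * t**k for a power law in t.

    Uses Gamma(k + 1) / Gamma(k - nu + 1) * t**(k - nu); valid when
    k - nu + 1 is not zero or a negative integer.
    """
    return coeff * math.gamma(k + 1.0) / math.gamma(k - nu + 1.0) * t ** (k - nu)


# Ordinary first derivative of t**2 at t = 3 is 2 * 3.
first = frac_deriv_power(3.0, 2.0, 1.0)

# When the order equals the exponent, the result no longer depends on t.
early = frac_deriv_power(5.0, 0.2, 0.2)
late = frac_deriv_power(50.0, 0.2, 0.2)
print(first, early, late)
```

Setting the order of differentiation equal to the Nutting exponent removes the time dependence and leaves only the numerical factor $\Gamma(k+1)$, which is why the two measures of intensity differ by a pure number.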

A second equation can also be obtained by fractional differentiation of stress with 
respect to strain 

X2 = 

which, taken together with equation ( 3 ), leads to the general equation 



At present we are not able to make use of equation ( 7 ), but Bosworth (1946), 
quite independently of us, has recently proposed a somewhat similar quasi-property 
as a criterion of plasticity. 

So far, the gamma function has not been much used in rheology. Its relation to 
power-law treatments is discussed by Gemant (1936, 1944) who has used half-differentials,
but, although in his earlier paper he suggests that at least ‘for secondary
differential equations [the fractional differential] may really occur in a physically 
realizable case’, in his later paper he expresses the view that it ‘only occurs as a 
useful mathematical symbol, whereas the underlying elementary process, whatever 
it may be, will probably contain differential quotients of an integral order’. His 






view of the matter is clearly very different from ours. Our own view of the physical
meaning of the fractional differential is somewhat as follows: 

If cylinders of Californian bitumen (which is very nearly a true fluid) are compressed
under constant stress, the amount of deformation produced will depend
equally on the stress and on the time of its application. In the case of rubber, which
may be regarded as nearly Hookean, the strain will depend on the stress only and
not at all on the time. In the case of a certain plastic, strain is found to depend on stress
and a fifth as much on the time, i.e. the Nutting equation holds and $k = 0.2$. When
this simple equation does not hold for constant-stress experiments, the next simplest
hypothesis is that, just as there is a constant property for the bitumen (viscosity)
which is given by the ratio of stress to the first differential of strain with respect to
time, and a constant ratio (shear modulus) for the rubber given by the stress divided
by the zero-differential of strain with respect to time, so the plastic has a quasi-property
which is constant and given by the ratio of stress to the 0.2 differential of
strain with respect to time. It must be remembered, of course, that if this plastic 
were the only physical system to be considered, it would be simpler to use a different 
rule for equating time intervals chosen so that the constant entity would have 
dimensions with whole-number exponents. 
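
The phrase "a fifth as much on the time" can be read as a statement about exponents. The sketch below, which assumes the Nutting form with $\beta = 1$ and uses a hypothetical helper `strain_factor`, compares the effect on strain of doubling the stress with that of doubling the time for the three materials mentioned.

```python
import math


def strain_factor(stress_factor, time_factor, k, beta=1.0):
    """Factor by which Nutting strain changes when stress and time are scaled."""
    return stress_factor ** beta * time_factor ** k


# k values as described in the text: rubber ~ 0, the plastic 0.2, bitumen ~ 1.
for name, k in [("rubber", 0.0), ("plastic", 0.2), ("bitumen", 1.0)]:
    by_stress = strain_factor(2.0, 1.0, k)  # double the stress only
    by_time = strain_factor(1.0, 2.0, k)    # double the time only
    # Relative weight of time against stress on a logarithmic scale.
    weight = math.log(by_time) / math.log(by_stress)
    print(name, by_stress, round(by_time, 4), round(weight, 2))
```

On a logarithmic scale the weight of time relative to stress is simply $k$ (more generally $k/\beta$).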

This view has strong experimental support (Blair & Coppen 1943), since it 
is found that in a statistical study of the compression of a large number of cylinders 
of a plastic in direct handling comparison with cylinders of rubber or bitumen, k, 
calculated from the relative importance of squeezing time, agrees (within experimental
error limits) with k derived from simple compressions on a machine at
constant stress. We regard this experimental fact as strong evidence for the validity 
of our line of approach. 

It is known that some materials show a simple power-law relation between stress 
and strain not involving time (Bach’s law), though as Reiner (1933) points out this 
does not involve us in dimensional difficulties. There are, on the other hand, many 
systems for which $\beta$ (in equation (2)) is almost unity, though we have shown this to
be, in some cases, a chance cancellation of the opposite effects of plasticity and a type
of dilatancy (Blair & Veinoglou 1944a).

All these phenomena may not be found to be explainable in terms of the non- 
Newtonian time scale, but certainly many of the anomalies hitherto unexplained 
would seem to be clarified by it. 

Although we have shown (and indeed so have many other authors) that a very 
large number of materials obey the simple Nutting law within the limits of experimental
error, it is certainly worth while to consider what happens when the situation
is not so simple. 

Equation (4) can be integrated to give an equation of the type

$$\sigma = S^{\beta}\,(A t^{k} + B t^{k-1} + \dots) \qquad (8)$$

and it is clear that the Nutting equation is a special case of this when $A \gg B, C, \dots$,
etc., and that the addition of subsequent terms would not only account for differences




between $k$ and $\mu$ but might also express the quasi-properties of those of our materials
whose agreement with the simple Nutting equation is unsatisfactory. In fact we 
will show in the experimental part that it is rarely necessary to go beyond the second 
term. 
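
To show how a second term modifies the simple law, the sketch below assumes the two-term form $\sigma = S^{\beta}(At^{k} + Bt^{k-1})$, which is the form used later in the statistical treatment. The function names and coefficient values are hypothetical.

```python
def strain_nutting(S, t, A, k, beta=1.0):
    """One-term (Nutting) strain: S**beta * A * t**k."""
    return S ** beta * A * t ** k


def strain_two_term(S, t, A, B, k, beta=1.0):
    """Two-term strain: S**beta * (A * t**k + B * t**(k - 1))."""
    return S ** beta * (A * t ** k + B * t ** (k - 1.0))


# Hypothetical coefficients; the correction factor is 1 + B / (A * t).
S, A, B, k = 3.0, 1.0, 0.5, 0.25
for t in (10.0, 100.0, 1000.0):
    ratio = strain_two_term(S, t, A, B, k) / strain_nutting(S, t, A, k)
    print(t, ratio)
```

The ratio of the two forms is $1 + B/(At)$, so the second term matters mainly at short times and the simple law is recovered as $t$ increases.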

In the light of this introduction we would then summarize the purposes of our 
experiments as follows: 

We believe that in the case of many complex colloidal materials, such processes
as simple straining (under constant stress) and stress dissipation (under constant 
strain) are not equilibrium processes in the Newtonian sense, and that the time 
system which would most simply explain their behaviour is not Newtonian time. 
If Newtonian time is used, the entities which most simply describe their behaviour, 
which we call ‘quasi-properties’, will not have whole-number dimensions in $M$, $L$ and $T$,
and their fractional dimensions will be more or less dependent on the nature of the 
material. We would ask ourselves: (1) With what accuracy can these ‘quasi- 
properties’ be measured in straining and relaxation and how will their values 
obtained by these two very different methods agree with one another ? (2) Simple 
tensor theory is not applicable to these conditions, nor has the necessarily more 
complex non-linear tensor theory been developed; but we have proposed certain 
fractional differential equations from which the ‘quasi-properties’ associated with 
‘firmness’ can be defined. How well do these equations fit the experimental data, 
and how does the fit and the simplicity of treatment compare with the classical 
methods of deriving true ‘physical properties’? 


Experimental

Data of a kind suitable to test the theory outlined above are best obtained by 
holding either stress or strain constant and measuring the rate of change of the 
remaining variable with time. There are various ways of extending strips or rods in
such a way that the load diminishes proportionally to the average change in cross-section
of the test-piece (volume changes being ignored), but if a single instrument
is to be used both for extensions and compressions, there is everything to be said for
the bent lever type of compensator. The limitations and uses of this type of constant-stress
device have been worked out fully by Caffyn (1944). Blair & Veinoglou
(1944b) have already shown that many complex colloidal systems tested on
machines of this type follow the Nutting equation with reasonable accuracy. In
preliminary experiments a few materials were tested after rapid straining in relaxation
at constant strain as well as by comparatively slow straining at constant stress.

Equation (2) can be partially differentiated for these two processes to give
$\partial S/\partial t = -(k/\beta)(S/t)$ for relaxation and $\partial\sigma/\partial t = k\sigma/t$ for straining. Thus the slope of the relaxation
curve plotting $\log S$/$\log t$ is $-k/\beta$, and of the straining curve, plotting $\log\sigma$/$\log t$, is $k$.
The constant ^ can be calculated from the position of a series of curves obtained 
at different strains and stresses respectively. 
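
The two slopes can be checked numerically. The sketch below generates synthetic straining and relaxation curves from equation (2) and fits straight lines to the logarithmic plots; the constants `k`, `beta` and `psi` and the helper `ls_slope` are hypothetical, chosen only for illustration.

```python
import math


def ls_slope(xs, ys):
    """Least-squares slope of y on x."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy / sxx


k, beta, psi = 0.25, 1.3, 5.0e6   # hypothetical Nutting constants
times = [10.0, 20.0, 40.0, 80.0, 160.0]
log_t = [math.log10(t) for t in times]

# Straining at constant stress: sigma = S**beta * t**k / psi.
S_const = 2.0e5
strain = [S_const ** beta * t ** k / psi for t in times]

# Relaxation at constant strain: S = (psi * sigma / t**k)**(1 / beta).
sigma_const = 0.2
stress = [(psi * sigma_const / t ** k) ** (1.0 / beta) for t in times]

slope_strain = ls_slope(log_t, [math.log10(s) for s in strain])
slope_relax = ls_slope(log_t, [math.log10(s) for s in stress])
print(slope_strain, slope_relax)
```

The straining slope returns $k$ and the relaxation slope returns $-k/\beta$, so the two experiments together determine both constants.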




For the few materials tested in this way it was found that the Nutting constants
$k$, $\beta$ and $\psi$ were not only remarkably constant in each experiment but also their
values obtained in relaxation and in straining were the same within the limits of 
experimental error. 

But to make a thorough study of these materials an apparatus was made by which
a strip or rod (B) could first be extended at constant stress, the load being compensated
for average change in cross-section by means of a bent lever (H) passing
from the horizontal downwards towards the vertical; or cylinders (A) could be compressed
with correctly increasing load effected by a lever moving from the vertical towards
the horizontal (G). The movement is checked at a preselected strain by means of
collars set the correct distance apart (at J). The subsequent stress dissipation is
measured by a strong spring (G) resting on knife-edges, serving as a base for the
compressed cylinder or a ceiling to which the extended strip is attached. The
movements of the spring are negligible by comparison with the deformations produced
in the test-piece, and the load on the spring is measured by an optical lever
(D) which magnifies some thousandfold.



The deformations of the stretched strip are measured by a pointer on a scale (F),
but those of the compressed cylinder are too small for this and are suitably magnified
by a second optical lever (E) operating from the stirrup which loads the cylinder.
The whole instrument, which is shown diagrammatically in figure 1, is operated in
a constant-temperature room, some experiments being done at 15° C and some at
18° C.





With this apparatus, test-pieces could be strained under high or low stress and to 
large or small preselected strains and the subsequent stress dissipations followed. 

Since ‘quasi-properties’ refer to processes rather than to states, the question 
arises whether relaxation should be regarded as a separate process from straining, 
a new time zero being taken; or whether a single time scale should be used throughout 
the double experiment. For very rapid straining the difference between these two 
methods is negligible and the best relaxation data were obtained in this way, being 
compared with data from separate and slower straining experiments. For relaxation
following very slow straining it is best to use a single time scale. If a new time zero
is taken there will be an appreciable period before the new state of quasi-equilibrium
is established and, on account of frictional effects, the instrument is not suited to
very slow working. Finally, all our data were calculated by the two-process method, 
a small number of data from slow straining experiments being discarded. 

As already pointed out, altogether apart from our treatment, the classical 
elasticity equations relating tensile to shear conditions are not valid for the large 
strains in which we are interested, nor do our materials generally remain isotropic 
on straining. 

It would really have been better if, from tensile stresses and strains, ‘tensile’
constants had been derived, even though shear modulus has more fundamental
significance than Young’s modulus; but, historically speaking, the constant $\psi$ in
the Nutting equation has always been defined as analogous to viscosity and shear
modulus rather than to coefficient of viscous traction and Young’s modulus. For
this reason, $\psi$ is calculated from a shear stress taken as ⅓ of the tensile (or compressive)
stress. This is purely a matter of convention and has no theoretical meaning. The
factor 3 could be replaced by 2(1 + Poisson’s ratio), but Poisson’s ratio differs little from ½
for these materials.

There are various ways of defining large strains. Like Bridgman (1945) we prefer 
the natural strain, $\log_e(l/l_0)$, and have used this formula throughout. It has the great
advantage of being additive. 
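
The additivity of the natural strain can be illustrated by extending a test-piece in two stages. The lengths below are arbitrary and the function names are hypothetical; the conventional (engineering) strain is included only for contrast.

```python
import math


def natural_strain(length, initial_length):
    """Natural (logarithmic) strain log_e(l / l0)."""
    return math.log(length / initial_length)


def engineering_strain(length, initial_length):
    """Conventional strain (l - l0) / l0, shown for comparison."""
    return (length - initial_length) / initial_length


l0, l1, l2 = 10.0, 12.0, 15.0   # arbitrary lengths for a two-stage extension

total_natural = natural_strain(l2, l0)
staged_natural = natural_strain(l1, l0) + natural_strain(l2, l1)

total_engineering = engineering_strain(l2, l0)
staged_engineering = engineering_strain(l1, l0) + engineering_strain(l2, l1)

print(total_natural, staged_natural)
print(total_engineering, staged_engineering)
```

The staged natural strains sum to the total, whereas the staged conventional strains (0.2 and 0.25) do not sum to the total of 0.5.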

When possible, straining curves were obtained at some six constant stresses and 
relaxation curves starting from about four different initial strains for each of these 
stresses. A complete test on a material thus comprised about twenty-four curves, 
though in many cases such a complete experiment was not possible owing to shortage 
of materials. 

Curves from part of a typical experiment with a thiokol are given in figure 2.
On the left-hand side of the figure, $\log\sigma$/$\log t$ curves are given for seven different
constant stresses; and above is the derived curve plotting $\log S$ against $\log\sigma_{10}$, where
$\sigma_{10}$ is the strain after 10 sec. The slope of this curve, by the method of least squares,
gives $\beta$.
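
A derived curve of this kind can be sketched with the 10 sec experimental strains listed for the butyl rubber in table 1 (three stresses only, so the result is purely illustrative). The code regresses $\log\sigma_{10}$ on $\log S$, for which equation (2) gives a slope of $\beta$; this orientation of the axes and the helper `ls_slope` are assumptions made for the example. The scaling of the stress units does not affect the slope.

```python
import math


def ls_slope(xs, ys):
    """Least-squares slope of y on x."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy / sxx


# Experimental values at t = 10 sec read from table 1 (compression).
stress = [17.8, 21.9, 30.2]             # S, in the scaled units of the table
strain_10s = [0.1012, 0.1216, 0.1959]   # sigma_exp. at 10 sec

beta = ls_slope([math.log10(s) for s in stress],
                [math.log10(e) for e in strain_10s])
print(round(beta, 2))
```

A value of $\beta$ above unity here means that the strain at a fixed time rises faster than in proportion to the stress.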

The part of the relaxation experiment shown on the right of the figure corresponds 
to six different constant strains, all the samples being submitted to the same initial 
stress ($\log S_0 = 6.07$). There were other curves at other stresses, but to show more in
a single figure would be confusing. Above are plotted the six corresponding points
$\log\sigma$/$\log S_{10}$, where $S_{10}$ is the stress after 10 sec. dissipation. The slope of the lower
curves gives $k/\beta$ and that of the upper curve, though of course with many more
points, gives $\beta$ in relaxation. (We are not the first to apply power-laws to relaxation
of stress. Prache (1930) gets excellent curves for unvulcanized rubber.)








These curves are shown as being reasonably typical. The Nutting treatment 
(without addition of a second term) in this case is not particularly satisfactory in 
relaxation. 

Unless the discrepancies are exceptionally large it is not possible to show clearly 
the effects of fitting the second term graphically or even in tabular form. A rather 
extreme case of a butyl rubber is given in table 1, the Nutting plots of which are 
shown in figure 3. To economize space, only three stresses are shown.


Table 1. Comparison of equations (2) and (8) with experimental data for
compression and relaxation for a highly anomalous butyl rubber

compression

         $S \times 10^{-4} = 17.8$        $S \times 10^{-4} = 21.9$        $S \times 10^{-4} = 30.2$
  $t$    $\sigma_8$   $\sigma_2$   $\sigma_{\rm exp.}$     $\sigma_8$   $\sigma_2$   $\sigma_{\rm exp.}$     $\sigma_8$   $\sigma_2$   $\sigma_{\rm exp.}$
   10    0.1016   0.0984   0.1012     0.1217   0.1210   0.1216     0.1963   0.1906   0.1959
   13    0.1057   0.1046   0.1059     0.1276   0.1264   0.1274     0.2047   0.2029   0.2042
   16    0.1107   0.1111   0.1109     0.1345   0.1351   0.1349     0.2149   0.2159   0.2173
   20    0.1165   0.1180   0.1175     0.1426   0.1444   0.1429     0.2269   0.2298   0.2275
   25    0.1233   0.1254   0.1245     0.1518   0.1544   0.1531     0.2404   0.2445   0.2410
   32    0.1308   0.1332   0.1303     0.1621   0.1650   0.1622     0.2557   0.2603   0.2541
   40    0.1401   0.1417   0.1387     0.1737   0.1764   0.1738     0.2726   0.2770   0.2723
   50    0.1486   0.1505   0.1497     0.1864   0.1886   0.1862     0.2912   0.2948   0.2917
   63    0.1589   0.1599   0.1585     0.2003   0.2016   0.1995     0.3116   0.3137   0.3126
   79    0.1701   0.1699   0.1718     0.2156   0.2155   0.2163     0.3339   0.3339   0.3350
  100    0.1823   0.1805   0.1820     0.2323   0.2303   0.2328     0.3582   0.3354   0.3589
  126    0.1955   0.1918   0.1972     0.2504   0.2462   0.2523     0.3846   0.3782   0.3864

relaxation

         $\sigma = 0.0624$                $\sigma = 0.146$                 $\sigma = 0.263$
  $t$    $S_8$     $S_2$     $S_{\rm exp.}$      $S_8$     $S_2$     $S_{\rm exp.}$      $S_8$     $S_2$     $S_{\rm exp.}$
   10     9.926   12.050   10.420     21.410   25.500   22.390     31.180   37.350   32.430
   14     9.800   10.340    9.550     20.980   22.020   20.510     31.000   32.580   30.480
   20     9.147    8.861    8.810     19.560   19.010   18.710     29.280   28.420   28.050
   28     8.212    7.600    7.762     17.600   16.420   16.980     26.650   24.790   25.640
   40     7.161    6.518    7.079     15.440   14.180   15.210     23.620   21.620   23.390
   56     6.130    5.590    6.166     13.300   12.250   13.300     20.560   18.870   20.700
   79     5.171    4.793    5.248     11.320   10.570   11.610     17.660   16.450   18.030
  112     4.319    4.111    4.519      9.543    9.128   10.000     15.030   14.350   15.700
  159     3.583    3.526    3.890      7.997    7.883    8.375     12.720   12.520   13.270
  224     2.958    3.024    3.048      6.670    6.808    6.934     10.700   10.920   10.940
  316     2.431    2.592    2.421      5.547    5.878    5.495      8.976    9.530    9.036
  447     1.995    2.223    1.820      4.603    5.076    4.266      7.515    8.316    6.982

$\sigma_8$ is strain and $S_8$ is stress ($\times 10^{-4}$) calculated from two terms of equation (8).

$\sigma_2$ and $S_2$ are strain and stress ($\times 10^{-4}$) calculated from equation (2) (Nutting’s law).

$\sigma_{\rm exp.}$ and $S_{\rm exp.}$ are experimental values.

$t$ is time in sec. Initial stress in relaxation (all three curves) = $39.3 \times 10^{4}$ dynes/cm.².


The relaxation data illustrate one of those rare cases where even the addition of 
a second term, although effecting a great improvement, still leaves something to be 


desired. The addition of a third term from equation (8) improves the fit still further,
but this improvement would not normally justify the large amount of labour involved
in the calculation.

In introducing a second or even a third term of equation (8), it is important to
bear in mind their significance over different parts of the time scale. For very short
times when $A$ and $B$ have the same sign, the two-term plot goes through a minimum in
straining or a maximum in relaxation since the second term with a negative exponent
preponderates. This has no more physical significance than the minimum in the
well-known Buckingham-Reiner equation if it is applied to stresses below the
yield-value (see Blair 1938). The condition of quasi-equilibrium which the treatment
postulates has not been reached at such small values of $t$.
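
The position of this short-time minimum in straining can be located explicitly. Assuming the two-term time factor $At^{k} + Bt^{k-1}$ with $A$ and $B$ of the same sign and $0 < k < 1$, setting the derivative to zero gives $t = B(1-k)/(Ak)$. The sketch below uses hypothetical coefficients and function names.

```python
def two_term(t, A, B, k):
    """Time factor of the two-term form: A * t**k + B * t**(k - 1)."""
    return A * t ** k + B * t ** (k - 1.0)


def turning_point(A, B, k):
    """Time at which d/dt [A*t**k + B*t**(k-1)] vanishes.

    From A*k*t**(k-1) + B*(k-1)*t**(k-2) = 0, i.e. t = B*(1-k)/(A*k).
    Assumes A and B have the same sign and 0 < k < 1.
    """
    return B * (1.0 - k) / (A * k)


A, B, k = 1.0, 0.5, 0.25          # hypothetical coefficients
t_min = turning_point(A, B, k)
print(t_min, two_term(t_min, A, B, k))
```

For times well beyond this turning point the first term dominates and the curve approaches the simple Nutting form.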

But it is curvature in the Nutting curves for comparatively short times that is 
most effectively corrected by introducing the second term. 

A good Nutting curve which deteriorates only after long times is not easy to deal 
with by means of equation (8). Downward curvature of this type is rare, and upward 
curvature suggests an examination of the test-piece for signs of rupture or, in the 
case of extensions, slipping at the grips. 

In relaxation, where deviations from simple Nutting behaviour are on the whole 
more serious, curvature is almost invariably found in the short-time part of the 
curves. 

There are a number of cases where the $\log\sigma$/$\log t$ plot for a series of constant stress
experiments gives curves which superficially appear to be linear, but it is found that
the slope, $k$, increases or decreases progressively with increasing stress. It is thought
likely that this is due to the fact that the Nutting equation is only the first term of
the series equation (equation (8)), and that even though the second term correction
might be small, the value of $k$, determined over the experimental range, might differ
significantly from the true slope, i.e. the slope at large strains. If this is the explanation,
then any slight upward curvature of the $\log\sigma$/$\log t$ curve should lead to an
apparent increase in $k$ with increasing stress, whereas downward curvature would
lead to the reverse. Curvature is easily checked by comparing the magnitude of $k$
from the Nutting equation and $k'$ from the two-term fit, and it was found in the
great majority of cases that any tendency for $k$ to vary with stress was in the sense
predicted on this hypothesis. Evidence is, however, not adequate to state categorically
that all apparent variations of $k$ with stress are due to this cause.

As well as the problem of fixing a time zero, there is also the question of the strain 
zero. For simple elastic systems this is easy, but for complex systems zero strain 
means no more than the shape of the test-piece at the start of the experiment, 
assuming that the material is then in the normal rest condition with which the test 
is required to deal. As Love (1906) puts it: ‘The unstrained state ... is, as it were,
an arbitrary zero of reckoning, and the choice of it is in our power.’

The presence of undissipated internal stresses, which would lead to the fixing
of a zero of strain different from that selected if the material were devoid of internal
stresses, would produce a curvature in the $\log\sigma$/$\log t$ curves assuming a
normal Nutting behaviour for the unstressed system. Furthermore, the source of
‘error’ would lead to a modification of the Nutting equation precisely the same as
equation (8).

This suggests that the presence of a significant second term in equation (8) may, 
at least in some cases, be indicative of undissipated internal stresses, a point which, 
if verified, would be of considerable practical importance. 

Unfortunately, we have neither time nor suitable materials to test this hypothesis 
adequately, though preliminary experiments are sufficiently encouraging to warrant 
further investigation.

Alternative treatments 

As pointed out in the introduction, it is possible, by postulating enough exponential
terms, to obtain an adequate fit to almost any $S/t$ curve. In order to get accuracy
comparable to that derived from the two-term form of equation (8), either two or
three exponential terms are generally required. The equations, which were calculated
by Whitehead’s (1935) method, are of the form $S = \dots$, which means
that a two-term exponential involves five parameters (‘properties’) and a three-term
involves seven. The work involved depends on the method used. With Whitehead’s
graphical method, the calculation takes less time than that for the two-term
equation (8), but it must be remembered that the latter calculation gives a statistical
fit whereas the former does not give the errors in the fitted parameters.

As a second alternative, Alexandrov & Lazurkin’s (1940) method was tried, though
in the majority of cases the assumptions that there is an easily recognizable ‘immediate’
elastic reaction $\sigma_0$ and that, under constant stress, the strain approaches
asymptotically to an equilibrium (maximum) value ($\sigma_\infty$), are so wide of the mark
for most of the materials tested by us that it is rarely possible to get any fair measure
of $\sigma_0$ or $\sigma_\infty$. For elastic bitumen, for which these constants could be determined
with some confidence, the fit of the Alexandrov-Lazurkin equation was definitely
bad.

Apart from the earlier work already published, we had some 173 straining and 
455 relaxation curves suitable for a complete statistical treatment using equation (8). 
These data cover some thirty-eight ‘materials’, counting characteristically different
members of the same family of polymers as distinct ‘materials’.

Statistical treatment 

Under constant stress conditions, equation (2) gives linear $\log\sigma$/$\log t$ curves, and
under constant strain conditions linear $\log S$/$\log t$ curves. The constants of these
equations have been determined by fitting a straight line to the data by the method
of least squares. It was assumed that the $\log t$ values were known accurately in both
cases and the error was in $\log\sigma$ and $\log S$ respectively.
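
A straight-line fit of this kind, with all error assigned to the ordinate, can be sketched as follows. The data are the experimental compression strains for the lowest stress in table 1; the function `fit_line` and the use of the usual standard error of the slope are assumptions made for illustration, not a reproduction of the original calculation.

```python
import math


def fit_line(xs, ys):
    """Least-squares line y = a + b*x with errors assumed in y only.

    Returns (a, b, standard error of b).
    """
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = my - b * mx
    rss = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
    se_b = math.sqrt(rss / (n - 2) / sxx)
    return a, b, se_b


# Experimental strains from table 1 (compression, lowest stress).
times = [10, 13, 16, 20, 25, 32, 40, 50, 63, 79, 100, 126]
strain = [0.1012, 0.1059, 0.1109, 0.1175, 0.1245, 0.1303,
          0.1387, 0.1497, 0.1585, 0.1718, 0.1820, 0.1972]

a, k, se_k = fit_line([math.log10(t) for t in times],
                      [math.log10(s) for s in strain])
print(f"k = {k:.3f} +/- {se_k:.3f}")
```

The slope is the Nutting exponent $k$ for this one curve; the intercept combines $\beta \log S$ and $\log \psi$ and cannot be separated without curves at other stresses.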




The constants of equation (8) (for tension), using the first two terms, were determined
by transforming it to

$$\log\sigma = \beta\log S + \log A + k'\log t + \log[1 + B/(At)]$$

or

$$\log\sigma = \beta\log S + \log A + k'\log t + BM/(At)$$

to the first approximation, provided $BM/A < 1$, $t > 1$ and $M = \log_{10} e$. This equation
can be written

$$y = a + bx + c \times 10^{-x},$$

where $y = \log\sigma$, $x = \log t$, $a = \beta\log S + \log A$, $b = k'$, $c = BM/A$.

The values of a, b and c were determined by the method of least squares, assuming 
X to be known exactly. 
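
Because the transformed equation is linear in $a$, $b$ and $c$, it can be fitted by ordinary least squares on the three regressors $1$, $x$ and $10^{-x}$. The sketch below solves the normal equations directly and checks the routine on synthetic data generated from known coefficients; the function name `fit_two_term` and the coefficient values are hypothetical, and only the time values are those of table 1.

```python
import math


def fit_two_term(log_t, log_sigma):
    """Least-squares fit of y = a + b*x + c*10**(-x); returns (a, b, c)."""
    rows = [(1.0, x, 10.0 ** (-x)) for x in log_t]
    # Normal equations (X^T X) p = X^T y for the three coefficients.
    m = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    v = [sum(r[i] * y for r, y in zip(rows, log_sigma)) for i in range(3)]
    # Gaussian elimination with partial pivoting on the 3 x 3 system.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        v[col], v[pivot] = v[pivot], v[col]
        for row in range(col + 1, 3):
            f = m[row][col] / m[col][col]
            for j in range(col, 3):
                m[row][j] -= f * m[col][j]
            v[row] -= f * v[col]
    # Back substitution.
    p = [0.0, 0.0, 0.0]
    for i in (2, 1, 0):
        p[i] = (v[i] - sum(m[i][j] * p[j] for j in range(i + 1, 3))) / m[i][i]
    return tuple(p)


times = [10, 13, 16, 20, 25, 32, 40, 50, 63, 79, 100, 126]
x = [math.log10(t) for t in times]
a_true, b_true, c_true = -1.3, 0.3, 0.8   # hypothetical coefficients
y = [a_true + b_true * xi + c_true * 10.0 ** (-xi) for xi in x]

a, b, c = fit_two_term(x, y)
print(a, b, c)
```

With real data, $b$ estimates $k'$ and $c$ estimates $BM/A$, so the sign and size of $c$ indicate the direction and importance of the second-term correction.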

A similar transformation can be made for constant-strain conditions (relaxation) 
and the constants determined in the same way. 

The summarized results of this analysis are given in table 2. 


Discussion of results

In the case of the simple Nutting equation, the constant $k$ is directly derived from
the rate of straining of a single test-piece under constant stress, and the rate of
dissipation of the same test-piece gives $k/\beta$. The calculation of $\beta$, on the other hand,
involves data obtained from a number of test-pieces, and there is no doubt that the
very much higher standard error in $\beta$ is due to the fact that lack of homogeneity of
the samples from which the test-pieces were taken is very much the limiting factor
in defining the accuracy of comparison between different materials. For this reason
it can hardly be over-emphasized that when it is stated that the introduction of a
second term significantly improves the fit for a given curve, that does not mean that
the unmodified Nutting equation would be inadequate if the purpose of the tests is
to compare one material with another. The adequacy of any equation for this purpose
depends on the homogeneity of particular materials. If two strips from the same sheet
of plastic differ appreciably, as is often the case, there will be little point in improving
on the Nutting plot in order to get effective constants for the material unless the
second term is very much larger than that which would necessitate its use if only
individual curves were considered.

Having given this warning it is now possible to proceed to discuss the proportion 
of cases in which individual curves can be improved by the use of a second term. 

In the straining experiments (tension and compression) 66 curves out of 173 
(38 %) show significant improvement on fitting a second term. In terms of materials 

47 % of materials are significantly improved in less than 25 % of cases, 

24 % of materials are significantly improved in 25 to 75 % of cases, 

29 % of materials are significantly improved in more than 75 % of cases. 

In all cases where there was no significant 'improvement' it is implied that the 
Nutting fit was adequate within the limits of experimental error. There were no 
cases of bad Nutting fit where the introduction of a second term did not effect an 
improvement and only a very few (and then only in relaxation, e.g. see table 1) 
where the fitting of one further term did not give a statistically satisfactory fit. 

Table 2. Mean Nutting constants of materials and significance of addition of a second term 

[Table 2: the numerical entries are not recoverable from the scanned page.] 

In the relaxation experiments, the simple Nutting equation was less adequate 
than in straining. Thus, of 455 relaxation curves analysed, 270 (59 %) were 
significantly improved by the addition of a second term. 

Only 9 % of materials tested required a second term in less than 25 % of cases, 

54 % of materials tested required a second term in 25 to 75 % of cases, 

and 37 % of materials tested required a second term in more than 75 % of cases. 

It will be recalled that in the preliminary relaxation experiments in which the 
initial straining was very rapid, for the few materials tested, values of $\psi$, $k$ and $\beta$ 
were strikingly similar when obtained by straining at constant stress and by 
relaxation. With the slow controlled pre-straining of the later experiments, there were, in 
general, greater divergencies. Confining our comparisons only to materials for which 
the unmodified Nutting equation holds adequately, some results are given in 
table 3. 

Table 3. Comparison of constants (means) obtained in straining and relaxation 

                                 $\log\psi$                 $k$                    $\beta$
material                     straining  relaxation   straining  relaxation   straining  relaxation
sulphur-free rubber             7.75       8.56        0.022      0.046        1.07       1.19
elastic bitumen                 5.82       6.46        0.121      0.112        0.82       0.89
butyl rubber                    9.44      11.65        0.037      0.062        1.37       1.70
neoprene A.                     8.24       8.88        0.034      0.056        1.13       1.22
thiokol F.A.                    6.05       5.44        0.095      0.067        0.88       0.75
G.R.S. (synthetic rubber)       9.49      10.40        0.085      0.105        1.29       1.43


Conclusions 

We conclude that for the practical comparison of the rheological behaviour of 
many materials of the kind which we have studied, the Nutting treatment is not 
only adequate and relatively simple, but expresses those quasi-properties of materials 
which are often important in industry and are certainly used extensively for assessing 
their quality. 

The limiting factor at present is lack of homogeneity of samples, and, in some cases, 
latent internal stresses. Where these difficulties can be overcome, it is worth while 
in a fair proportion of cases to extend the Nutting treatment by introducing the 
second term in the series equation of which it forms a part. Even then, the properties 
of the material are described by only four constants, i.e. less than the five constants 
required for even a two-term exponential treatment. 

On the theoretical side much remains to be done, but we are convinced that a 
sound start has been made in approaching the problem in terms of the fundamental 
definition of the equality of time intervals. 




The use of constant-strain relaxation as a method of studying the rheology of 
plastics is open to various objections. It is likely that the method will always be 
secondary to direct straining methods, but we feel that it has its uses, especially 
when it is desirable to measure properties or quasi-properties by two quite 
independent methods. 

We would express our best thanks to the British Electrical and Allied Industries 
Research Association for permission to publish this work and to many friends, too 
numerous to mention by name, who have supplied us with materials and helpful 
advice. But we would especially thank Dr S. Whitehead, Dr P. White and Professor 
H. Dingle for their help and encouragement, and Miss M. Baron, Mr W. Lethersich 
and Miss J. Hazeldean for carrying out many statistical calculations. 


References 

Alexandrov, A. P. & Lazurkin, J. S. 1940 Acta Physicochim. U.R.S.S. 12, 647. 

Bergson, H. 1889 Essai sur les données immédiates de la conscience, p. 115. Transl. Pogson 
(Time and free will), London, 1910. Swan Sonnenschein and Co. 

Blair, G. W. S. 1938 Introduction to industrial rheology. London: J. and A. Churchill. 

Blair, G. W. S. 1944 J. Sci. Instrum. 21, 80. 

Blair, G. W. S. & Coppen, F. M. V. 1939 Proc. Roy. Soc. B, 128, 109. 

Blair, G. W. S. & Coppen, F. M. V. 1940 Brit. J. Psychol. 31, 61. 

Blair, G. W. S. & Coppen, F. M. V. 1942 Amer. J. Psychol. 55, 215. 

Blair, G. W. S. & Coppen, F. M. V. 1943 Amer. J. Psychol. 56, 284. 

Blair, G. W. S. & Veinoglou, B. C. 1944a J. Sci. Instrum. 21, 149. 

Blair, G. W. S. & Veinoglou, B. C. 1944b Nature, 153, 165. 

Bosworth, R. C. L. 1946 Nature, 157, 447. 

Bridgman, P. W. 1945 Rev. Mod. Phys. 17, 3. 

British Rheologists' Club (Committee) 1946 Essays in Rheology. London: Sir Isaac Pitman 
and Sons. 

Caffyn, J. E. 1944 J. Sci. Instrum. 21, 213. 

Caffyn, J. E. & Blair, G. W. S. 1945 Nature, 155, 171. 

Dingle, H. 1942 Phil. Mag. 33, 321. 

Dingle, H. 1944 Phil. Mag. 35, 499. 

François, M. 1928 C.R. Soc. Biol., Paris, 98, 152. 

Gemant, A. 1936 Physics, 7, 311. 

Gemant, A. 1944 Phil. Mag. 25, 540. 

Grunberg, L. & Nissan, A. H. 1945 Nature, 156, 241. 

Hoagland, H. 1935 Pace-makers in relation to aspects of behavior. New York: Macmillan. 

Love, A. E. H. 1906 A treatise on the mathematical theory of elasticity, 2nd ed. Camb. Univ. 
Press. 

Mooney, M. 1940 J. Appl. Phys. 11, 582. 

Noüy, L. du 1936 Biological time. London: Methuen. 

Nutting, P. G. 1921a J. Franklin Inst. 191, 679. 

Nutting, P. G. 1921b Proc. Amer. Soc. Test. Mater. 21, 1162. 

Piéron, H. 1945 Presidential Address to French Ass. for Advancement of Science, Paris. 

Poincaré, H. 1904 La Valeur de la Science, p. 44. Paris: Flammarion. 

Prache, P. 1930 Caoutch. et Gutta-p. 32, 15003. 

Reiner, M. 1933 Naturwissenschaften, 21, 294. 

Schrödinger, E. 1944 What is life? Camb. Univ. Press. 

Whitehead, J. B. 1935 Impregnated paper insulation, p. 192. New York: Wiley. 



Kinetic boundary friction 

By J. R. Bristow, Engineering Division, National Physical Laboratory, 

Teddington, Middlesex 

(Communicated by Sir Charles Darwin, F.R.S.; received 10 October 1945; 

read 28 February 1946) 

[Plate 3] 

It is shown that the exhibition of smooth sliding and relaxation oscillations, or 'stick-slips', 
under conditions of boundary lubrication, when frictional forces are measured by the 
deflexion of an elastic system, may be explained as due to the dependence of kinetic friction 
on velocity. In the cases giving smooth sliding, kinetic friction decreases as velocity decreases, 
at very low speeds; for the cases giving relaxation oscillations kinetic friction increases as 
velocity decreases. That is, sliding under boundary conditions is not inherently discontinuous, 
any discontinuous motion being due to the dynamics of the measuring instrument, and is the 
result of kinetic friction increasing as velocity decreases. Curves of boundary friction 
against velocity, using various sliding surfaces, have been determined for a number of 
lubricants, which show both the above-mentioned types of friction-velocity relationship; and 
the dependence is shown of kinetic boundary friction on molecular weight for a series of 
esters of the fatty acids, on percentage of fatty oil in a compounded lubricant (actually oleic 
acid in mineral oil) and on temperature for a pure substance and a mineral oil. The 
measurements with the series of esters show some agreement with results given by Fogg (1940). The 
mixtures of oleic acid with mineral oil give decreasing kinetic friction with increasing 
percentage of oleic acid right up to 100 % oleic acid. The effect of temperature on the dependence 
of friction on velocity shows that the temperature at which relaxation oscillations first occur 
depends on the speed of sliding, from which it appears that measurements of the temperature 
at which relaxation oscillations start at constant sliding speed (Frewing 1942) are not a 
measure of the temperature at which there is a discontinuity in the properties of the boundary 
layer. 


Introduction 

Although numerous investigations into the nature of boundary friction, and friction 
in general, have been made over a period of some 250 years, it is only during the 
present century that any real advance has been made towards some slight 
understanding of the phenomenon of boundary lubrication and the nature of boundary 
lubricating layers. 

Now, of necessity, investigations into the nature of friction under conditions of 
boundary lubrication consist mainly of measuring coefficients of friction. 
Examinations of boundary layers by electron diffraction, etc., are only of a subsidiary nature. 
Hence the investigator of boundary-friction phenomena is in the somewhat difficult 
position of trying to obtain fundamental information by means of something 
(friction) the mechanism of which is not understood. All the investigator can hope to do 
is to interpret the values of coefficients of friction obtained in the light of 
information available on the physical and chemical properties of thin films of organic 
substances (lubricants are almost invariably organic) on solid, usually metal, 
surfaces. That useful literature of the desired nature is almost non-existent is well 
known. However, the nature of thin films of organic substances on water has been 
very fully investigated since Langmuir's paper on the subject in 1917 (see Adam 
1938), and this knowledge was drawn on by Hardy (1936) to interpret his friction 
measurements. To explain the continuous change in frictional properties with chain 
length for series of long-chain compounds, fatty acids, alcohols, etc., Hardy assumed 
that the power to act as a boundary lubricant is due to an adsorbed layer which is 
monomolecular and orientated, as are films of the same substances as condensed 
films on water. Orientation of lubricant layers on metal surfaces has been 
confirmed by X-ray and electron-diffraction experiments by many investigators 
including Trillat (1925), Bragg (1925) and Clark, Sterret & Lincoln (1936). An attempt 
to take the analogy between films on water and films on metal surfaces still further 
has been made by Frewing (1942). It is presumed by Frewing that his friction 
measurements determine the temperature at which the orientated boundary layer 
becomes disordered and that surface films may change from condensed to expanded 
types; but it is thought that the experimental evidence is insufficient to warrant all 
the conclusions drawn. 

However, full use of the available variables, which might reasonably be expected 
to affect boundary friction, has not been made until quite recently; in particular, the 
effects of speed and temperature. For example, the greatest number of friction 
measurements made is probably of values of 'static' friction. In all Hardy's work 
values of the coefficient of 'static' friction were determined, the static value being 
that at which motion is first apparent with slowly increasing tangential load. It 
was, however, shown, as long ago as 1877, by Kimball that kinetic friction ($F_K$) 
may decrease with decreasing velocity at very low speeds, and Kaufmann (1910) 
and Jacobs (1912) suggested that kinetic friction tends to zero as the speed falls to 
zero and that at high speed the friction approaches an asymptotic value (see 
figure 1c). Hence, for any value of tangential force applied less than this asymptotic 
value, uniform motion will ensue. This is as Hardy found for the longest chain 
compounds he used, e.g. palmitic acid, but he failed to appreciate that 'static' friction 
($F_S$) might in fact not exist, for clearly the value of $F_S$ will depend on the sensitivity 
of the method used to ascertain when motion has started, unless $F_S > F_K$, when it 
would seem that no ambiguity could occur. It will be seen later, however, that even 
if $F_S$ is apparently greater than $F_K$ unambiguous determinations of 'static' friction 
are not necessarily possible. 

If the frictional force is measured with an elastic system, which carries one 
friction surface and presses it against a moving surface, the elastic system may 
perform relaxation oscillations, called in the literature 'stick-slips' (see Bowden & 
Leben 1939, 1940; and Muskat, Morgan & Reed 1941). The term relaxation 
oscillation is preferred to 'stick-slip', since 'stick-slip' appears to be associated with 
'fluctuations in friction' rather than with the motion of an elastic system. At first 
an explanation of these relaxation oscillations, and also of the cases where no 
oscillations occur, was sought in the difference between 'static' and kinetic friction 
and the dependence of kinetic friction on velocity. In actual fact the relation of 
'static' friction (if it exists) to the work discussed later proves to be obscure, and 
simple considerations of the dynamics of the apparatus used and examination of 
the results obtained indicate that, if relaxation oscillations occur, it is sufficient 
that kinetic friction increases with decreasing velocity, at low speeds (see figure 1b), 
and, if no relaxation oscillations occur, that kinetic friction decreases with decreasing 
velocity, at very low speeds (figure 1c). The presence or absence of 'stick-slips' at 
any particular temperature is therefore dependent on the speed of sliding, the 
dynamics of the instrument and the nature of the dependence of kinetic friction on 
velocity. The work of Bowden & Leben (1939, 1940) and Muskat et al. (1941), and 
the work of Frewing (1942) on the effect of temperature on boundary lubrication all 
suffer from the failure to consider the dynamics of the instrument used. 

In the following pages elementary considerations of the behaviour of the friction 
measuring instrument used, and several representative examples of kinetic boundary- 
friction determinations are given. The discussion also covers the apparatus used by 
Muskat et al. (1941), Bowden & Leben (1939, 1940) and Frewing (1942). It is 
considered that as yet knowledge of the phenomena exhibited under boundary 
conditions is not sufficient to make it possible to deduce from the results obtained a 
reasonable theory of either kinetic boundary friction or the nature of the molecular 
layers responsible for boundary lubrication. Hence the results are given quite 
separately and discussed in connexion with comparable results of other workers 
and not in relation to one another. 


Elementary theory of the behaviour of an elastic system deflected by 
boundary-friction forces at a moving surface, and pertinent experiments 

The considerations put forward here concern the motions of an elastically 
restrained upper surface which are produced by the frictional forces at its point of 
contact with a lower moving surface against which it is pressed. The simple 
assumptions made in the first instance are that static friction ($F_S$) is greater than kinetic 
friction ($F_K$), and that kinetic friction is independent of velocity (see figure 1a). 
This case has been considered in some detail by Thomas (1930), and it is easy to 
show that the elastic system exhibits relaxation oscillations for slow sliding speeds. 

Consider the upper friction surface to be in its undeflected position; then, as the 
lower surface moves it carries the upper with it until the spring force ($F$) (plus a 
small force due to viscous damping) exceeds the static friction force and slip takes 
place. The system will now perform oscillations according to the equation, with 
suitable choice of units, 

$$\ddot{x} + k\dot{x} + x = \pm F_K.$$

If $F_K$ is not much less than $F_S$ only one swing is made about $+F_K$ before the 
system comes to rest relative to the moving surface in such a position that $F$ lies 
between $+F_K$ and $-F_K$ and oscillations must cease. Then the upper surface is 
carried forward again by the lower until $F_S$ is again exceeded and the cycle repeated. 




This is illustrated in figure 2 for various amounts of damping. If the damping is 
such as to make the system aperiodic, after the first deflexion and slip, the deflexion 
will become constant at $F_K$. 

[Figures 1a, b, c. Friction plotted against velocity.] 

If $F_S = F_K$ clearly no oscillations occur, the deflexion being stable at $F_K$. 

If the system is virtually undamped, the natural period of the elastic system short 
and the velocity of the moving surface low, the relaxation oscillation will be 
'saw-toothed' in shape, as are the 'stick-slip' figures given by Bowden & Leben (1939). 
That the motion during slip is approximately sinusoidal has been shown by Bristow 
(1942) and Muskat et al. (1941).* If the velocity during 'slip' is sufficiently high a 
noticeable temperature 'flash' may be produced by the work done against the 
frictional force. Calculations of the rise of temperature to be expected on the lines 
given by Blok (1937) and Jaeger (1942) give values of the same order of magnitude 
as those measured by using sliding contacts of dissimilar metals as thermocouples. 
The highest temperature flash that has been recorded on an instrument identical 
with that of Bowden & Leben is about 30 °C. It would appear that the observations 
published by Bowden & Leben (1939) are sufficiently explained by the above 
considerations. 

* A case shown by Muskat et al. (1941) (figure 7, their paper) where $F_K$ is small compared 
with $F_S$ shows two 'swings' between each slip about first $+F_K$ and then about $-F_K$, 
indicating that $F_S$ does not come into play immediately because the relative velocity between 
the surfaces is zero, and that the condition of 'stick' is that $F$ falls between $+F_K$ and $-F_K$. 

However, further observations of the motions of such an elastic system indicated 
that the above simple assumptions regarding boundary friction were only a first 
approximation. 

The important observations were: (a) if the system is damped to be aperiodic, 
oscillations did not cease; (b) with a virtually undamped system the size of the 
relaxation oscillations decreases as the speed of the moving surface increases, eventually 
becoming vanishingly small; at still greater speeds quasi-sinusoidal oscillations may 
be maintained in the system; and (c) the total movement of the elastic system during 
'stick' was less than the movement of the lower surface in the same time, and 
decreased with increase of lower surface speed to become necessarily zero when 
oscillations ceased. From these experimental observations it is clear that $F_K$ is not 
independent of velocity, and it appears that, in fact, $F_K$ decreases rapidly as velocity 
increases, quickly becoming sensibly constant (see full-line figure 1b). It may readily 
be shown that if $F_K$ varies as in figure 1b, the elastic system may exhibit relaxation 
oscillations depending on the amount of viscous damping. 

Let the frictional force be $F(v)$, where $v$ is the relative velocity between the two 
surfaces. Then the equation of motion, when the lower surface is moving with velocity 
$V$, with suitable choice of units, is 

$$\ddot{x} + k\dot{x} + x = F(V - \dot{x}). \qquad (1)$$

Suppose the motion is steady, then $x$ is constant, i.e. $x = F(V)$. This condition may 
be unstable. Suppose a small perturbation given by $x = F(V) + y$, then substituting 
in (1) 

$$\ddot{y} + k\dot{y} + F(V) + y = F(V - \dot{y}).$$

Expanding the right-hand side by Taylor's theorem 

$$\ddot{y} + k\dot{y} + y = -F(V) + [F(V) - \dot{y}F'(V) + \dots],$$

neglecting terms in the expansion after the second 

$$\ddot{y} + (k + F'(V))\dot{y} + y = 0.$$

For $F_K$ varying with velocity as in figure 1b, $F'(V) < 0$. Hence if $k$ is negligible the 
motion is certainly unstable, but stability may be produced if the value of $k$ is 
sufficiently great. If $F_K$ varies as in figure 1c obviously the motion is always stable. 
(Case of Jacobs 1912 and Kaufmann 1910.) 
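
The stability condition reached above, that steady sliding is stable when the effective damping $k + F'(V)$ is positive, can be checked numerically. The sketch below is illustrative only: the exponential friction law and its constants are hypothetical stand-ins for a curve of the figure 1b type, and the function names are introduced here, not taken from the paper.

```python
import math

def friction(v, f_inf=0.10, f_drop=0.05, v0=0.02):
    """Hypothetical friction law that falls with velocity (figure 1b type)."""
    return f_inf + f_drop * math.exp(-v / v0)

def friction_slope(v, h=1e-6):
    """Central-difference estimate of F'(v)."""
    return (friction(v + h) - friction(v - h)) / (2.0 * h)

def is_stable(k, sliding_speed):
    """Steady sliding is stable when the effective damping k + F'(V) > 0."""
    return k + friction_slope(sliding_speed) > 0.0

# Units follow the 'suitable choice of units' of equation (1).
print(is_stable(0.1, 0.01))   # light damping, steeply falling friction
print(is_stable(2.0, 0.01))   # heavy damping at the same speed
print(is_stable(0.1, 0.20))   # light damping where friction is nearly constant
```

With these assumed constants the slope at the lowest speed is about $-1.5$, so only the heavily damped case is stable there, while at the higher speed even light damping suffices because the friction curve has flattened.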

These considerations and experimental observations agree with the work of 
Papenhuyzen (1938) on the sliding of rubber on glass and other surfaces. 
Papenhuyzen was also able to show, however, that on reducing the speed of the lower 
friction surface sufficiently, oscillations ceased and smooth sliding ensued at values 
of friction ($F_K$) which decreased with speed. This is indicated by the dotted extensions 
to the curves in figure 1b. 

In concluding this section it may be emphasized that on the foregoing analysis 
relaxation oscillations (or 'stick-slips') are fundamental to the type of instrument, 
having regard to the dependence of friction on velocity and the dynamics of the 
elastic system, and not that 'these discontinuities are inherent in the sliding of 
metals' (Bowden & Leben 1939). Hence it is not possible to deduce from the papers 
of Bowden & Leben (1939) or others describing work with similar apparatus any 
information regarding kinetic friction other than that $F_K < F_S$, if 'stick-slips' occur. 


Experimental 

The friction-measuring apparatus was designed to determine curves of boundary 
friction against velocity at low speeds on the basis of the foregoing analysis, which 
indicates that to determine the period of the elastic measuring system is not of 
primary importance but that high damping is required to make the system stable. 

A general view of the instrument is shown in figure 3, plate 3, and the essential part 
of the system diagrammatically in figure 4. The tangential forces due to friction are 
measured between the upper spherically ended surface, A, and the lower flat 
surface, B. The use of a spherically ended surface on a flat in order to obtain boundary 
conditions is as described by Hardy (1925). The flat surface is carried on a trolley, 
T, driven at various constant speeds by a hydraulic ram designed to impart as smooth 
a motion as possible to the trolley. The driving rod, B, of the ram, is just visible in 
figure 3. The speed is controlled by the rate of flow of the hydraulic fluid into the 
system. In order to work at temperatures above normal room temperatures, a 
heater, E, is built into the trolley; the cooling system, C, beneath the heater is to 
keep the heat localized to the friction surface. The trolley wheels run on horizontal 
rails, one of which is a V-groove to define the direction of motion. The speed of the 
lower surface is measured by timing over a fixed distance by a stop-watch electrically 
operated by the trolley. The top spherically ended surface is carried on the rigid 
frame, F, attached to the four wires W, X, Y and Z, which are arranged to be in 
equal tension in pairs (i.e. $T_W = T_X$, $T_Y = T_Z$) and are tensioned by the spindles, F. 
Then, with the top surface fixed on the centre line of the frame, F, it will be deflected 
parallel to the direction of motion of the trolley by the forces at its point of contact 
with the lower surface. This arrangement was adopted, as it was at one time thought 
that twisting of the surfaces relative to each other might produce increased adhesion 
between the surfaces (Bristow 1942). This was later found to be an unnecessary 
refinement. The elastic system is heavily damped by the arrangements D, D, each 
of which consists of a piston attached to the carriage and moving in a cylinder, 
carried on the body of the apparatus. There is a small clearance between piston and 
cylinder which is filled with oil. Deflexions of this elastic system are measured by 
a mirror-lamp system M, and are observed visually on a scale (not shown). The 
deflexions are calibrated in terms of force by pulling the system forwards by a string 
attached to the upper friction surface which runs over a pulley (not shown) to a scale 
pan. Load is applied through the cantilever, L, by turning the screw, which bears 
on the frame, F. The load is determined by measuring the force necessary to lift 
the top surface out of contact with the lower by means of the weigh beam, E. The 
observations taken are deflexion and time to travel a certain distance, from which 
the coefficient of friction and velocity can be determined, and curves of friction 
against velocity constructed. In some cases, with the heaviest convenient damping, 
small oscillations occur at the very lowest speeds. For such cases the curves through 
the experimental points are drawn through the mean of the points obtained from 
the maximum and minimum deflexions during the oscillations. This does not 
produce any great uncertainty in the curves, as in such cases the coefficient of 
friction is always rising very rapidly as velocity decreases. 
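
The reduction of one observation to a point on the friction-velocity curve can be sketched as below. This is a hedged illustration: the function names, the linear calibration constant `force_per_division` and the sample numbers are assumptions made here, since the paper does not give the calibration or the working formulae explicitly.

```python
def mean_deflexion(max_deflexion, min_deflexion):
    """Mean of the extreme deflexions, used when small oscillations persist."""
    return 0.5 * (max_deflexion + min_deflexion)

def reduce_observation(deflexion, force_per_division, load, distance_cm, time_s):
    """Return (coefficient of friction, velocity in cm/sec) for one run.

    Assumes the deflexion scale is linear in force, as fixed by the
    scale-pan calibration, and that force and load share the same units.
    """
    friction_force = deflexion * force_per_division
    return friction_force / load, distance_cm / time_s

# Hypothetical run: deflexion swinging between 10 and 14 scale divisions.
d = mean_deflexion(14.0, 10.0)
mu, velocity = reduce_observation(d, force_per_division=5.0, load=500.0,
                                  distance_cm=2.0, time_s=40.0)
print(mu, velocity)
```

Repeating the reduction over a range of trolley speeds gives the set of points through which a curve of friction against velocity is drawn.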



Figure 4. Kinetic friction apparatus. Details of suspension. 


Two methods of cleaning the friction surfaces have been used: (1) polishing with 
0000 emery paper, scrubbing in strong caustic soda solution with cotton-wool 
until the entire surface is completely wetted with water, washing in water and 
drying with acetone; and (2) cutting the surface with a clean tool as described by 
Ernst & Merchant (1940). In general the first method has been used on steel surfaces 
and the second on soft materials. All the results given were obtained under conditions 
of constant load and with excess lubricant present. 


Results 

A selection of curves of coefficient of friction against velocity are given here which 
are directly comparable with other published work on boundary friction, and which 
illustrate the discussion already given. It will be seen that these results do not 
conflict with those given by other workers, but do add information which is not in 
agreement with the interpretations given by previous investigators. 





Results are given for: 

(i) A series of ethyl esters of saturated fatty acids for two pairs of sliding surfaces, 
(1) hard steel on hard steel, and (2) phosphor bronze on hard steel, at room 
temperature. 

(ii) A mineral oil containing additions of various percentages of oleic acid, with 
sliding surfaces of (1) steel on steel, and (2) brass on hard steel, at room temperature. 

(iii) For (1) ethyl palmitate lubricating the sliding of tin on hard steel and for 
(2) a mineral oil used as lubricant for steel sliding on hard steel, at various 
temperatures. 


(i) The kinetic boundary friction of a series of esters 

The series of esters used is given in table 1, and the measurements were made at 
room temperature. The results of the determinations are given in figures 5 and 6, 
the curves being numbered for the corresponding lubricants as in table 1. The nature 
of the results may perhaps be seen better in the plot of coefficient of friction against 
molecular weight given in figures 7 and 8. 


Table 1. Esters used 

number   ester               formula               molecular weight
1        ethyl formate       H.CO.O.C₂H₅                  74
2        ethyl acetate       CH₃.CO.O.C₂H₅                88
3        ethyl propionate    C₂H₅.CO.O.C₂H₅              102
4        ethyl valerate      C₄H₉.CO.O.C₂H₅              130
5        ethyl oenanthate    C₆H₁₃.CO.O.C₂H₅             158
6        ethyl capryllate    C₇H₁₅.CO.O.C₂H₅             172
7        ethyl pelargonate   C₈H₁₇.CO.O.C₂H₅             186
8        ethyl laurate       C₁₁H₂₃.CO.O.C₂H₅            228
9        ethyl palmitate     C₁₅H₃₁.CO.O.C₂H₅            284



Figure 5. Hard steel sliding on hard steel lubricated with ethyl esters 
of saturated fatty acids at room temperature. 






[Horizontal axis: velocity, cm per sec.]

Figure 6. Phosphor bronze sliding on hard steel lubricated with ethyl esters
of saturated fatty acids at room temperature. 




Figure 8. Phosphor bronze sliding on hard steel lubricated with ethyl esters
of saturated fatty acids at room temperature. 






The results given in figure 7 for hard steel on hard steel are comparable with those
of Fogg (1940) for 'static' friction measurements on the same series of esters, using
a modified Deeley machine. Fogg's measurements gave a curve of coefficient of
'static' friction against molecular weight similar to the one given here for a constant
velocity of 0.01 cm./sec., with the maximum of the curve between ethyl propionate
(3) and ethyl valerate (4). Fogg's explanation assumed that the coefficient of friction
is inversely proportional to the distance between the two friction surfaces, as
determined by a monomolecular adsorbed layer on each surface. Roughly, it was
that the esters are attached to the surface by the double-bond oxygen atom in
the molecule, with the two branches (oxygen atom and ethyl group, and varying
fatty acid chain) standing at angles determined by their relative weights: the
greater the difference in branch size, the more nearly perpendicular to the surface
the longer branch stands. Hence a minimum thickness of adsorbed film may be
expected between ethyl propionate (3) and ethyl valerate (4), at which the maximum
friction of the series was found.

However, for the determinations of kinetic boundary friction a maximum is 
only found at slow speeds for hard steel on hard steel. At high speeds, the friction 
falls continuously with increasing molecular weight for hard steel sliding on hard 
steel, and for phosphor bronze on hard steel no maximum is shown even at very low 
speeds (in this latter case the points are somewhat scattered). 

It is apparent that the explanation given by Fogg is insufficient to explain all 
of the above results. 

(ii) The kinetic boundary friction of a mineral oil containing additions of oleic acid 

The results are given in figures 9 and 10. It will be seen that the coefficients of 
kinetic boundary friction are continuously reduced by increasing additions of oleic 
acid to the mineral oil, the oleic acid having good properties as a boundary lubricant, 
and the mineral oil poor properties. The effect of additions of fatty oil is greatest at 
low concentrations, and it will be noted that at high concentrations of fatty substance 
the friction-velocity curve changes in type from that shown in figure 1b to that shown
in figure 1c, for brass on steel, but not for steel on steel. It is clear that compounding
a mineral oil with a fatty oil produces considerable improvements in boundary 
lubricating properties even for small additions, say up to 5 % fatty oil, but that the 
friction still continues to decrease slowly right up to 100 % of the fatty constituent. 
At high speeds the effect of additions of the fatty substance tends to decrease. 

Such a result is to be expected, since the boundary layer will probably always
contain a greater percentage of the more 'active' constituent than the solution. But
apparently even at high concentrations of the 'active' substance (oleic acid) the
boundary layer still contains an appreciable amount of the less active substance
(mineral oil). The effects to be expected will, of course, depend on the relative surface
'activities' of the constituents of any particular mixture.

In figures 11 and 12 coefficients of friction are plotted against concentration of 
oleic acid in mineral oil at certain constant speeds, and show very clearly how small 



amounts of oleic acid reduce the friction very considerably. These graphs may be 
compared with those given by Fogg & Hunwicks (1940) for additions of rape oil to 



Figure 9. Brass sliding on steel lubricated with mineral
oil—oleic acid mixtures at room temperature. 



oil—oleic acid mixtures at room temperature. 


a mineral oil and oleic acid to B.P. paraffin, for determination of 'static' friction
with a modified Deeley machine. The curves given by Fogg & Hunwicks are the 
same shape, and show the same facts as those given in this paper for low velocity. 











[Figures 11 and 12. Vertical axis: coefficient of friction.]


(iii) The effect of temperature on kinetic boundary friction

The effect of temperature on kinetic boundary friction for various surfaces and
lubricants appears to be somewhat complicated, and the discussion here is confined











to two of the simpler cases investigated, namely, (1) tin sliding on hard steel lubricated
with ethyl palmitate, and (2) steel sliding on hard steel lubricated with a thin mineral
oil. The friction-velocity curves for several temperatures are given in figures 13a
and b and 14a and b. Figures 13b and 14b give the low-velocity ends of figures
13a and 14a in greater detail by plotting velocity on a larger scale.

(1) Ethyl palmitate lubricating tin on hard steel. This case is of particular interest
in relation to the work published by Tabor (1940, 1941), Frewing (1942) and Hughes
& Whittingham (1941).

At the higher velocities the coefficient of kinetic friction decreases with increase
of temperature; at the lower velocities, however, the coefficient of kinetic friction
increases, the increase extending to greater speeds as the surfaces get hotter. The
change of shape of the curves of friction against velocity shown at very low velocities
necessarily means that the tendency to give relaxation oscillations increases as the
temperature is raised. The crosses on the graphs of figure 13 indicate approximately
the highest velocity at which relaxation oscillations are first apparent. The velocity
at which the oscillations vanish is, of course, dependent on the amount of damping
employed. It is clear, therefore, that if the lower surface velocity in the apparatus is
constant at, say, 0.01 cm./sec., no relaxation oscillations will occur even if the
temperature is raised to 170° C. On the other hand, if the surface velocity is less
than about 0.002 cm./sec., relaxation oscillations will occur at room temperature.
It is clear that over this range of temperature 'stick-slips' can be made to start at any
desired temperature as the surface temperature is raised, by choosing the correct
velocity of the lower surface. Using steel on steel lubricated with ethyl palmitate the
change in shape of the curves at very low velocity is essentially the same as for the
above case.
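The dependence of the onset of relaxation oscillations on both the slope of the friction-velocity curve and the damping can be illustrated with a linearized spring-and-slider model. This is a minimal sketch under stated assumptions, not the apparatus or analysis of the paper: the slider obeys m x'' = N mu(v0 - x') - c x' - k x, and all names and numerical values are hypothetical.

```python
def effective_damping(friction_slope: float, normal_load: float,
                      damping: float) -> float:
    """Coefficient of x' in the linearized slider equation
    m*x'' + (damping + normal_load*friction_slope)*x' + k*x = const,
    where friction_slope is d(mu)/dv at the driving velocity v0."""
    return damping + normal_load * friction_slope


def oscillations_grow(friction_slope: float, normal_load: float,
                      damping: float) -> bool:
    """Small oscillations grow only if the effective damping is negative,
    i.e. the friction falls with velocity steeply enough to beat the damping."""
    return effective_damping(friction_slope, normal_load, damping) < 0.0


# Hypothetical values: a falling friction-velocity curve with light damping.
unstable = oscillations_grow(friction_slope=-2.0, normal_load=100.0, damping=50.0)
```

In this picture a steeper negative slope at low velocity (as at higher temperature) or a smaller damping both widen the range of velocities in which oscillations appear.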

That some change is taking place, as the temperature is raised, which is connected
with the nature of the molecule of the lubricant and surfaces concerned, cannot be
doubted. But that this change is necessarily connected with the sudden disorientation
or desorption of an adsorbed film, and that the temperature is characteristic
of the lubricant (i.e. that there is some sudden change at this temperature) as
suggested by Tabor (1940, 1941) and supported by Frewing (1942), cannot be
maintained. In this connexion it may be noted that the values given by Hughes
& Whittingham for the temperature at which 'stick-slips' start, with fatty
acids as lubricants, are between 20 and 50° C higher than the values given by
Frewing.

(2) A mineral oil lubricating steel on hard steel. For this case the coefficient of
friction increases with rise of temperature over the whole of the velocity range used
(except for 170° C at very low speed). Here again, however, the tendency to give
relaxation oscillations ('stick-slips') increases as the temperature rises. As in (1)
above it is possible to make the temperature at which relaxation oscillations start
any desired value by fixing correctly the velocity of sliding.

At this stage, it is not possible to give a satisfactory explanation of these phenomena.





This paper is published by permission of the Director of the National Physical 
Laboratory and the Director General of Scientific Research and Development,
Ministry of Supply, for whom the investigation was made. 


References

Adam 1938 The physics and chemistry of surfaces. Oxford University Press.
Blok 1937 Inst. Mech. Engrs., Discussion on Lubrication and Lubricants, Group iv, p. 26.
Bowden & Leben 1939 Proc. Roy. Soc. A, 169, 391.
Bowden & Leben 1940 Phil. Trans. A, 239, 1.
Bragg 1925 Proc. Roy. Inst. 24, 481.
Bristow 1942 Nature, 149, 169.
Clark, Sterrett & Lincoln 1936 Industr. Engng Chem. 28, 1318.
Ernst & Merchant 1940 Mass. Inst. Tech., Proc. Conf. on Friction and Surface Finish, p. 76.
Fogg 1940 Proc. Phys. Soc. 52, 239.
Fogg & Hunwicks 1940 J. Inst. Petrol. 26, 1.
Frewing 1942 Proc. Roy. Soc. A, 181, 23.
Hardy 1936 Collected Works. Cambridge University Press.
Hardy 1925 Proc. Roy. Soc. A, 108, 2.
Hughes & Whittingham 1941 Trans. Faraday Soc. 38, part 1, 9.
Jacobs 1912 Ann. Phys., Lpz., 38, 126.
Jaeger 1942 J. Roy. Soc. N.S.W. 66, 203.
Kaufmann 1910 Phys. Z. 11, 985.
Kimball 1877 Amer. J. Sci. p. 353.
Langmuir 1917 J. Amer. Chem. Soc. 39, 1848.
Muskat, Morgan & Reed 1941 J. Appl. Phys. 12, 743.
Papenhuyzen 1938 Ingenieur, Utrecht, 53, 75.
Tabor 1940 Nature, 145, 308.
Tabor 1941 Nature, 147, 609.
Thomas 1930 Phil. Mag. 9, 329.
Trillat 1925 C.R. Acad. Sci., Paris, 180, 280.



A general kinetic theory of liquids. 

II. Equilibrium properties. 

By H. S. Green, Edinburgh University
(Communicated by M. Born, F.R.S. Received 4 June 1946)


The object of this paper is to expand and develop the equilibrium theory outlined in the 
first part of this series, within the framework of a general kinetic theory of fluids. 

The entropy of the fluid is first defined in such a way that agreement with the definition 
of statistical mechanics is obtained in equilibrium conditions. It is then found possible, 
provided care is taken to avoid certain well-known fallacies in the application of statistics to 
mechanical systems, to prove that the quantity defined will in general increase with time 
towards a certain maximum. The equilibrium solution of the fundamental equations is
thereby derived without any appeal to statistical mechanics. At the same time a direct proof 
is offered of certain general formulae in statistical thermodynamics. 

The second half of the paper is devoted to a more detailed study of the equilibrium state 
by a method proposed in part I of the series. It is shown that the liquid state is distinguished 
from the gaseous state by the existence of real roots of a certain transcendental equation. 
Approximate but tractable expressions are found for all the virial coefficients in the gaseous 
phase. An equation of state is derived with two branches, which are identified with the
gaseous and liquid states respectively. 


1. Introduction

In part I of this series (Born & Green 1946), Professor Max Born and the author gave
a general survey of a theory designed to describe both equilibrium and dynamical
properties of liquids. A brief treatment of the equilibrium state was included,
sufficient to show that results consistent with the application of statistical mechanics
could be obtained. There were, however, two important gaps in our outline, which
it is the purpose of the present paper to fill.

In the first place, we found it necessary to appeal to statistical mechanics in order
to prove that the Maxwellian distribution of velocities in a molecular assembly was
a unique consequence of our equilibrium theory, though we were afterwards able
to show that by assuming this, the complete statistical mechanical solution could
be obtained. It is obviously desirable, however, to make an exposition of this kind
independent, as far as possible, of external theories, even though their subject-matter
may overlap to a certain extent with our own. The first part of this paper is therefore
devoted to a generalization of the famous Boltzmann H-theorem (Chapman &
Cowling 1939), which enables us to establish our treatment of the equilibrium
properties on an independent basis. At the same time a direct proof is offered of
certain general formulae of statistical thermodynamics.

Secondly, although the object of this series is specifically to provide a theory of
the liquid state, in part I no indication was given of the role played by this particular
phase in our theory. Indeed, in its present form, it is equally applicable to the
gaseous phase, and, apart from modifications probably of a quantum-mechanical
nature, to the solid state as well. It just happens that there are already theories
which adequately describe these other states. However, some explanation of the
phenomenon of condensation, and the distinction between the gaseous and liquid
states, is plainly required, and this it is the endeavour of the second part of the
paper to provide.

A theory of condensation, based on statistical mechanics, has already been
elaborated by Mayer and others (Mayer 1937; Born & Fuchs 1938; Kahn 1938), who
showed that condensation is associated with the divergence of the series obtained by
expanding the normalized phase integral in powers of the density. The nature of this
singularity, and of the continuation beyond, has, however, so far remained somewhat
obscure. The investigation which follows indicates that the singularity is, in fact, a
branch point, from which spring two branches associated with the gaseous and liquid
states respectively.* With the aid of an approximate solution of an integral equation
derived in part I, it has been found possible to derive equations of state for both
phases and any other equilibrium properties which may be required. This part of
the work offers many opportunities for numerical comparison with experiment, but
it has not seemed worth while to specialize in this way at present; on account of the
approximation involved, qualitative rather than exact quantitative agreement
may be expected. However, the equilibrium theory now seems sufficiently well
established to enable us to proceed to the more interesting dynamical properties,
some of which Professor Born and the author hope to treat in the third part of this
series.


2. Preliminary

A fundamental theorem in the kinetic theory of gases, due to Boltzmann, shows
that a certain quantity H, closely connected with the entropy S, cannot, on the
average, increase, but must tend to a fixed minimum, associated with the equilibrium
state. Both the definition of H, and the proof of the so-called H-theorem, must,
however, be generalized to apply to the liquid phase.

The appropriate definition is suggested by known results of statistical mechanics,
in the following way. In equilibrium, the free energy $A$ of an assembly of $N$ molecules
is known to be given by

\[ e^{-A/kT} = \frac{1}{N!}\int^{(2N)}\!\!\cdots\!\int e^{-W/kT}\prod_{i=1}^{N} dx^{(i)}\,d\xi^{(i)}, \qquad (2\cdot1) \]

where

\[ W = \tfrac{1}{2}m\sum_{i=1}^{N}\xi^{(i)2} + \Phi \qquad (2\cdot2) \]

in the notation of part I. An independent proof of this result, based on the exact
formula for $n_2$, will be found in the appendix. Note here that (2·2) implies that the
molecules possess no internal degrees of freedom, a simplification which will be made
throughout the present paper.

* The liquid phase is distinguished from the gaseous phase by the existence of real roots of
a certain transcendental equation. In this respect the theory resembles a theory of
condensation proposed by Kaplan & Dresden (1944). In spite of its attractive simplicity,
however, the latter remains extremely speculative.



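It was found in part I that, for equilibrium,

\[ Q f_N = e^{-W/kT}, \qquad (2\cdot3) \]

where

\[ Q = \frac{1}{N!}\int^{(2N)}\!\!\cdots\!\int e^{-W/kT}\prod_{i=1}^{N} dx^{(i)}\,d\xi^{(i)}, \qquad (2\cdot4) \]

so that

\[ A = -\frac{kT}{N!}\int^{(2N)}\!\!\cdots\!\int f_N \log Q \prod_{i=1}^{N} dx^{(i)}\,d\xi^{(i)}
     = \frac{kT}{N!}\int^{(2N)}\!\!\cdots\!\int \Big(\frac{W}{kT} + \log f_N\Big) f_N \prod_{i=1}^{N} dx^{(i)}\,d\xi^{(i)}. \qquad (2\cdot5) \]

The entropy is

\[ S = \frac{E - A}{T}, \qquad (2\cdot6) \]

where

\[ E = \frac{1}{N!}\int^{(2N)}\!\!\cdots\!\int W f_N \prod_{i=1}^{N} dx^{(i)}\,d\xi^{(i)} \qquad (2\cdot7) \]

is the internal energy. Hence, in equilibrium,

\[ S = -\frac{k}{N!}\int^{(2N)}\!\!\cdots\!\int f_N \log f_N \prod_{i=1}^{N} dx^{(i)}\,d\xi^{(i)}. \qquad (2\cdot8) \]

(2·8) will be adopted as the definition of the entropy, even when equilibrium
conditions do not exist, and the proof of the H-theorem will then consist of showing that

\[ H = \frac{1}{N!}\int^{(2N)}\!\!\cdots\!\int f_N \log f_N \prod_{i=1}^{N} dx^{(i)}\,d\xi^{(i)} \qquad (2\cdot9) \]

cannot increase.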
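The consistency of the two expressions for the entropy, S = (E - A)/T and the 'f log f' form, can be checked numerically on a discrete analogue. The sketch below is an illustration only: it replaces the phase-space integrals by a sum over a few hypothetical energy levels and uses probabilities normalized to unity rather than the N! normalization of the text; the function name and the sample energies are assumptions.

```python
import math


def boltzmann_thermodynamics(energies, kT, k=1.0):
    """Return (A, E, S_gibbs, S_thermo) for a discrete Boltzmann distribution.

    A        = -kT log Q            (free energy, Q = sum of exp(-W/kT))
    E        = sum of p*W           (internal energy)
    S_gibbs  = -k * sum p log p     (entropy from the distribution)
    S_thermo = (E - A)/T            (thermodynamic entropy, T = kT/k)
    """
    weights = [math.exp(-w / kT) for w in energies]
    q = sum(weights)
    probs = [x / q for x in weights]
    free_energy = -kT * math.log(q)
    internal_energy = sum(p * w for p, w in zip(probs, energies))
    s_gibbs = -k * sum(p * math.log(p) for p in probs)
    s_thermo = (internal_energy - free_energy) / (kT / k)
    return free_energy, internal_energy, s_gibbs, s_thermo


# Hypothetical energy levels in units where k = 1.
A, E, S_gibbs, S_thermo = boltzmann_thermodynamics([0.0, 1.0, 2.5], kT=0.7)
```

The two entropies agree to rounding error, because log p = -W/kT - log Q for every level.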

According to (4·5) of part I, the equation satisfied by $f_N$ is

\[ \frac{\partial f_N}{\partial t} + \sum_{i=1}^{N}\Big(\xi^{(i)}\cdot\frac{\partial f_N}{\partial x^{(i)}} - \frac{1}{m}\frac{\partial f_N}{\partial \xi^{(i)}}\cdot\frac{\partial \Phi}{\partial x^{(i)}}\Big) = 0, \qquad (2\cdot10) \]

if $f_{N+1}$ is assumed to vanish. The 'auxiliary equations' for the integration of (2·10),

\[ dt = \frac{dx^{(i)}}{\xi^{(i)}} = -\frac{m\,d\xi^{(i)}}{\partial\Phi/\partial x^{(i)}}, \qquad (2\cdot11) \]

are identical with Lagrange's equations for the motion of the system. They yield
$6N$ functionally independent integrals of the form

\[ C_r(t, x^{(1)}, \ldots, x^{(N)}, \xi^{(1)}, \ldots, \xi^{(N)}) = c_r, \qquad r = 1, \ldots, 6N, \]

which, when solved for $x^{(i)}$ and $\xi^{(i)}$, provide a detailed description of the motion
in terms of the time $t$ and the parameters $c_r$, which are specified by the initial
conditions. $f_N$ is thereby determined as an arbitrary function of the $6N$ variables
$C_r$, which include the total energy $W$, the total momentum $M$, and the total
angular momentum $A$ of the system. This solution is, however, much too
general for the present purpose, which requires the form assumed by $f_N$ when the
system has been left for some time, and has arrived at a 'normal' state. Also, the
idea of statistical interaction has disappeared from the equations, which, as one
might expect, leads to the paradoxical result $dH/dt = 0$ even when statistical
equilibrium has not been reached.

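Therefore, instead of (2·10) the equation

\[ \frac{\partial f_N}{\partial t} + \sum_{i=1}^{N}\Big(\xi^{(i)}\cdot\frac{\partial f_N}{\partial x^{(i)}} - \frac{1}{m}\frac{\partial f_N}{\partial \xi^{(i)}}\cdot\frac{\partial \Phi}{\partial x^{(i)}}\Big)
   = \frac{1}{m}\sum_{i=1}^{N}\iint \frac{\partial f_{N+1}}{\partial \xi^{(i)}}\cdot\frac{\partial \phi^{(i,N+1)}}{\partial x^{(i)}}\,dx^{(N+1)}\,d\xi^{(N+1)} \qquad (2\cdot12) \]

will be considered, which has the effect of providing an external system (to be
imagined as the vapour with which the liquid is in contact, for example) with similar
distribution properties, and with which the given system may statistically interact.
Also, to make the treatment as general as possible, the possibility of a conservative
field of force $-\partial\Psi/\partial x$ acting on the molecules is contemplated, which requires only
the replacement of $\Phi$ by $\Phi' = \Phi + \sum_{i=1}^{N}\Psi(x^{(i)})$ in (2·12). This innovation has the
advantage that the boundary of the fluid may be represented by a wall where
$\Psi \to \infty$; of course, at the wall, $f_N$ and surface integrals involving $f_N$ must vanish,
otherwise the total potential energy of the fluid would diverge.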


3. The generalized H-theorem

Now proceed to calculate the rate of change with time of the quantity H defined 
in (2-9). Thus 

dJS 
dt 


smce 


and 


-=j... 

/•(2-w) r 0 N 

fi2N) rs0 s N 


(3-1) 


both vanish on transformation to surface integrals. $f_N$ and $f_{N+1}$ are symmetrical
functions of their positions and velocities, so all terms under the summation in (3·1)
contribute the same value, and hence


dH 

ct 






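The analysis which follows is a generalization of that in appendix I to part I
of this series, with which it should be compared. Write

\[ X = \tfrac{1}{2}(x^{(1)} + x^{(N+1)}), \qquad r = x^{(N+1)} - x^{(1)}, \]
\[ \xi = \tfrac{1}{2}(\xi^{(1)} + \xi^{(N+1)}), \qquad p = \xi^{(N+1)} - \xi^{(1)}, \]
\[ \mathbf{m} = 2m\xi, \qquad a = \tfrac{1}{2}m\,r \times p, \qquad w = \tfrac{1}{4}m p^2 + \phi(r). \qquad (3\cdot4) \]

By solving the equations (3·4), $\xi$ and $p$ can be expressed, and therefore $\xi^{(1)}$ and
$\xi^{(N+1)}$, in terms of $\mathbf{m}$, $a$, $w$ and $r$.* A function $F_{N+1}$ can then be defined by

\[ f_{N+1} = F_{N+1}(t, x^{(1)}, \ldots, x^{(N+1)}, \xi^{(2)}, \ldots, \xi^{(N)}, \mathbf{m}, a, w), \qquad (3\cdot5) \]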

and it is easily verified that 


/ %r+i __§fv+x.\ _ o / ^/v+i \ 

m\a?® 05^+1)jax(i) dw ‘^0r"'‘^\0x(A'+u 0x(^'+i)/* 


( 3 - 6 ) 


On substituting 


1 9 /v+i ^ 
m 05® 0x(W 


from { 3 - 6 ) into ( 3 ' 2 ) the only 


term which does not 


vanish is found to be 

/•( 2 V+ 2 )/‘ gj? v+i 

-Wj ... J(H-log/^)p^^>ndx®d5W 


Now, $\mathbf{m}$, $a$ and $w$ may be regarded as geometrical parameters specifying the
trajectories which would be followed by the molecules numbered (1) and (N+1) if
no other molecules were present. Thus, if the domain of $r$ be imagined as
partitioned by tubes formed by the trajectories of (N+1) relative to (1), it is seen that
the integration over $r$ may be performed, as in appendix I to part I, first along
the typical trajectory, where $\mathbf{m}$, $a$ and $w$ are constant, and then over all values of
the cross-section radius $b$. At each end of the trajectory the interaction vanishes,
and $f_{N+1}$ factorizes into $f_1^{(N+1)} f_N$. Since the sphere around $x^{(1)}$ in which $\phi$ is
effectively different from zero is of microscopic dimensions, it need not be supposed
that $x^{(N+1)}$ differs macroscopically from either $x^{(1)}$ or $X$, or that the time differs
from $t$ at the ends of the trajectories. The initial velocities $\xi^{(1)\prime}$ and $\xi^{(N+1)\prime}$ must,
however, be determined from the actual, final velocities by the equations, following
from (3·4),


\[ m(x^{(1)} \times \xi^{(1)\prime} + x^{(N+1)} \times \xi^{(N+1)\prime}) = m(x^{(1)} \times \xi^{(1)} + x^{(N+1)} \times \xi^{(N+1)}). \qquad (3\cdot7) \]


After the integration, ( 3 * 2 ) becomes 

^ = -W ( 3 - 8 ) 


* Only two components of a are strictly required for this transformation, but for symmetry 
this fact is ignored. 





Now denote by $F_N$ the function obtained from $f_N$ by replacing the variables $x^{(i)}$
and $\xi^{(i)}$ by $x^{(N+i)}$ and $\xi^{(N+i)}$ respectively, and rewrite (3·8), using (2·4) of part I, in
the form

o TT- lirr r^( 4 iV) r 

- |(i+iog/iv)(/i.J'N-/W)p^^x n dx<«nd?o). (3.9) 
Ct (A^ —1)IJ J i-2,iV'+2 

By a series of transformations familiar in the proof of the H-theorem for gases, one
obtains finally

N, 2 N 2 N 

(fy^N-fMp^ibdx n 

i= 2 ,iV’ 4-2 ^*=1 

( 3 - 10 ) 

which makes it clear that $\partial H/\partial t$ is essentially negative or zero.


g^_ N 
It' 4 (N-l)!j 


4. The exact equilibrium solution

It is inferred from the calculations of the last section that it is a condition of 
equilibrium that the entropy should have attained its maximum value, for which 

\[ \log f_N + \log F_N = \log f_N' + \log F_N', \qquad (4\cdot1) \]

where =/^(x^+« ...,x(>^, 5 (^+i),-,1^^), 

[ ( 4 - 2 ) 

F'ij = JJvCxC^+W, 5(iV+® 

and equations (3·7) are satisfied. Owing to the symmetry between the variables,
one obtains from (4·1) by a succession of $N - 1$ arbitrary transformations of the
type (3·7),

\[ \log f_N + \log F_N = \log f_N'' + \log F_N'', \qquad (4\cdot3) \]

where ...,5^'), 

(4-4) 

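and the only restrictions on the variables are

\[ \sum_{i=1}^{2N}\xi^{(i)\prime\prime} = \sum_{i=1}^{2N}\xi^{(i)}, \qquad
   \sum_{i=1}^{2N} x^{(i)} \times \xi^{(i)\prime\prime} = \sum_{i=1}^{2N} x^{(i)} \times \xi^{(i)}, \qquad
   \sum_{i=1}^{2N}\xi^{(i)\prime\prime 2} = \sum_{i=1}^{2N}\xi^{(i)2}. \qquad (4\cdot5) \]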


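It follows that

\[ \log f_N = C + m\,c\cdot\sum_{i=1}^{N}\xi^{(i)} + m\,c'\cdot\sum_{i=1}^{N} x^{(i)} \times \xi^{(i)} + \tfrac{1}{2}m\,C'\sum_{i=1}^{N}\xi^{(i)2}, \qquad (4\cdot6) \]

where $C$, $c$, $c'$ and $C'$ may be functions of the $x^{(i)}$.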





To determine these functions, substitute in (2-12), modified for equilibrium 
conditions. Then 

ax(« 

vanishes; the statistical term 

can be reduced to terms of the form 

1 ri2N) c 2N 2N 

-m-TvA - n n 

J i=iff+2 j-=N+l 

by analysis similar to that of the last section, and also vanishes. {2-12) therefore 
reduces to 

y .^/iv /g(i)_gos = i- y j/iv /4.Y\ 

and then, on substitution from ( 4 - 6 ), 

dC 

ax(«“ ’ 

^(c+c'xx(») + ^(c+c'xx(«) = 0, 

0(7 ^ _ 9 

N d0' ^ _ dO 

(c+c'xx(«)^+_S5®^) = 0. 

The first and second of these equations show that $C'$, $c$ and $c'$ are all constants;
$C'$ is found to be given by $-1/kT$, when the temperature $T$ is defined by

1 m r Jvr _ N 

ikT = ~J ... ( 4 - 9 ) 

Also, by calculation, it is found that $\bar{\xi} = kT(c + c' \times x)$, so that $c$ and $c'$ find simple
interpretations as multiples of the velocity and angular velocity respectively; the
fluid, if in motion, must behave macroscopically like a rigid body. The third of
equations (4·8), taking account of the second, requires that

c = O'^ 0 ' + 4m + G", 

N _ 3 _ 

where G" is a pure constant; and the last reduces to S showing 

U^i ^ 

N- _ 

only that the visible kinetic energy S ^ conserved. 

The most general form for $f_N$ when there are no changes in the entropy has
thus been obtained.
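The statement that the linear coefficient in the logarithm of the distribution is a multiple of the mean velocity can be checked numerically in one dimension. The sketch below is an illustration under stated assumptions, not part of the original derivation: it takes log f = m c xi - m xi^2/(2kT) for a single velocity component (the rotational term is omitted), and the function name, grid and sample values are hypothetical.

```python
import math


def mean_velocity(c, kT, m=1.0, xi_max=50.0, steps=20001):
    """Mean of xi under the weight exp(m*c*xi - m*xi**2/(2*kT)),
    evaluated by a plain Riemann sum on the grid [-xi_max, xi_max]."""
    h = 2.0 * xi_max / (steps - 1)
    weighted_sum = 0.0
    norm = 0.0
    for i in range(steps):
        xi = -xi_max + i * h
        weight = math.exp(m * c * xi - m * xi * xi / (2.0 * kT))
        weighted_sum += xi * weight
        norm += weight
    return weighted_sum / norm


# Hypothetical values: the expected mean is kT * c = 1.5 * 2.0.
drift = mean_velocity(c=2.0, kT=1.5)
```

Completing the square in the exponent shows that the weight is a Gaussian centred on kT c, which is what the numerical mean reproduces.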


( 4 . 8 ) 





5. The solution of the integral equation


Now proceed to examine the equilibrium state in detail by the method proposed
in § 5 of part I, namely, by the solution of the integral equation for $n_2(r)$:


® TcT 




n 


0 


(s2 _ j2) +r) di %(s) ds. (5-1) 


First linearize the equation by writing

\[ n_2(r) = n_1^2\,e^{-\phi(r)/kT}\{1 + f(r)\}, \qquad (5\cdot2) \]

and neglecting squares and higher powers of $f(r)$. This procedure will be justified
if the right-hand side of (5·1) is not too large; hence


J ~ +’■){! + “(*+»■)} dt a'(s) {1 +f{s)} ds 

-TTaiJ^J {s^-t^){t+r)(x(t+r)dtcd{s){l+f{s)}ds, (6-3) 

where \[ \alpha(r) = e^{-\phi(r)/kT} - 1. \qquad (5\cdot4) \]

Now observe that $\alpha(r)$ and $\alpha'(r)$ are both negligible except for small $r$, so that,
when multiplied by these factors, $f(r)$ may be replaced by $\epsilon - 1$, its average value
in the neighbourhood of the origin. Then, after integration by parts,


’■/(^) = J ^ J (i -t- r) {f{t -I- r) -1- ea{t+ r)} dt ea{s)s ds. 

This equation can be solved by Fourier transforms; wri ting 


(5-5) 


1 r® 

"" (2^J 

1 r® 

(2^*j 


(5-6) 


then (5·5) becomes

\[ \lambda g(r) = \{g(r) + \epsilon\beta(r)\}\,\epsilon\beta(r), \qquad (5\cdot7) \]

where $\lambda^{-1} = (2\pi)^{3/2} n_1$, so that

\[ r f(r) = \frac{1}{(2\pi)^{1/2}}\int_{-\infty}^{\infty}\frac{\epsilon^2 s\,\beta(s)^2 \sin(rs)\,ds}{\lambda - \epsilon\beta(s)}. \qquad (5\cdot8) \]

It will be supposed that there is an analytic function $\beta(z)$ which reduces to $\beta(r)$
when $z = r$ is real. The most general solution of (5·5) is then given by (5·8), where
the path of integration may follow any course from negative to positive infinity in
the complex plane. This is most easily seen by remarking that if $f(r)$ is a solution,
so is $f(r) + (A_k/r)\sin(r z_k)$, where $z_k$ is any root of the transcendental equation

\[ \beta(z) = \lambda/\epsilon; \qquad (5\cdot9) \]

the additional term may, however, be obtained from (5·8) by deforming the path of
integration to enclose the singular point $z_k$. It will be seen in the next section,




however, that the precise path of integration is determined by the boundary
condition

\[ f(r) = O(r^{-3}) \quad \text{as } r \to \infty, \qquad (5\cdot10) \]

which follows from the requirement that $\int (n_2 - n_1^2)\,dx$ should converge. (Strictly,
one should write $f(r) = O(r^{-3-\delta})$, $\delta > 0$, but owing to the approximate nature of
(5·2), this refinement is not compulsory.)


6. Thermodynamical expressions

It is now possible to calculate explicitly the internal energy $E$, given by

\[ E = \tfrac{1}{2}V\int_0^{\infty} n_2(r)\,\phi(r)\,4\pi r^2\,dr + \tfrac{3}{2}NkT, \qquad (6\cdot1) \]

and the pressure $p$ from

\[ p = n_1 kT - \tfrac{1}{6}\int_0^{\infty} n_2(r)\,\phi'(r)\,r\,4\pi r^2\,dr. \qquad (6\cdot2) \]


These formulae are known, but a comprehensive derivation is included in the
appendix. Then, from (5·2) and (5·8),


^ 2A A-e^s) 


+lNkT, 


and 

where 


P 


= in^kT |l 4- 


6 ( 27 r)*A/’ 


( 6 - 3 ) 

( 6 - 4 ) 


/ = j* a'(r) {1 +/(r)} r®dr 
= - J a(r) [ 3{1 +f{r)}+ r/'(r)] r^dr 

-'s(to)t/S(0)-£__[-l^W‘+j^^+jIog|l-|yS(.)|]««ii<«. (6-s) 

Combining (6·3) and (6·4), it is confirmed that the equation


■(^)= 


^dT-^ 


( 6 - 6 ) 


is integrable, yielding for the free energy 

—^NkT log T+NkT log »i ( 6 - 7 ) 


apart from an insignificant multiple of T. 





Explicit formulae for all the important thermodynamical functions have thus 
been obtained, except that the path of integration, which is the same in (5·8),
(6·3), (6·5) and (6·7), is so far undetermined. To apply the boundary condition use
is made of the well-known lemma 

linx r ^^ll^sin(r«)ds = lim?r^(i). (6*8) 

It is necessary also to discuss the dependence of $\alpha(r)$ and $\beta(r)$ on the temperature.
Since $\phi(r)$ is negative for all large $r$, in this range $\alpha(r)$ will become increasingly positive
as the temperature decreases. It follows that $\beta(r)$ will behave in the same way when
$r$ is small. Referring to (5·9), it is necessary to distinguish between two cases. For
sufficiently low densities and high temperatures, $\lambda/\epsilon$ exceeds the upper bound of
$\beta(r)$, and (5·9) has no real roots; the path of integration may then be taken along the
real axis, since (5·10) is thereby satisfied. In the region of higher densities and lower
temperatures, however, (5·9) will have a pair of equal and opposite real roots (since
$\beta(z)$ is an even function of $z$), and in this case the path of integration must be taken
just above both singularities on the real axis. On such a path,

p sin (rs) ck _ s{}lr{s) — 'i/r{$Q)}sin{rs)ds 

so that the singularities in the integrand do not affect the behaviour of the integral 
for large r. 

The possibility that the two cases described above correspond to the gaseous and liquid
phases respectively has already been expressed, and in the following sections
convincing evidence will be given to confirm this view.
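The distinction between the two cases, no real root when the ratio written here as lambda/epsilon exceeds the upper bound of beta, and a pair of equal and opposite real roots otherwise, can be sketched numerically. The example is an illustration only: the Gaussian `model_beta` is a hypothetical stand-in for the transform beta of the text, assumed even and monotonically decreasing for positive argument, and the function names are assumptions.

```python
import math


def model_beta(s, beta0=1.0, width=1.0):
    """Hypothetical even model for beta(s); not the beta of the paper."""
    return beta0 * math.exp(-(s / width) ** 2)


def positive_real_root(beta, target, s_max=50.0, tol=1e-12):
    """Return s0 >= 0 with beta(s0) = target (target plays the part of
    lambda/epsilon), or None when target lies above beta(0) or below
    beta(s_max). Assumes beta decreases monotonically on [0, s_max]."""
    if target > beta(0.0) or target < beta(s_max):
        return None
    lo, hi = 0.0, s_max
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if beta(mid) > target:
            lo = mid        # root lies at larger s
        else:
            hi = mid
    return 0.5 * (lo + hi)


gas_like = positive_real_root(model_beta, target=2.0)      # above the bound
liquid_like = positive_real_root(model_beta, target=0.5)   # roots at +/- s0
```

For an even beta the second real root is simply the negative of the one returned.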


7. Development for the gaseous phase


The expressions found in the last section for the thermodynamical functions are
quite general, but not very informative to the physical intuition, or convenient for
numerical evaluation. Certain developments in series of the general expressions
will therefore be considered. In the region of low densities and high temperatures
$|\epsilon\beta(s)/\lambda| < 1$ for all real $s$, and it is permissible to develop the infinite integrals in
powers of the density. Hence

A = ^NhT 








and 

which is the equation of state. 


fm 

12A \f22(2770*A’ 

- iMT log T+mT log ni 

p-n&Tfl-^ y f” 


) I 


v+l 


( 7 - 1 ) 

( 7 - 2 ) 






Under the same conditions, however, Mayer (1937) has obtained an exact development
quite analogous to (7·1) by evaluation of the integral expression in (2·1).
A concise and perfected exposition of the method has been given by Born & Fuchs
(1938); the result obtained is

\[ A = -NkT\sum_{\nu=1}^{\infty}\frac{\beta_\nu n_1^{\nu}}{\nu+1} - \tfrac{3}{2}NkT\log T + NkT\log n_1, \qquad (7\cdot3) \]


where the first /?„ are given by 

/*00 

Jo 

afrs) = a{lx(®>-x(’-)|), 


or 


A = (27rm0h 

_ rco 


Comparing ( 7 - 1 ) and ( 7 * 3 ), it is found that 


( 7 ^ 4 ) 


( 7 * 5 ) 


( 7 - 6 ) 


Substituting $\nu = 2$ in (7·6), agreement is obtained with the value of $\beta_2$ given by (7·5),
provided $\epsilon = 1$; the remaining $\beta_\nu$ given by (7·6) are approximate, but far more
tractable expressions for the virial coefficients than those obtained by Mayer's
combinational method.

Mayer found that the equation of state derived from (7·3) will not describe the
liquid phase, for the simple reason that the infinite series becomes divergent on
approaching the point of condensation. Precisely the same thing happens to the
present series (7·1) when the transcendental equation (5·9) first has a real root, and
the provisional hypothesis advanced at the end of the last section is therefore
confirmed.
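As a concrete instance of a virial coefficient in the gaseous phase, the standard textbook second virial coefficient, B2 = -2 pi times the integral of (exp(-phi/kT) - 1) r^2 dr, can be evaluated numerically. The sketch below is an illustration only and does not reproduce the approximate expressions (7·6) of this paper; the hard-sphere potential, the midpoint rule and all names are assumptions chosen because the result is known in closed form (B2 = 2 pi d^3/3).

```python
import math


def mayer_function(phi, r, kT):
    """exp(-phi(r)/kT) - 1, with an infinite potential giving exactly -1."""
    value = phi(r)
    if value == math.inf:
        return -1.0
    return math.exp(-value / kT) - 1.0


def second_virial(phi, kT, r_max, steps=100000):
    """B2 = -2*pi * integral_0^r_max (exp(-phi/kT) - 1) r^2 dr, midpoint rule.
    r_max must lie beyond the range where phi is appreciable."""
    h = r_max / steps
    total = 0.0
    for i in range(steps):
        r = (i + 0.5) * h
        total += mayer_function(phi, r, kT) * r * r
    return -2.0 * math.pi * total * h


def hard_sphere(diameter):
    """Hypothetical test potential: infinite inside the diameter, zero outside."""
    return lambda r: math.inf if r < diameter else 0.0


b2 = second_virial(hard_sphere(1.0), kT=1.0, r_max=2.0)
```

For an attractive potential the same routine gives a coefficient that falls, and eventually changes sign, as the temperature is lowered.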

It is important to notice, however, that the solution of the equation (5·9) will
not necessarily yield the boiling-point as a function of density, but rather the
highest temperature to which the liquid can be superheated without boiling. The
boiling-point must be determined by the usual thermodynamical method of
considering the equilibrium between the two phases.*

* The authors of this series believe that the view of Kahn (1938) on this matter is
incorrect, and will give detailed reasons later.




8. Development for the liquid phase

It has been seen that the power series (7·1) can describe only the gaseous phase,
so that another form of expansion is required to adapt the formulae of § 6 for the
liquid as well as the gas. For this purpose it is convenient to consider the complex
roots $z_n$ ($n = 1, 2, 3$, etc.) of (5·9) in the upper half of the $z$-plane. Assume that
$\beta(z) \to 0$ as $|z| \to \infty$, uniformly with respect to $\arg z$, in this region; by the theory of
residues, the following expansion is then obtained from (5·8):

r{f{r)+ea{r)} = - {2nf —S > ( 8 - 1 ) 

where, according to ( 5 * 6 ), 

^ j_^cos(rzJa(5)s2ds-A/e, (8-2) 

so that sin (rzje-i‘®/*2’^(s)5(^5. (8-3) 

Now use (8·1) to calculate $E$ afresh from (6·1), and obtain


The equation of state, obtained similarly from (6·2), is




Since 




A?ii 

iN’ 


( 8 - 6 ) 


the equation ( 6 ' 6 ) is again integrable, and 

( 8 - 7 ) 

The formulae of this section hold equally for the liquid and the gas. The difference
arises from the fact that in the case of the liquid there are two real roots which are
not included in the $n$-summation; when the temperature is raised beyond the
boiling-point these roots migrate in opposite directions into the complex plane, and
the one above the real axis is now included in the $n$-summation. Thus there is a real
discontinuity in the radial distribution function and all the thermodynamic functions
at the point of condensation, a consequence of the theory which is confirmed by
experience. However, the discontinuity in the radial distribution function
corresponds only to a transfer from one branch of the general solution of the integral
equation to another; the point of condensation is therefore a kind of branch point.




An approximate classical theory of the solid state appears to result in the
following way. When the boundary condition (5.10), which can be correct only
for liquids, is omitted, the possibility of a periodic term proportional to $\sin r z_k$
in the expansion (8.1), where $z_k$ is the real root of (5.9), is not excluded. Such periodicity at
large distances from a given lattice point is characteristic of the solid state. This
may provide confirmation of the idea of Vlasov mentioned in the introduction to
the first paper of this series.


9. Appendix


Here a proof of the formula (2.1) for the free energy will be given. First show that
the quantity there defined satisfies the differential equation (6.6), or the equivalent
partial differential equations


dTXTj'^ dVXT)'^ 


(9-1) 


where E and p are defined by (6.1) and (6.2) respectively. Use is made of the formula

1 n(N-2)C‘ N 

(JV-2)!j ■" J (9-2) 


where 


1 cimr ^ 


(9-3) 


The first of the equations (9.1) follows immediately on differentiating (9.3); to obtain
the second, write x® = where P = V; then 


JSN rKSiV) ri N 

J-i i-l 


N 


^ = i S ?6{Zieo>-8<«|}, 


so that 




(9-4) 


dQ 

dl 


(9-5) 


and 


^ 1 ^ 

dv BP dl 


= n^Qx-iQijn^ir)^rdT. 
This leads immediately to the second of equations (9.1).


(9*6) 




It remains to be shown that the quantities E and p as defined correspond to the
internal energy and pressure as observed experimentally. That this is so in the case
of the internal energy follows easily on calculating the mean value of the
Hamiltonian energy W. The evaluation of the pressure requires rather more attention;
in order to make the discussion as general as possible, equilibrium conditions will
not be assumed, and the pressure tensor in the fluid will be evaluated in the first
instance. (The analysis which follows is borrowed from part III on dynamical
properties, now in preparation by Professor M. Born and the author.)

Multiply the equation 

^ + ga)_§4. = — ff 


which is satisfied byin the absence of external forces, by and integrate over 
5®, obtaining 

(»■«) 


To simplify further the right-hand side, note that, although macroscopically uniform
conditions are not yet assumed, within the microscopic sphere around where 
is effectively different from zero, may be represented as a function of 
r = |x^ 2 )_x^) writing 


v{x^^\ “ J dr, 


then (9*8) becomes 
3 


Further, since 




(9-9) 

(9-10) 

(9-11) 


0 3 _ g 

where d/dt is the operator ^ (9-10) reduces to 


where 






(9-12) 

(9-13) 





® dn, 


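and $\pi_{\alpha\beta}$ is any tensor which satisfies $\sum_{\beta=1}^{3} \partial \pi_{\alpha\beta} / \partial x_\beta = 0$. $\pi_{\alpha\beta}$ must be determined by
conditions at the boundary of the fluid, where the part of the pressure tensor due to the molecular forces,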
namely, 


■+• r, r) drS^g+TT^g, 


must vanish. (The interaction between the walls and the fluid is neglected.) 

In uniform conditions, r) depends only on r, and is a constant tensor, 
given by 





v{r) 


( 9 - 14 ) 


since r ranges only over a hemisphere at the boundary. The hydrostatic pressure in
equilibrium conditions is therefore given by

$$p = n_1 kT - \frac{1}{6} \int n_2(r)\, \phi'(r)\, r\, 4\pi r^2\, dr, \qquad (9.15)$$

in agreement with the value found previously by differentiating the phase integral.


The author wishes to express here his grateful appreciation of numerous helpful
discussions with Professor Born in the course of this work. He is also indebted to
Dr R. Fürth for some useful comments.


References 

Born, M. & Fuchs, K. 1938 Proc. Roy. Soc. A, 166, 391.
Born, M. & Green, H. S. 1946 Proc. Roy. Soc. A, 188, 10.
Chapman, S. & Cowling, T. G. 1939 The mathematical theory of non-uniform gases. Camb. Univ. Press.
Kahn, B. 1938 Utrecht Diss.
Kaplan, W. & Dresden, M. 1944 Phys. Rev. 66, 16.
Mayer, J. E. 1937 J. Chem. Phys. 5, 67, 74; 6, 87, 101.



Tables of second virial and low-pressure Joule-Thomson 
coefficients for intermolecular potentials with 
exponential repulsion 

By R. A. Buckingham, Ph.D., University College, London
and J. Corner, Ph.D., Armament Research Department, Ministry of Supply

(Communicated by H. S. W. Massey, F.R.S. Received 6 June 1946)


The potentials used are functions of r, the distance between the centres of the molecules, but
not of their relative orientation. The long-range attraction between the molecules is taken to
be the sum of terms proportional to $r^{-6}$ and $r^{-8}$, and the short-range repulsion is an
exponential function of r. This potential has four parameters: the position and depth of the
minimum of the potential, the relative importance of the $r^{-8}$ term, and the steepness of the
exponential repulsion. The second virial and low-pressure Joule-Thomson coefficients are tabulated for
the range of parameters likely to be found in actual molecules, and for temperatures from the
critical temperature to about 20 times this value. The most important of the three quantum
corrections is tabulated over the same ranges.


Introduction

The second virial of a gas is connected with the field of force round its molecules
by an integral which is simple in form but in general difficult to evaluate analytically.
This has led to a search for types of intermolecular potential energy which would
have the right qualitative features and would give an analytical formula for the
second virial coefficient. The most successful of these forms has been the
Lennard-Jones potential

$$E(r) = \lambda r^{-n} - \mu r^{-m}, \qquad (1)$$

where r is the distance between the centres of the molecules, $\lambda$, $\mu$, n and m are
constants, and n > m > 3. This has been integrated as a convergent series
(Lennard-Jones 1924), which is convenient for the analysis of observed second virial coefficients.
This potential has been applied to many substances.

More recently, several authors have found that in particular cases the repulsive
part of (1), namely, $\lambda r^{-n}$, can with advantage be replaced by an exponential repulsion
(Born & Mayer 1932; Bleick & Mayer 1934; Huggins 1937). Unfortunately,
the second virial coefficient of the resulting potential has not yet been evaluated
analytically, and it has been necessary to use numerical integration (cf. Herzfeld
1937) or approximate methods (Buckingham 1938). Recently, Orr (1942) has
proposed an intermolecular potential of rather different form which has the desired
feature of an exponential repulsion at short distances, and whose virial coefficient
can be integrated exactly.



We have computed the second virial coefficient for a potential derived from the
usual type (1) by replacing the inverse-power repulsion $\lambda r^{-n}$ by an exponential
term, taking m = 6, and adding an additional attractive term proportional to $r^{-8}$.

Reduced to non-dimensional variables, of which there are three, the results
previously obtained for simple molecules show that one need compute the second
virial coefficient for only quite restricted ranges of two of the variables. This makes
the computation not intolerably long, though still rather laborious. The tables of
second virial coefficients are given in the present paper; their application to various
substances will follow in a separate paper.

Hirschfelder, Ewell & Roebuck (1938) have shown how the Joule-Thomson
coefficient (extrapolated to zero pressure) may be used instead of the second virial
coefficient to find intermolecular potentials, and they have derived the
Joule-Thomson coefficient of the potential (1) from Lennard-Jones's expression for its
second virial coefficient. We have prepared tables of the Joule-Thomson coefficient
for the same potentials for which we have computed the second virial coefficient.
We have also calculated similar tables of the most important of the three quantum
corrections to the second virial and Joule-Thomson coefficients.

The potentials used in this paper can apply only to molecules which are nearly 
spherical. 


Formulae used

The second virial coefficient has been defined in several slightly different ways,
and has been expressed in several sets of units (cf. Fowler 1936, and Hirschfelder
et al. 1938). We use the definition

$$pV = nkT\left\{1 + \frac{B}{V} + O(V^{-2})\right\}, \qquad (2)$$

where p, T and n are the pressure, temperature and number of molecules in the
volume V. (2) is true at low densities, and defines the second virial coefficient B,
which is a function of temperature. B is proportional to n, and has the dimensions of
a volume. The most usual choices of n are n = N, Avogadro's number, so that B is
given as volume per gram-molecule, or n = L, Loschmidt's number, which means
that B is in units of volume (per c.c. of perfect gas at s.t.p.). n need not be specified
in this paper.

Take the zero of the intermolecular potential energy E(r) to be the value for
infinite r. In classical statistical mechanics, B is connected with E(r) by the relation

$$B = 2\pi n \int_0^\infty \{1 - e^{-E(r)/kT}\}\, r^2\, dr. \qquad (3)$$

There are three quantum corrections: 

(I) the probability of configurations of potential energy E is not proportional to $e^{-E/kT}$;

(II) certain states of the gas are excluded by the statistics appropriate to the 
molecules under consideration; 




(III) when the kinetic energy of the relative motion of two molecules is less than 
the attractive potential energy, the total energy, omitting the translational energy 
of the centre of mass of the pair, can take only certain discrete values. 

The most important correction is the first. It was pointed out by Slater (1931),
developed by Uhlenbeck & Gropper (1932), Wigner (1932), and most conveniently
by Kirkwood (1933). Uhlenbeck & Beth (1936) have illustrated the subject by
application to particular cases.

The second correction, for the effect of statistics, has been shown by Beth &
Uhlenbeck (1937) to be negligible except for helium at low temperatures.

The correction for discrete states has been evaluated by Beth & Uhlenbeck (1937) 
in terms of the phase shifts of the wave functions in the potential field. The correction 
is appreciable only when the number of discrete levels is small and the temperature 
is low, both conditions being necessary. Even for helium, which has at most one 
discrete level,* the correction is appreciable only at low temperatures. For molecules 
such as argon, with something like a dozen discrete levels, the effect is negligible at 
all temperatures. To sum up, only the correction (I) need be taken into account 
except for helium at low temperatures, where it is necessary to include all three 
corrections. 

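Thus with the first quantum correction, or rather the first approximation to it,

$$B = 2\pi n \int_0^\infty \left\{1 - e^{-E/kT} + \frac{h^2}{48\pi^2 m k^3 T^3} \left(\frac{dE}{dr}\right)^2 e^{-E/kT}\right\} r^2\, dr, \qquad (4)$$

where m is the mass of the molecule. There are terms involving higher powers of
$h^2/mkT$, but these are appreciable at low temperatures only, and may otherwise be
neglected. We reduce (4) to a more convenient form by the following substitutions:

$$\Lambda^2 = \frac{h^2}{2\pi m kT}, \qquad \sigma = \frac{r}{r_0}, \qquad E = -\epsilon f(\sigma), \qquad x = \frac{\epsilon}{kT}, \qquad (5)$$

where $r_0$ and $\epsilon$ are the position and depth of the minimum of E(r). Hence

$$B = 2\pi n r_0^3 \{F_0(x) + (\Lambda/r_0)^2 F_1(x)\}, \qquad (6)$$

where

$$F_0(x) = \int_0^\infty \{1 - e^{xf}\}\, \sigma^2\, d\sigma \qquad (7)$$

and

$$F_1(x) = \frac{x^2}{24\pi} \int_0^\infty \left(\frac{df}{d\sigma}\right)^2 \sigma^2 e^{xf}\, d\sigma. \qquad (8)$$

For f, the shape of the potential, we use the following forms:

$$\sigma \ge 1, \quad f = f_1(\sigma^{-6} + \beta\sigma^{-8}) - f_2 \exp\{\alpha(1 - \sigma)\}, \qquad (9)$$

$$\sigma \le 1, \quad f = f_1(\sigma^{-6} + \beta\sigma^{-8}) \exp\{-4(\sigma^{-1} - 1)^3\} - f_2 \exp\{\alpha(1 - \sigma)\}, \qquad (10)$$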


* The existence of this level is not yet established with certainty, as it depends critically on
the form assumed for the intermolecular potential.




where

$$f_1 = \alpha / \{\alpha(1 + \beta) - 6 - 8\beta\},$$

$$f_2 = (6 + 8\beta) / \{\alpha(1 + \beta) - 6 - 8\beta\} = -1 + (1 + \beta) f_1.$$

In (9) there are attractive potentials proportional to $r^{-6}$ and $r^{-8}$, and an exponential
repulsion. The parameter $\alpha$ measures the steepness of the repulsion, and values
deduced for particular gases by various authors lie in the neighbourhood of 13.5;
our tables cover the range 12.5 to 14.5. The importance of the $r^{-8}$ term, relative to
the $r^{-6}$ potential, is measured by $\beta$. Margenau (1939) has given a formula for the
$r^{-8}$ potential which makes $\beta$ usually about 0.15; we have used the values $\beta$ = 0 and
0.2.

There is a discontinuity in the third derivative of f in passing through $\sigma$ = 1.
This is due to the multiplication of the attractive potential by a factor which prevents
the whole potential going negative and finally infinite as $\sigma$ tends to zero, and which
therefore will make easier a comparison of the potentials with the results of scattering
experiments (cf. Amdur & Pearlman 1941).
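As an illustration only, the shape function and the classical integral $F_0$ can be evaluated by direct numerical quadrature. The sketch below assumes the reading of (9) and (10) in which $E = -\epsilon f(\sigma)$, with $f = f_1(\sigma^{-6} + \beta\sigma^{-8}) - f_2 \exp\{\alpha(1 - \sigma)\}$ for $\sigma \ge 1$ and the attractive part multiplied by $\exp\{-4(\sigma^{-1} - 1)^3\}$ for $\sigma < 1$. The function names, the Simpson rule and the cut-offs `lo` and `hi` are choices made here for illustration; they are not the authors' method of computation.

```python
import math


def shape_f(sigma, alpha, beta):
    """Assumed reduced potential f(sigma), normalised so that f(1) = 1."""
    d = alpha * (1.0 + beta) - 6.0 - 8.0 * beta
    f1 = alpha / d
    f2 = (6.0 + 8.0 * beta) / d
    attraction = f1 * (sigma ** -6 + beta * sigma ** -8)
    if sigma < 1.0:
        # Factor that switches the attraction off as sigma tends to zero.
        attraction *= math.exp(-4.0 * (1.0 / sigma - 1.0) ** 3)
    return attraction - f2 * math.exp(alpha * (1.0 - sigma))


def simpson(g, a, b, n=8000):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * g(a + i * h)
    return total * h / 3.0


def F0(x, alpha, beta, lo=0.05, hi=12.0):
    """Classical integral: int_0^inf {1 - exp(x f)} sigma^2 d(sigma)."""
    d = alpha * (1.0 + beta) - 6.0 - 8.0 * beta
    f1 = alpha / d

    def integrand(s):
        return (1.0 - math.exp(x * shape_f(s, alpha, beta))) * s * s

    core = lo ** 3 / 3.0  # below lo the integrand is sigma^2 to high accuracy
    # Beyond hi, 1 - exp(x f) is close to -x f and f is close to its power-law part.
    tail = -x * f1 * (1.0 / (3.0 * hi ** 3) + beta / (5.0 * hi ** 5))
    return core + simpson(integrand, lo, hi) + tail


if __name__ == "__main__":
    print(F0(0.8, 12.5, 0.0))
```

The printed value can be compared with the tabulated entry for $\alpha$ = 12.5, $\beta$ = 0 and x = 0.8, which is -0.4180.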

Whitelaw (1934) has shown that if $c_p^0$ and $\mu_0$ are respectively the specific heat at
constant pressure, per molecule, and the Joule-Thomson coefficient, both
extrapolated to zero pressure, then

= j, (11) 

and Hirschfelder et al. (1938) have used this equation to derive theoretical expressions
for $\mu_0$.

In the case of the potential (9) and (10) one can write

$$c_p^0 \mu_0 = 2\pi r_0^3 \{G_0(x) + (\Lambda/r_0)^2 G_1(x)\}, \qquad (12)$$

where

$$G_0 = \int_0^\infty (e^{xf} + x f e^{xf} - 1)\, \sigma^2\, d\sigma \qquad (13)$$

and 


( 14 ) 


$G_0$ is the classical term and $G_1$ the first and most important of the three quantum
corrections.
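A quick cross-check between the tables is possible if one assumes the standard zero-pressure relation in which $c_p^0 \mu_0$ is proportional to $T\,dB/dT - B$. Since the classical part of B depends on temperature only through $x = \epsilon/kT$, this gives $G_0 = -x\,dF_0/dx - F_0$. The sketch below applies a central difference to three entries of Table 1 ($\alpha$ = 12.5, $\beta$ = 0); the helper name is illustrative and the relation is an assumption stated here, not a formula quoted from the paper.

```python
def g0_from_f0(x, f_minus, f_mid, f_plus, step):
    """Estimate G0(x) = -x dF0/dx - F0 using a central difference in x."""
    slope = (f_plus - f_minus) / (2.0 * step)
    return -x * slope - f_mid


# Entries of Table 1 for alpha = 12.5 at x = 0.45, 0.50 and 0.55.
estimate = g0_from_f0(0.50, -0.1233, -0.1614, -0.2008, 0.05)

if __name__ == "__main__":
    # Compare with the Table 5 entry for alpha = 12.5, x = 0.50, which is 0.5487.
    print(round(estimate, 4))
```

The agreement is limited by the finite-difference step and by the four-figure rounding of the tabulated entries.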


Tables of the functions $F_0$, $F_1$, $G_0$ and $G_1$

These tables cover the following ranges: $\alpha$ from 12.5 to 14.5, x from 0.04 to 0.8,
and the values of $\beta$ are 0 and 0.2.

Interpolation with respect to $\alpha$ and x is possible, but with only two values of $\beta$ it
is not possible to interpolate with respect to $\beta$, except by a linear interpolation.
However, the variation with $\beta$ is small, and the error in a linear interpolation is
probably negligible.

The lower limit of the tabulated values of x, namely, 0.04, was chosen because x
is greater than 0.04 for all known virial measurements, except certain results for
helium at high temperatures. The tabulated values for x = 0.04 and $\beta$ = 0 are at




present extrapolated. An extension of the tables to this and lower values of x is in
progress, however, with a view to their application to the special case of helium.

The upper limit of x, which is 0.8, lies in the neighbourhood of the critical
temperature. It is not necessary to carry the tabulation to lower temperatures, because the
form of the curves in this region is such that a unique fit to experimental data is
impossible. Observations well above the critical temperature are essential to the
derivation of an intermolecular potential.

These functions have no special mathematical interest, so the accuracy to be
attained was settled solely by considerations of their use in analysis of observed
second virial and Joule-Thomson data. These are subject to errors of at least 1 %
of the values found at the critical temperature. We have aimed at a maximum error
of less than 0.1 % in $F_0$ and $G_0$ and 1 % in $F_1$ and $G_1$, on this scale, and the intervals
used in the computation have been chosen to reduce the errors to these limits.

The tables are all to be interpolated in the same way. The printed differences are of
two types: those for interpolation with respect to $\alpha$ are printed between the columns,
those for interpolation with respect to x are placed between the rows. The differences
are always the sum of adjacent second differences, modified by a 'throw-back' to
include the effect of fourth differences. These sums are sufficient for Bessel
interpolation:

$$f(a + nh) = n f(a + h) + (1 - n) f(a) + B''(n) M,$$

where n lies between 0 and 1, $B''(n)$ is the Bessel coefficient of the second difference,
and M is the printed difference-sum for the interval (a, a + h). Where M is omitted,
linear interpolation introduces an error of less than half a unit of the last place.
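The interpolation rule can be written as a short function. The sketch assumes the usual Bessel second-difference coefficient $B''(n) = n(n - 1)/4$ and takes M in the same units as the table entries (the printed differences are in units of the last decimal place). The function name and the sample point x = 0.125 are illustrative.

```python
def bessel_interpolate(f_a, f_a_plus_h, n, m=0.0):
    """Return f(a + n*h) from the bracketing entries and the difference-sum M."""
    if not 0.0 <= n <= 1.0:
        raise ValueError("n must lie between 0 and 1")
    b2 = n * (n - 1.0) / 4.0  # Bessel coefficient of the second difference
    return n * f_a_plus_h + (1.0 - n) * f_a + b2 * m


# Table 1, alpha = 12.5: F0(0.10) = 0.0945 and F0(0.15) = 0.0716, with the
# printed difference-sum -138 in units of the fourth decimal, i.e. M = -0.0138.
value = bessel_interpolate(0.0945, 0.0716, 0.5, -0.0138)

if __name__ == "__main__":
    print(value)
```

For a quadratic function the rule is exact: with f(t) = t², a = 2 and h = 0.5 the sum of the adjacent second differences is 4h² = 1, and the function returns (a + nh)² for any n in the interval.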


Computation of the tables 

For $\sigma \ge 1$, $e^{xf}$ can be expanded as a power series, and if one writes



= j^/^cr^do- 

(16) 

and 


(16) 

then 


(17) 

and 


(18) 


with analogous series for $G_0$ and $G_1$. The integrals (15) and (16) were calculated for
n up to 7. Values of $F_0$, $F_1$, $G_0$ and $G_1$ were computed for $\alpha$ = 12.5 (0.5) 14.5 and
x = 0.04, 0.05, 0.06, 0.08, 0.1, 0.15, 0.2 (0.1) 0.8, and the tables were completed by
interpolation. Apart from checking against the original computations, the proofs
have been differenced, and it is believed that few errors are left in the tables. The
formulae and the intervals used in the numerical integrations were of two types,
and a number of the integrals were computed by both methods as a check on the
accuracy attained.
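The power-series device used for the outer region can be checked against direct quadrature. The sketch below assumes the $\sigma \ge 1$ branch $f = f_1(\sigma^{-6} + \beta\sigma^{-8}) - f_2 \exp\{\alpha(1 - \sigma)\}$, for which $0 < f \le 1$, and integrals of the form $I_n = \int f^n \sigma^2\, d\sigma$. The truncation of the range at $\sigma$ = 12, the Simpson rule and all names are choices made for illustration, not the authors' procedure.

```python
import math


def f_outer(sigma, alpha, beta):
    """Assumed shape function for sigma >= 1, with f(1) = 1 at the minimum."""
    d = alpha * (1.0 + beta) - 6.0 - 8.0 * beta
    f1 = alpha / d
    f2 = (6.0 + 8.0 * beta) / d
    return f1 * (sigma ** -6 + beta * sigma ** -8) - f2 * math.exp(alpha * (1.0 - sigma))


def simpson(g, a, b, n=4000):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * g(a + i * h)
    return total * h / 3.0


def outer_direct(x, alpha, beta, hi=12.0):
    """Direct quadrature of {1 - exp(x f)} sigma^2 over 1 <= sigma <= hi."""
    return simpson(lambda s: (1.0 - math.exp(x * f_outer(s, alpha, beta))) * s * s, 1.0, hi)


def outer_series(x, alpha, beta, n_max=7, hi=12.0):
    """Same quantity from the truncated series -sum_n x^n I_n / n!."""
    total = 0.0
    for n in range(1, n_max + 1):
        i_n = simpson(lambda s: f_outer(s, alpha, beta) ** n * s * s, 1.0, hi)
        total -= x ** n / math.factorial(n) * i_n
    return total


if __name__ == "__main__":
    print(outer_direct(0.8, 12.5, 0.0), outer_series(0.8, 12.5, 0.0))
```

Because f does not exceed 1 in this region and x does not exceed 0.8, terms beyond n = 7 are very small, which is consistent with the integrals having been carried only that far.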

For permission to publish this work we wish to thank the Chief Scientific Officer, 
Ministry of Supply. 


Table 1. The function $F_0$, related to the second virial coefficient, for $\beta$ = 0

    x  α = 12.5             13           13.5             14           14.5
 0.04  (0.1081)       (0.1122)       (0.1161)       (0.1198)       (0.1234)
 0.05    0.1079         0.1121         0.1161         0.1199         0.1236
            -20            -20            -20            -20            -20
 0.06    0.1066         0.1109         0.1150         0.1189         0.1226
            -15            -15            -15            -15            -15
 0.07    0.1044         0.1088         0.1130         0.1170         0.1207
            -12            -12            -12            -12            -11
 0.08    0.1016         0.1061         0.1104         0.1144         0.1183
             -9             -9             -9             -9             -9
 0.09    0.0982         0.1029         0.1073         0.1114         0.1153
             -7             -7             -7             -7             -7
 0.10    0.0945         0.0993         0.1038         0.1080         0.1120
           -138           -137           -136           -134           -133
 0.15    0.0716   -8    0.0771         0.0821         0.0868         0.0913
            -71            -70            -69            -69            -68
 0.20    0.0444  -10    0.0506   -9    0.0562   -8    0.0615         0.0664
            -48            -48            -47            -46            -46
 0.25    0.0144  -12    0.0213  -10    0.0276   -9    0.0335   -8    0.0389
            -38            -37            -37            -36            -36
 0.30   -0.0176  -13   -0.0100  -12   -0.0030  -10    0.0035   -9    0.0094
            -32            -32            -31            -31            -30
 0.35   -0.0514  -15   -0.0429  -14   -0.0352  -12   -0.0282  -10   -0.0216
            -29            -29            -28            -28            -27
 0.40   -0.0866  -17   -0.0774  -15   -0.0689  -13   -0.0612  -11   -0.0541
            -28            -27            -27            -26            -26
 0.45   -0.1233  -19   -0.1132  -17   -0.1040  -14   -0.0956  -12   -0.0879
            -27            -26            -26            -25            -25
 0.50   -0.1614  -22   -0.1504  -19   -0.1404  -16   -0.1313  -13   -0.1229
            -27            -26            -26            -25            -25
 0.55   -0.2008  -24   -0.1889  -21   -0.1781  -17   -0.1682  -13   -0.1592
            -27            -26            -25            -25            -24
 0.60   -0.2413  -27   -0.2287  -23   -0.2170  -19   -0.2064  -16   -0.1967
            -27            -26            -26            -25            -25
 0.65   -0.2836  -28   -0.2697  -24   -0.2572  -20   -0.2458  -17   -0.2354
            -27            -27            -26            -26            -25
 0.70   -0.3270  -30   -0.3121  -26   -0.2987  -22   -0.2865  -19   -0.2753
            -28            -27            -27            -26            -26
 0.75   -0.3718  -32   -0.3559  -28   -0.3416  -24   -0.3285  -21   -0.3165
            -28            -28            -27            -27            -26
 0.80   -0.4180  -34   -0.4010  -30   -0.3857  -26   -0.3718  -23   -0.3590





Table 2. The function $F_0$, related to the second virial coefficient, for $\beta$ = 0.2

^= 12-5 13 13.5 1. 


a = 12*5 
0-1105 
(-31) 
0-1106 
-20 
0-1094 
-15 
0-1074 
-12 
0-1047 
-9 

0-1016 

-8 

0-0981 

-136 

0-0762 

-70 

0-0501 

-48 

0-0212 

-38 

-0-0097 

-31 

-0-0424 

-29 

-0-0764 


0-1147 

(-31) 

0*1149 

-21 

0-1138 

-15 

0-1120 

-12 

0-1094 

-9 

0-1064 

-8 

0-1030 

-136 

0-0819 

-70 

0-0565 

-47 

0-0284 

-38 

-0-0017 

-31 

-0-0336 

-28 

-0-0668 


13-5 

0-1186 

(-31) 

0-1189 

-20 

0-1180 

-15 

0-1163 

-12 

0-1138 

-9 

0-1109 

-8 

0-1076 

-135 

0-0871 

-69 

0-0623 

-46 

0-0349 

-37 

0-0055 

-30 

-0-0256 

-28 

-0-0580 ■ 


-28 

-0-1119 

-20 

-27 

-0-1014 

-18 

-27 

-0-0919 

-16 

-26 

-0-1488 

-22 

-26 

-0-1373 

-20 

-25 

-0-1269 

-18 

-26 

-0-1869 

-24 

-26 

-0-1745 

-22 

-25 

-0-1633 

-20 

-26 

-0-2264 

-27 

-25 

-0-2130 

-24 

-25 

-0-2009 

-21 

-26 
- 0-2672 

-30 

-26 

-0-2527 

-27 

-25 

-0-2397 

-23 

-26 

-0-3093 

-33 

-26 

-0-2938 

-29 

~26 

-0-2798 

-25 

-27 

-0-3527 

-37 

-27 

-0-3361 

-32 

-26 

-0*3212 

-26 

— 27 
-0-3975 

-40 

-27 

-0-3798 

-34 

-27 

-0-3639 

-28 


14 

0-1224 

(-31) 

0-1228 

-20 

0-1220 

-15 

0-1203 

-12 

0-1180 

-9 

0-1152 

-7 

0*1120 

-134 

0-0919 

-68 

0-0677 

-46 

0-0409 

-36 

0-0121 

-30 

-0-0183 - 

-27 

-0-0501 - 

-26 

-0-0832 - 

-25 

-0*1175 - 

-25 

-0-1531 - 

-24 

-0-1899 - 
-25 

- 0-2280 - 
-25 

-0-2672 - 

-26 

-0-3078 ~ 
-27 

-0-3496 - 


14-5 

0-1261 

(-31) 

0-1265 

-21 

0-1257 

-15 

0-1242 

-12 

0-1219 

-9 

0-1192 

-7 

0-1160 

-132 

0*0964 

-67 

0*0727 

-47 

0-0464 

-36 

0-0182 

-29 

-0-0116 
-27 
-0-0427 
-26 
-0-0753 
-24 
-0-1089 
-24 
-0*1439 
-24 
-0-1799 
-24 
-0-2172 
-25 
-0-2557 
-25 
- 0-2955 
-26 
-0-3365 





Table 3. The function $F_1$, related to the quantum correction to the second virial coefficient, for $\beta$ = 0

    x  α = 12.5        13      13.5        14      14.5
 0.04   (0.100)   (0.104)   (0.109)   (0.114)   (0.119)
 0.05     0.107     0.111     0.116     0.121     0.127
 0.06     0.113     0.118     0.123     0.128     0.134
 0.07     0.119     0.124     0.129     0.134     0.140
 0.08     0.124     0.129     0.135     0.140     0.146
 0.09     0.129     0.135     0.140     0.146     0.152
 0.10     0.134     0.140     0.145     0.151     0.157
 0.15     0.157     0.164     0.170     0.176     0.183
 0.20     0.179     0.186     0.193     0.200     0.208
 0.25     0.201     0.209     0.217     0.225     0.233
 0.30     0.224     0.232     0.241     0.250     0.258
 0.35     0.247     0.257     0.266     0.276     0.285
 0.40     0.272     0.282     0.293     0.303     0.314
 0.45     0.299     0.310     0.321     0.332     0.344
 0.50     0.326     0.339     0.351     0.363     0.376
 0.55     0.356     0.370     0.383     0.396     0.410
 0.60     0.388     0.402     0.417     0.432     0.447
 0.65     0.422     0.438     0.453     0.470     0.486
 0.70     0.458     0.475     0.493     0.510     0.528
 0.75     0.497     0.515     0.534     0.554     0.573
 0.80     0.539     0.559     0.579     0.600     0.621


Table 4. The function $F_1$, related to the quantum correction to the second virial coefficient, for $\beta$ = 0.2

    x  α = 12.5        13      13.5        14      14.5
 0.04     0.103     0.107     0.112     0.117     0.122
 0.05     0.110     0.114     0.119     0.124     0.129
 0.06     0.116     0.121     0.126     0.131     0.136
 0.07     0.122     0.127     0.132     0.137     0.143
 0.08     0.127     0.132     0.138     0.143     0.149
 0.09     0.132     0.137     0.143     0.149     0.155
 0.10     0.137     0.142     0.148     0.154     0.160
 0.15     0.160     0.166     0.172     0.179     0.186
 0.20     0.182     0.189     0.196     0.203     0.210
 0.25     0.203     0.211     0.219     0.227     0.234
 0.30     0.225     0.234     0.242     0.251     0.259
 0.35     0.248     0.257     0.266     0.275     0.285
 0.40     0.272     0.282     0.292     0.302     0.312
 0.45     0.296     0.307     0.318     0.329     0.340
 0.50     0.323     0.335     0.346     0.358     0.370
 0.55     0.351     0.363     0.376     0.388     0.401
 0.60     0.380     0.394     0.407     0.421     0.434
 0.65     0.411     0.426     0.441     0.455     0.469
 0.70     0.445     0.460     0.476     0.491     0.506
 0.75     0.481     0.496     0.513     0.529     0.545
 0.80     0.518     0.536     0.553     0.571     0.588





Table 5. The function $G_0$, related to the Joule-Thomson coefficient, for $\beta$ = 0


X 

a = 12*5 

0*04 ( 

-0-1108) 

0*05 

-0*1040 


23 

0-06 

-0-0958 


17 

0*07 

-0-0866 


13 

0*08 

-0*0767 


10 

0*09 

-0*0662 


8. 

0*10 

-0*0552 


172 

0-15 

+ 0*0047 


104 

0-20 

0*0706 


84 

0-25 

0*1409 


76 

0-30 

0*2152 


72 

0-35 

0*2931 


72 

0*40 

0-3747 


74 

0*45 

0-4598 


76 

0*50 

0-5487 


78 

0*55 

0*6415 


81 

0*60 

0-7382 


85 

0*65 

0-8391 


89 

0*70 

0*9443 


93 

0-75 

1-0541 


97 

0*80 

1-1686 


13 

(-0*1153)' 

-0*1087 

23 

8 -0*1008 
17 

8 -0*0919 

13 


9 

-0-0822 

10 

10 

-0*0720 

8 

11 

-0*0613 

169 

14 

-0*0028 

102 

18 

0*0616 

82 

21 

0-1303 

74 

25 

0*2029 

71 

29 

0*2790 

71 

32 

0-3587 

72 

38 

0*4419 

74 

42 

0*5287 

76 

47 

0*6193 

79 

52 

0-7138 

83 

57 

0-8124 

86 

63 

0-9152 

90 

67 

1-0224 

95 

72 

1-1342 


8 

13*5 

(-0-1196) 

-0*1132 

23 

-0*1054 

17 

-0*0967 

8 

13 

-0*0873 

9 

10 

-0*0773 

9 

8 

-0*0669 

12 

167 

-0-0096 

15 

101 

0-0534 

18 

80 

0-1207 

21 

72 

0*1917 

25 

69 

0*2663 

29 

69 

0-3442 

33 

71 

0*4257 

36 

72 

0*5107 

40 

75 

0*5994 

44 

77 

0-6919 

49 

81 

0-7883 

54 

84 

0*8889 

59 

88 

0*9938 

64 

92 

1*1031 


14 

(-0*1236) 

-0*1173 

23 

-0*1098 

17 

-0*1013 

13 

-0*0920 

10 

-0*0822 

8 

8 -0*0720 

165 

10 -0*0159 

99 

13 0*0459 

79 

16 0*1119 

71 

18 0*1816 

68 

21 0*2547 

68 

25 0*3311 

69 

28 0*4110 

71 

31 0*4944 

73 

34 0*5813 

76 

38 0*6720 

79 

41 0*7665 

83 

45 0*8651 

86 

50 0*9679 

90 

55 1*0752 


14*5 

(-0*1273) 

-0*1213 

23 

-0*1139 

16 

-0*1055 

13 

-0*0965 


8 

10 

-0*0869 

8 

-0*0768 

163 

-0-0217 

11 

97 

0*0390 

13 

77 

0-1038 

15 

70 

0-1723 

18 

67 

0*2441 

21 

66 

0*3192 

23 

67 

0*3976 

25 

69 

0*4794 

29 

72 

0*5648 

32 

75 

0*6538 

34 

78 

0-7466 

37 

81 

0*8434 

40 

85 

0*9443 

44 

89 

1*0496 





Table 6. The function $G_0$, related to the Joule-Thomson coefficient, for $\beta$ = 0.2


£15 

a =12*5 

0*04 

-0*1139 

33 

0*05 

-0*1075 

23 

0*06 

-0*0997 

17 

0*07 

-0*0909 

13 

0*08 

-0*0814 

10 

0*09 

-0*0714 

9 

0*10 

-0*0608 

170 

0*15 

-0*0031 

104 

0-20 

+ 0*0606 
82 

0*25 

0*1287 

74 

0*30 

0*2005 

72 

0*35 

0*2760 

71 

0*40 

0*3550 

72 

0*45 

0*4376 

74 

0*50 

0*5238 

77 

0*55 

0-6138 

80 

0-60 

0*7077 

83 

0*65 

0*8057 

86 

0*70 

0*9080 

91 

0*75 

1*0146 

95 

0*80 

1*1260 


13 


8 

-0*1186 

33 

-0*1124 

23 

-0*1049 

9 

17 

-0*0964 

10 

13 

-0*0872 

10 

10 

-0*0774 

11 

9 

-0*0671 

17 

168 

-0*0109 

20 

102 
+ 0*0512 

24 

80 

0*1176 

28 

72 

0*1876 

32 

70 

0*2613 

37 

69 

0*3383 

42 

71 

0*4188 

47 

72 

0*5029 

51 

75 

0-5907 

56 

78 

0*6823 

62 

81 

0*7778 

68 

84 

0*8775 

74 

89 

0*9815 

80 

92 

1*0901 


8 

13*5 

-0*1230 

32 

-0*1170 

23 

-0*1097 

17 

-0*1014 

9 

13 

-0*0924 

9 

10 

-0*0829 

10 

8 

-0*0729 

14 

166 

-0*0180 

17 

100 
+ 0*0427 

20 

78 

0*1076 

24 

70 

0*1761 

28 

68 

0*2480 

32 

68 

0*3233 

36 

69 

0*4020 

40 

70 

0*4842 

44 

73 

0*5700 

48 

76 

0*6595 

53 

79 

0*7529 

58 

83 

0*8503 

63 

86 

0*9520 

69 

89 

1*0580 


14 

-0*1271 

31 

-0*1213 

22 

-0*1142 

17 

-0*1061 

13 

-0*0973 

10 

8 -0*0880 
8 

9 -0*0782 

163 

11 -0*0244 

98 

14 +0*0350 

77 

17 0*0985 

69 

20 0*1656 

67 

24 0*2361 

66 

27 0*3098 

67 

30 0*3869 

69 

33 0*4674 

71 

37 0*5513 

74 

41 0*6389 

77 

44 0*7304 

81 

48 0*8257 

84 

52 0*9252 

87 

58 1-0290 


14*5 

-0*1309 

31 

-0*1253 

22 

-0*1183 

17 

-0*1105 

13 

-0*1018 

10 

-0*0927 

8 

8 -0*0831 

161 

9 -0*0304 

96 

11 +0*0279 

75 

14 0*0903 

68 

17 0*1560 

66 

20 0*2252 

65 

22 0*2975 

66 

24 0*3731 

68 

27 0*4520 

70 

30 0*5344 

73 

33 0*6203 

76 

35 0*7099 

79 

38 0*8034 

83 

41 0*9009 

85 

46 1*0027 





Table 7. The function $-G_1$, related to the quantum correction to the Joule-Thomson coefficient, for $\beta$ = 0

    x  α = 12.5        13      13.5        14      14.5
 0.04   (0.229)   (0.239)   (0.249)   (0.259)   (0.270)
 0.05     0.247     0.257     0.267     0.278     0.290
 0.06     0.262     0.273     0.284     0.295     0.307
 0.07     0.277     0.288     0.299     0.311     0.323
 0.08     0.291     0.302     0.314     0.326     0.339
 0.09     0.304     0.316     0.328     0.341     0.354
 0.10     0.317     0.330     0.342     0.355     0.369
 0.15     0.381     0.396     0.411     0.426     0.441
 0.20     0.446     0.463     0.480     0.497     0.514
              8         8         9         9        10
 0.25     0.514     0.533     0.553     0.572     0.592
             11        11        11        11        12
 0.30     0.586     0.608     0.630     0.652     0.674
             12        13        13        14        15
 0.35     0.664     0.689     0.714     0.739     0.764
             14        14        15        16        16
 0.40     0.749     0.776     0.805     0.833     0.861
             15        16        17        18        19
 0.45     0.841     0.872     0.903     0.935     0.967
             17        18        19        20        21
 0.50     0.941     0.976     1.011     1.047     1.083
             19        20        21        22        23
 0.55     1.050     1.089     1.129     1.169     1.210
             21        22        23        24        25
 0.60     1.169     1.213     1.257     1.303     1.349
             23        24        25        26        27
 0.65     1.298     1.348     1.398     1.449     1.500
             24        25        26        27        28
 0.70     1.440     1.495     1.551     1.609     1.665
             26        27        28        29        29
 0.75     1.595     1.656     1.718     1.782     1.845
 0.80     1.762     1.830     1.900     1.971     2.040





Table 8. The function $-G_1$, related to the quantum correction to the Joule-Thomson coefficient, for $\beta$ = 0.2

    x  α = 12.5        13      13.5        14      14.5
 0.04     0.235     0.245     0.255     0.265     0.276
 0.05     0.252     0.262     0.273     0.284     0.295
 0.06     0.268     0.278     0.289     0.301     0.312
 0.07     0.282     0.293     0.305     0.316     0.329
 0.08     0.296     0.307     0.319     0.332     0.344
 0.09     0.309     0.321     0.334     0.346     0.359
 0.10     0.322     0.335     0.348     0.361     0.374
 0.15     0.385     0.400     0.415     0.430     0.445
 0.20     0.449     0.466     0.483     0.500     0.517
 0.25     0.515     0.534     0.553     0.572     0.591
              8         8         8         9        10
 0.30     0.584     0.606     0.627     0.648     0.669
             10        10        11        12        12
 0.35     0.659     0.682     0.706     0.729     0.752
             11        12        13        13        14
 0.40     0.738     0.764     0.791     0.817     0.842
             13        14        14        14        15
 0.45     0.824     0.853     0.882     0.911     0.940
             14        15        15        15        15
 0.50     0.916     0.949     0.981     1.013     1.045
             16        16        17        16        16
 0.55     1.016     1.052     1.088     1.123     1.157
             18        17        18        18        18
 0.60     1.125     1.164     1.203     1.241     1.277
             19        19        19        19        20
 0.65     1.242     1.285     1.327     1.368     1.408
             20        20        21        21        22
 0.70     1.370     1.416     1.461     1.506     1.549
             20        21        22        23        24
 0.75     1.507     1.557     1.606     1.654     1.702
             21        22        22        23        24
 0.80     1.654     1.709     1.762     1.815     1.867


References

Amdur, I. & Pearlman, H. 1941 J. Chem. Phys. 9, 503.
Beth, E. & Uhlenbeck, G. E. 1937 Physica, 's Grav. 4, 915.
Bleick, W. E. & Mayer, J. E. 1934 J. Chem. Phys. 2, 252.
Born, M. & Mayer, J. E. 1932 Z. Phys. 75, 1.
Buckingham, R. A. 1938 Proc. Roy. Soc. A, 168, 264.
Fowler, R. H. 1936 Statistical mechanics, chapter x. Camb. Univ. Press.
Herzfeld, K. F. 1937 Phys. Rev. 52, 374.
Hirschfelder, J. O., Ewell, R. B. & Roebuck, J. R. 1938 J. Chem. Phys. 6, 205.
Huggins, M. L. 1937 J. Chem. Phys. 5, 143.
Kirkwood, J. G. 1933 Phys. Rev. 44, 31.
Lennard-Jones, J. E. 1924 Proc. Roy. Soc. A, 106, 463.
Margenau, H. 1939 Rev. Mod. Phys. 11, 1.
Orr, W. J. C. 1942 Proc. Camb. Phil. Soc. 38, 224.
Slater, J. C. 1931 Phys. Rev. 38, 237.
Uhlenbeck, G. E. & Beth, E. 1936 Physica, 's Grav. 3, 729.
Uhlenbeck, G. E. & Gropper, L. 1932 Phys. Rev. 41, 79.
Whitelaw, K. G. 1934 Physica, 's Grav. 1, 749.
Wigner, E. 1932 Phys. Rev. 40, 749.





The oblique reflexion of very long wireless waves
from the ionosphere

By M. V. Wilkes, Mathematical Laboratory, Cambridge
(Communicated by Hodge, F.R.S. Received 23 July 1946)

An attempt is made in the paper to provide a satisfactory theoretical basis for a future
discussion of the experimental data on the propagation of very long radio waves (18,800 m.)
given by Best, Ratcliffe & Wilkes, and Budden, Ratcliffe & Wilkes. The reflexion of very long
plane waves incident obliquely on a horizontally stratified ionized medium with a vertical
magnetic field is first considered in general terms, and it is shown that the medium can be
divided into a transition region and a reflecting region.

If the ionization in the reflecting region increases linearly with height it is shown that 
propagation is governed by the following equations: 

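$$\frac{d^2L}{d\zeta^2} + (\alpha+\zeta)L + \beta M = 0, \qquad \frac{d^2M}{d\zeta^2} + (\alpha-\zeta)M + \beta L = 0,$$

where $\alpha$ and $\beta$ are constants depending on the angle of incidence. Under the conditions of
the experiments $\beta$ is small, and a solution, in terms of contour integrals, valid in this case is
obtained.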

1. Introduction

During the period 1934-9 a series of observations were made at Cambridge on the
nature of the downcoming wave received from the Post Office transmitter at Rugby
on a wave-length of 18,800 m. (Best, Ratcliffe & Wilkes 1936; Budden, Ratcliffe &
Wilkes 1939). These observations were supplemented by measurements made with
a mobile field strength measuring set at a large number of points at distances between
70 and 380 km. from the transmitter. The experiments at Cambridge corresponded
to an angle of incidence of the reflected wave on the ionosphere of about 30°, and
those with the mobile apparatus to angles between about 25° and 70°.

The theoretical interpretation of these experimental results presents some 
difficulty, arising from the fact that with very long waves it is not possible to make
the assumption which is admissible in the case of short waves, that the parameters of
the ionosphere change by only a small fraction of themselves in a distance equal to
the wave-length (Booker 1936, 1938; Forsterling & Lassen 1933). In other words,
the problem may not be treated by the methods of geometrical optics, and any 
theoretical discussion must proceed from a consideration of the differential equations 
of the magneto-ionic theory. 

Only in the case of vertical incidence (with the further assumption that the earth's
magnetic field is vertical) is it found that these equations can be handled at all
easily, and this case has been discussed in detail by the present author (Wilkes 1940).
There are then two components which are propagated independently in the ionosphere.
The experiments indicated, as far as they went, that the character of the
reflected wave did not depend markedly on the angle of incidence, and it was therefore
thought that some understanding of the mechanism of reflexion might be
obtained by applying vertical incidence theory to the case of waves received at
Cambridge, after reflexion at an angle of incidence of about 30°. While this method 
undoubtedly leads to a general explanation of the observed phenomena, it cannot 
be regarded as logically satisfactory until it is shown that there is no large change in 
the nature of the reflected wave as the angle of incidence is reduced from a few 
degrees to zero; there is the possibility that the radical simplification of the equations 
which takes place when incidence becomes normal may be accompanied by a radical 
change in the mechanism of reflexion. Further, the vertical incidence theory can of 
necessity throw no light on the variation of the phase of the reflected wave with 
angle of incidence. This information is needed in order to enable the true height of 
reflexion to be deduced from the data given in figure 1 of Budden et al. (1939); at
present only an effective height can be obtained. 

An attempt will be made in this paper to give a more satisfactory basis for the 
theoretical interpretation of measurements made at short and medium distances 
from a very long wave transmitter, by discussing the problem of reflexion at oblique 
incidence. The method of approach will be that of direct solution of the field equations 
subject to the condition that the wave-length is very long. The assumption that the 
earth’s magnetic field is vertical will be retained. 

It will be found that the solution obtained consists of the solution for the case of 
vertical incidence, together with certain correcting terms, which tend to zero as the 
angle of incidence is reduced. The use of the vertical incidence results as an approximation
to the solution in the case of oblique incidence is in this way justified.

In § 2 the equations of the magneto-ionic theory will be written down, and in
§ 3 it will be shown that the ionosphere may be divided into a 'transition' region and
a 'reflecting' region, of which the latter is the more important from the present
point of view. In § 4(a) the solution of the equations of propagation in the reflecting
region, assuming a linear variation of ionization with height, will be shown to depend
on the solution of a certain second-order ordinary differential equation, and in
§ 4(b) a solution of this equation in terms of a series of functions will be obtained.
Asymptotic forms will be given in § 4(c).
Where it is necessary to consider numerical values, those appropriate to the
experiments described by Best et al. (1936) and Budden et al. (1939) will be taken.
Detailed discussion of the application of the analysis to the interpretation of these
experiments will, however, be reserved for another paper. A brief indication of the
lines this discussion will follow will be given in § 5.

2. The field equations
The following notation will be used:
c = velocity of electromagnetic waves in vacuo.
m = mass of an electron.
e = charge of an electron.
N = electron density.
$\mathbf{H}_0$ = the earth's magnetic field, assumed vertical.
p = 2π × frequency of waves.
ν = a quantity which determines the effective damping, and is proportional to the
frequency of electronic collisions.
$X = \dfrac{Ne^2}{\epsilon_0 m p^2}$.*
$\mathbf{Y} = \dfrac{\mu_0 e\mathbf{H}_0}{mp}$.
$Y = |\mathbf{Y}|$.*
$Z = \nu/p$.*
E = electric field of the wave.
H = magnetic field of the wave.
P = polarization of the medium.
$\epsilon_0$ = dielectric constant of free space (8.854 pF/m. in practical units).
$\mu_0$ = permeability of free space ($4\pi \times 10^{-7}$ henries/m. in practical units).
$c = (\mu_0\epsilon_0)^{-1/2}$ = velocity of electromagnetic waves in vacuo.
$k = p/c$.
H = scale height of the atmosphere.

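The field equations are

$$\operatorname{curl}\mathbf{E} = -\mu_0\frac{\partial\mathbf{H}}{\partial t}, \qquad \operatorname{curl}\mathbf{H} = \epsilon_0\frac{\partial\mathbf{E}}{\partial t} + \frac{\partial\mathbf{P}}{\partial t}.$$

These lead to

$$\operatorname{curl}\operatorname{curl}\mathbf{E} = \mu_0\epsilon_0p^2\mathbf{E} + \mu_0p^2\mathbf{P} = k^2\mathbf{E} + \mu_0p^2\mathbf{P}, \tag{2-1}$$

where a time factor $e^{ipt}$ has been taken.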

The polarization P is thought of as arising from the displacement of the electrons 
in the medium under the action of the applied electromagnetic field of the wave 
from the orbits they would otherwise describe. As is usual in developing the 
magneto-ionic theory, it is assumed that this treatment is applicable to a medium 
containing free electrons. The extra polarization term of the Lorentz theory will not 
be included. If the magnetic field of the wave is neglected it follows that 


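$$\frac{\partial^2\mathbf{P}}{\partial t^2} + \nu\frac{\partial\mathbf{P}}{\partial t} = \frac{Ne^2}{m}\mathbf{E} + \frac{\mu_0e}{m}\mathbf{H}_0\wedge\frac{\partial\mathbf{P}}{\partial t},$$

or

$$\mathbf{P}(1-iZ) = -X\epsilon_0\mathbf{E} - i\mathbf{Y}\wedge\mathbf{P}. \tag{2-2}$$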

* These three quantities are often denoted by x, y, z. Large letters have been used in order 
to avoid confusion with the spacial co-ordinates. 





Since this analysis will ultimately be applied to the case of a plane wave incident
at an angle $\theta$ on a stratified medium, it is convenient to assume that all the dependent
variables are proportional to $e^{ikx\sin\theta}$ and are independent of $y$. The variation in the
$x$ direction will then be the same as that in the plane wave in free space, and Snell's
law will be automatically satisfied.† If the components of the vectors E and P
divided by this factor are denoted by $E_x$, $E_y$, $E_z$ and $P_x$, $P_y$, $P_z$, it follows, on
expanding (2-1), that

$$\left.\begin{aligned}
\frac{d^2E_x}{dz^2} - ik\sin\theta\,\frac{dE_z}{dz} &= -k^2E_x - p^2\mu_0P_x,\\
\frac{d^2E_y}{dz^2} - k^2\sin^2\theta\,E_y &= -k^2E_y - p^2\mu_0P_y,\\
-ik\sin\theta\,\frac{dE_x}{dz} - k^2\sin^2\theta\,E_z &= -k^2E_z - p^2\mu_0P_z.
\end{aligned}\right\} \tag{2-3}$$


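Expanding (2-2), and solving for $P_x$, $P_y$, $P_z$, taking the magnetic field vertical, we
have

$$P_x = -X\epsilon_0\frac{(1-iZ)E_x + iYE_y}{(1-iZ)^2 - Y^2}, \qquad P_y = -X\epsilon_0\frac{(1-iZ)E_y - iYE_x}{(1-iZ)^2 - Y^2}, \qquad P_z = -\frac{X\epsilon_0E_z}{1-iZ}.$$

Finally, substituting these values in (2-3) and rearranging, we have

$$\frac{d^2}{dz^2}(E_x \mp iE_y) + \frac{d}{dz}\left\{\frac{\sin^2\theta}{\cos^2\theta - X/(1-iZ)}\frac{dE_x}{dz}\right\} = -k^2\left[E_x \mp i\cos^2\theta\,E_y - \frac{X}{1 \pm Y - iZ}(E_x \mp iE_y)\right], \tag{2-4}$$

$$E_z = \frac{i\sin\theta}{k\{\cos^2\theta - X/(1-iZ)\}}\frac{dE_x}{dz}. \tag{2-5}$$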

3. The transition and reflecting regions

It is important to have an appreciation of the numerical values assumed by the
quantities X and Y in the practical cases presented here. Y may be expressed as
the ratio of the gyromagnetic frequency of electrons in the field of the earth to the
frequency of the waves, and consequently becomes large for low frequencies. For
waves of frequency 16 kc./sec. and latitude 52° the numerical value of Y is about 80,

† An ordinary transmitting aerial may be supposed to radiate plane waves in all directions,
and in order to discuss the characteristics of the wave received at a fixed point after reflexion
by the stratified medium, the value of $\theta$ corresponding to the predominant group must be
determined by the principle of stationary phase (Hartree 1931; Havelock 1914).




if the magnetic field concerned is taken to be the vertical component of the
earth's magnetic field. The expressions $1 \pm Y$ in (2-4) may therefore, without
sensible error, be replaced by $\pm Y$.

X is proportional to the ionization density, N, and if there is no imposed magnetic
field, and no collisional absorption, it is well known that the refractive index $q$ is
given by

$$q^2 = 1 - X.$$

The reflecting level for short waves is therefore, in the case of vertical incidence,
given by $X = 1$. If a magnetic field is introduced, it is found that the level $X = 1$
is still of importance, but that there is now another level at which reflexion may
take place, namely, where $X = 1 + Y$ (Booker 1934, 1935; Taylor 1934). (It is
assumed that the frequency is less than the gyromagnetic frequency.) It might be
expected that these two levels would retain their importance in the case of very long
waves, even though the application of wave theory, rather than geometrical optics,
is required for the solution of the problem. If, however, one puts numerical values
corresponding to a frequency of 16 kc./sec. in the equation defining X, one finds,
approximately, $X = \tfrac13 N$, with N in electrons per c.c. The level $X = 1$ thus corresponds
to an ionization density of only 3 electrons per c.c., and there is no doubt that such a
density will be reached at very low levels in the atmosphere where the electrons are so
heavily damped by collisions that they are without effect on the passage of the waves.
The level $X = 1$ (or $X = \cos^2\theta$, in the case of oblique incidence at an angle $\theta$) thus
loses its significance in the case of very long waves, and attention is to be concentrated
on the higher level where $X = 1 + Y$ (= 80 approximately).
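The orders of magnitude quoted above can be checked with a few lines of arithmetic. The sketch below assumes the usual rationalized definitions $X = Ne^2/(\epsilon_0 m p^2)$ and $Y$ = gyro-frequency divided by wave frequency; the vertical flux density of $4.5\times10^{-5}$ tesla is an illustrative figure chosen here, not a value taken from the paper.

```python
import math

# Physical constants in SI units (rounded).
E_CHARGE = 1.602e-19   # magnitude of the electron charge, coulombs
E_MASS = 9.109e-31     # electron mass, kg
EPS0 = 8.854e-12       # permittivity of free space, F/m


def x_per_electron_cc(freq_hz):
    """X = N e^2 / (eps0 m p^2) for a density of one electron per c.c."""
    p = 2.0 * math.pi * freq_hz          # angular frequency
    n_per_m3 = 1.0e6                     # 1 electron per c.c. in SI units
    return n_per_m3 * E_CHARGE**2 / (EPS0 * E_MASS * p**2)


def y_ratio(freq_hz, b_tesla):
    """Y = electron gyro-frequency / wave frequency."""
    gyro_hz = E_CHARGE * b_tesla / (2.0 * math.pi * E_MASS)
    return gyro_hz / freq_hz


FREQ_HZ = 16.0e3        # 16 kc./sec.
B_VERTICAL = 4.5e-5     # ASSUMED vertical flux density, tesla (illustrative)

x_one = x_per_electron_cc(FREQ_HZ)
y_val = y_ratio(FREQ_HZ, B_VERTICAL)
print(f"X for 1 electron per c.c.: {x_one:.3f}")
print(f"density at which X = 1:    {1.0 / x_one:.2f} electrons per c.c.")
print(f"Y for the assumed field:   {y_val:.1f}")
```

With these inputs X per electron per c.c. comes out close to one third and Y close to 80, in line with the figures used in the text.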

The maximum daytime ionization in the E region is of the order of several times
$10^5$, so that the maximum value of X is of the order of $10^5$. Viewed from the point
of view of reflecting very long waves, the ionosphere is thus highly over-dense.

The changes which are observed to occur in the constitution of the ionosphere as
the height of the level under consideration is increased will now be examined with
special reference to the magnitude of the second term in equation (2-4), which may
be written

$$\frac{d}{dz}\left(l\,\frac{dE_x}{dz}\right),$$

where

$$l = \frac{\sin^2\theta}{\cos^2\theta - X/(1-iZ)}. \tag{3-1}$$


The frequency of electronic collisions decreases with increasing height in proportion
to the decrease of air pressure. In a region where the scale height of the atmosphere
may be regarded as constant, Z will therefore decrease exponentially with
height according to the equation

$$Z = Z_0e^{-z/H},$$

where $Z_0$ is a constant.

The electron density, on the other hand, is an increasing function of height in the
region where very long waves are reflected. If it is assumed that reflexion takes
place at the bottom of a region of the type described by Chapman (1931a, b, 1939),
the variation of X with height may be expressed by the equation

$$X = X_0\exp[\tfrac12\{1 - z/H - e^{-z/H}f(\chi)\}],$$



where the height $z$ is measured from a certain reference level, and where $f(\chi)$ is a
function of the zenith angle $\chi$ of the sun which approximates to $\sec\chi$ for small values
of $\chi$.

Figure 1 has been drawn to illustrate the variation of Z and X with height. The
value of $X_0$ taken is $10^5$, and the scale height 6 km., the value deduced from the
experiments described in Budden et al. (1939). As the zenith angle of the sun changes,
the curve for X moves up and down the height axis by an amount approximately
equal to the change in $\log f(\chi)$ multiplied by the scale height. The curve for Z has
been drawn with an arbitrary value of $Z_0$; any alteration in the value of this constant
will have the effect of moving the curve bodily up or down the height axis.



At very low levels, $l$ is real and equal to $\tan^2\theta$, and $dl/dz$ is zero. As the height
increases $l$ becomes complex, and its modulus may increase in value, depending on
the value of $Z_0$. At high levels, Z decreases to a small value, and X becomes large;
both $l$ and $dl/dz$ become small compared with unity and $k$ respectively, and the second
term on the left-hand side of equation (2-4) may therefore be neglected. The region
in which this approximation may be made, and in which at the same time Z is
negligibly small compared with Y, will be referred to as the reflecting region, and the
region immediately below it as the transition region. The latter extends down to
levels where propagation is no longer influenced by the presence of ionization. The
use of these terms is justified below, where it is shown that on the wave-lengths
concerned no reflexion of energy takes place below the top of the transition region.



The height at which the second term in equation (2-4) first becomes negligible
may easily be found for the case of a Chapman region. From equation (3-1), if Z is
put equal to zero, and X assumed to be large, it follows that

$$\frac{dl}{dz} = \frac{\sin^2\theta}{X^2}\frac{dX}{dz}.$$

Taking for purposes of illustration the case referred to above in which the wave-length
is 18.8 km., we have the following numerical values; $X_0$ is taken equal to $10^5$,
H equal to 6 km. and $\chi = 0$.



X      z (km.)    l                          (1/k) dl/dz
5      -19.1      -0.20 $\sin^2\theta$       1.15 $\sin^2\theta$
10     -18.7      -0.10 $\sin^2\theta$       0.54 $\sin^2\theta$
20     -18.3      -0.05 $\sin^2\theta$       0.25 $\sin^2\theta$
80     -17.4      -0.012 $\sin^2\theta$      0.054 $\sin^2\theta$

It will be seen that in this case $l$ and $(1/k)\,dl/dz$ are sufficiently small compared with
unity to be neglected at all levels above that for which X is greater than about 10
or 20 depending on the value of $\theta$.
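The tabulated figures can be reproduced, to the accuracy shown, from the Chapman expression for X. The following is a minimal sketch under stated assumptions: $f(\chi) = 1$ (sun overhead), the large-X forms $l \approx -\sin^2\theta/X$ and $dl/dz \approx (\sin^2\theta/X^2)\,dX/dz$, and a simple bisection (a choice made here, not the paper's method) to find the height at which X takes a given value. The function names are illustrative.

```python
import math

X0 = 1.0e5                           # value of X_0 used for the illustration
H_KM = 6.0                           # scale height, km
K_PER_KM = 2.0 * math.pi / 18.8      # free-space wave number for 18.8 km waves


def chapman_x(z_km, f_chi=1.0):
    """X(z) = X0 exp[(1/2)(1 - z/H - exp(-z/H) f(chi))]."""
    u = z_km / H_KM
    return X0 * math.exp(0.5 * (1.0 - u - math.exp(-u) * f_chi))


def height_for_x(x_target, lo=-40.0, hi=0.0):
    """Bisection for the height (km) at which X reaches x_target.

    X increases monotonically with height between lo and hi for f(chi) = 1.
    """
    for _ in range(80):
        mid = 0.5 * (lo + hi)
        if chapman_x(mid) < x_target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def slope_coefficient(z_km, f_chi=1.0):
    """Coefficient of sin^2(theta) in (1/k) dl/dz, large-X approximation."""
    u = z_km / H_KM
    dlogx_dz = (math.exp(-u) * f_chi - 1.0) / (2.0 * H_KM)   # (1/X) dX/dz
    return dlogx_dz / (K_PER_KM * chapman_x(z_km, f_chi))


rows = []
for x in (5.0, 10.0, 20.0, 80.0):
    z = height_for_x(x)
    rows.append((x, z, -1.0 / x, slope_coefficient(z)))
    print(f"X = {x:4.0f}  z = {z:6.2f} km  l/sin^2 = {-1.0 / x:7.4f}  "
          f"(1/k) dl/dz / sin^2 = {slope_coefficient(z):6.3f}")
```

The heights and slope coefficients obtained this way agree with the tabulated entries to within rounding.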

Certain conclusions may be drawn about conditions in the transition region. If 
Z were sufficiently small, $l$ would become very large at the level where $X = \cos^2\theta$,
and reflexion of a large part of the incident energy might be expected to occur. As 
pointed out above, however, consideration of the numerical values involved allows 
this possibility to be ruled out absolutely, for it is found that the ionization density 
required would be only one or two electrons per c.c., and there can be no doubt that 
such a degree of ionization is reached at levels far below that at which Z can be 
neglected. 

A sharp decrease of Z occurs in the transition region, and if X were at all large this 
would lead to a sudden change in propagation conditions, and consequent reflexion 
of a considerable part of the energy. It is, however, easy to see that this is not the 
state of affairs in the ionosphere at any rate as far as waves of length 18.8 km. are
concerned, since if the values of Z and X were suitably adjusted to give the observed 
reflected wave at midday, say, this condition would no longer hold when the curve 
of X moved upwards with increase in the zenith angle of the sun; in other words, 
the changes in reflexion height would be accompanied by large changes in polariza¬ 
tion and amplitude of the reflected wave, which is the contrary of what is observed 
(Best et al. 1936; Budden et al. 1939). One must therefore conclude that Z has fallen
away sensibly to zero while X is still quite small. This point is dealt with more fully 
by Wilkes ( 1940 ). 

It is not necessarily true that the curve for X follows the shape of a Chapman
region down to the lowest levels of the transition region; there may well be departures
from it, due, perhaps, to the overlapping of other regions formed by the ionization
of different constituents of the atmosphere under the influence of different parts
of the sun's spectrum. The fact that in summer some absorption is observed which
remains constant while large changes are taking place in the reflexion height, would
point to the existence, at some level where Z is not negligible, of a stratum of
ionization not depending much on the zenith angle of the sun until near sunset.

With this exception (which only affects the amplitude of the wave, and not its 
polarization) it will be seen that the evidence is that the lower part of the transition 
region, i.e. the part in which Z is not negligible, has no appreciable effect on the 
waves passing through it. The possibility remains, however, that the upper part of 
the region, where Z is negligible, may produce some change in the polarization, the 
amount of which will depend on the thickness to be traversed, and on the rate of 
change of electron density. 

The reflecting region is simpler to study than the transition region not only
because the second term in equation (2-4) is negligible, but also because only the
variation of X with height has to be taken into account, the value of Z being effectively
zero. A study of the transition region, on the other hand, involves detailed
assumptions about the relative magnitudes of Z and X. Since it is possible, although 
by no means certain, that the transition region has an inappreciable effect on the 
waves passing through it, the most profitable way of proceeding appears to be to 
defer its study until the reflecting region has been studied in detail. It may then be 
possible, by making use of the experimental data referred to earlier, either to confirm 
the assumption that the action of the transition region is negligible, or alternatively 
to estimate its effect and thereby obtain information about the degree of ionization 
and the collisional frequency in it.

The remainder of this paper will be devoted to the study of the reflecting region 
with the assumption that ionization varies linearly with height. The analysis is 
somewhat involved, and the present paper goes no farther than to obtain the formal 
solution of equation (2-4). This solution will involve four arbitrary constants. In
order to obtain the phase and amplitude of the reflected wave it is necessary to
choose these constants so that the solution is finite at high level and satisfies certain
conditions at the lower boundary of the ionosphere, where the solution must join
continuously on to one representing an upgoing incident wave and a downgoing
reflected wave in free space. It is hoped to deal with these aspects of the problem 
in a later paper. 


4. The reflecting region
(a) Transformation of the equations

In this region where $l$ is, by definition, negligible, it is convenient to put

$$L = E_x + iE_y, \tag{4-1}$$

$$M = E_x - iE_y. \tag{4-2}$$


When $l$ is neglected, and Z put equal to zero, equations (2-4) become

$$\frac{d^2L}{dz^2} + k^2(1 - \tfrac12\sin^2\theta + X/Y)L + \tfrac12k^2\sin^2\theta\,M = 0, \tag{4-3}$$

$$\frac{d^2M}{dz^2} + k^2(1 - \tfrac12\sin^2\theta - X/Y)M + \tfrac12k^2\sin^2\theta\,L = 0, \tag{4-4}$$

where the further approximation has been made of replacing $1 \pm Y$ by $\pm Y$.

Before the equations can be solved, it is necessary to assume some law of variation 
of X with height. In this paper the simplest possible law will be taken, namely, 

$$X = KYz, \tag{4-5}$$

where $K$ is a constant. For many purposes this may be taken as a sufficiently close
representation of the ionosphere, but if necessary an approximation to the solution 
in the general case may be obtained by dividing the region into a series of subregions, 
in each of which the variation can be represented with sufficient precision by a 
straight line. 

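Substituting the value of X given by equation (4-5) into equations (4-3) and
(4-4), and writing

$$\zeta = (k^2K)^{1/3}z, \tag{4-6}$$

$$\alpha = (1 - \tfrac12\sin^2\theta)\,k^{2/3}K^{-2/3}, \tag{4-7}$$

$$\beta = \tfrac12\sin^2\theta\;k^{2/3}K^{-2/3}, \tag{4-8}$$

we have

$$\frac{d^2L}{d\zeta^2} + (\alpha+\zeta)L + \beta M = 0, \tag{4-9}$$

$$\frac{d^2M}{d\zeta^2} + (\alpha-\zeta)M + \beta L = 0. \tag{4-10}$$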
When $\beta = 0$, these equations become independent of each other, and it is easily
shown, by the method of Laplace transforms, that the following contour integrals
are solutions, provided the contour is suitably specified:

$$L(\zeta) = \int\exp[\tfrac13t^3 + (\alpha+\zeta)t]\,dt, \qquad M(\zeta) = \int\exp[\tfrac13t^3 + (\alpha-\zeta)t]\,dt.$$

These lead to the well-known solutions in terms of the Airy integral and its associated
function (Miller 1946; Jeffreys & Jeffreys 1946).

A similar approach is possible when $\beta$ is not equal to zero, but it is found that the
general solution cannot then be obtained in terms of a single contour integral, and
that the sum of several such integrals must be taken.
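Before turning to the contour-integral solution, it can be useful to have a direct numerical integration of the coupled pair for comparison. The sketch below is not the method of the paper: it assumes the coupled equations $L'' + (\alpha+\zeta)L + \beta M = 0$, $M'' + (\alpha-\zeta)M + \beta L = 0$ and integrates them with a classical fourth-order Runge-Kutta step. Because the coefficient matrix is symmetric, the cross-Wronskian $L_aL_b' - L_a'L_b + M_aM_b' - M_a'M_b$ of any two solutions is constant, which gives a simple accuracy check. The values of $\alpha$ and $\beta$ are illustrative.

```python
def rhs(zeta, y, alpha, beta):
    """First-order form of the coupled equations; y = (L, M, dL, dM)."""
    big_l, big_m, d_l, d_m = y
    return (
        d_l,
        d_m,
        -(alpha + zeta) * big_l - beta * big_m,
        -(alpha - zeta) * big_m - beta * big_l,
    )


def rk4(y0, zeta0, zeta1, steps, alpha, beta):
    """Integrate from zeta0 to zeta1 with classical fourth-order Runge-Kutta."""
    h = (zeta1 - zeta0) / steps
    y = tuple(y0)
    zeta = zeta0
    for _ in range(steps):
        k1 = rhs(zeta, y, alpha, beta)
        k2 = rhs(zeta + h / 2, tuple(a + h / 2 * b for a, b in zip(y, k1)), alpha, beta)
        k3 = rhs(zeta + h / 2, tuple(a + h / 2 * b for a, b in zip(y, k2)), alpha, beta)
        k4 = rhs(zeta + h, tuple(a + h * b for a, b in zip(y, k3)), alpha, beta)
        y = tuple(
            a + h / 6 * (p + 2 * q + 2 * r + s)
            for a, p, q, r, s in zip(y, k1, k2, k3, k4)
        )
        zeta += h
    return y


def cross_wronskian(ya, yb):
    """L_a L_b' - L_a' L_b + M_a M_b' - M_a' M_b, constant along zeta."""
    la, ma, dla, dma = ya
    lb, mb, dlb, dmb = yb
    return la * dlb - dla * lb + ma * dmb - dma * mb


ALPHA, BETA = 0.42, 0.06                 # illustrative values only
start_a = (1.0, 0.0, 0.0, 0.0)           # L = 1 at zeta = 0
start_b = (0.0, 0.0, 1.0, 0.0)           # dL/dzeta = 1 at zeta = 0
end_a = rk4(start_a, 0.0, 2.0, 2000, ALPHA, BETA)
end_b = rk4(start_b, 0.0, 2.0, 2000, ALPHA, BETA)
print("cross-Wronskian at start:", cross_wronskian(start_a, start_b))
print("cross-Wronskian at end:  ", cross_wronskian(end_a, end_b))
```

Setting `BETA = 0.0` decouples the two equations, and a solution started with M and its derivative equal to zero then keeps M identically zero.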


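Consider the integrals

$$I(\zeta) = \int S(t)e^{-\zeta t}\,dt, \tag{4-11}$$

$$J(\zeta) = \int T(t)e^{-\zeta t}\,dt, \tag{4-12}$$

where the contour of integration will be defined later. Denote by $R_L[I(\zeta), J(\zeta)]$,
$R_M[I(\zeta), J(\zeta)]$ the remainders when $I(\zeta)$, $J(\zeta)$ are substituted for $L(\zeta)$, $M(\zeta)$ in the
left-hand sides of (4-9) and (4-10). Then

$$R_L[I(\zeta), J(\zeta)] = \frac{d^2I}{d\zeta^2} + (\alpha+\zeta)I(\zeta) + \beta J(\zeta) = \int[(\alpha+t^2)S(t) + \beta T(t)]e^{-\zeta t}\,dt + \int\zeta S(t)e^{-\zeta t}\,dt.$$

If the second term is integrated by parts this becomes

$$R_L[I(\zeta), J(\zeta)] = \int\left[(\alpha+t^2)S + \frac{dS}{dt} + \beta T\right]e^{-\zeta t}\,dt - [S(t)e^{-\zeta t}] = -[S(t)e^{-\zeta t}],^{*} \tag{4-13}$$

provided

$$(\alpha+t^2)S + \frac{dS}{dt} + \beta T = 0. \tag{4-14}$$

Similarly

$$R_M[I(\zeta), J(\zeta)] = [T(t)e^{-\zeta t}], \tag{4-15}$$

provided

$$(\alpha+t^2)T - \frac{dT}{dt} + \beta S = 0. \tag{4-16}$$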


The normal procedure in applying a Laplace transformation to obtain a solution
of a pair of differential equations, would now be to specify the contour so that
$R_L[I(\zeta), J(\zeta)]$ and $R_M[I(\zeta), J(\zeta)]$ are both zero.† This is not possible in the present
case, but it will be seen later that if we define a finite series of functions $I_1(\zeta)$, $J_1(\zeta)$,
etc., by the relations

$$I_j(\zeta) = \int_{\Gamma_j}S_j(t)e^{-\zeta t}\,dt, \qquad J_j(\zeta) = \int_{\Gamma_j}T_j(t)e^{-\zeta t}\,dt,$$

where $S_j(t)$ and $T_j(t)$ satisfy (4-14) and (4-16), the integrals being taken along
different contours, then it is possible to choose constants $a_1$, $a_2$, etc., so that

$$\sum_j a_jR_L[I_j(\zeta), J_j(\zeta)] = 0, \tag{4-17}$$

$$\sum_j a_jR_M[I_j(\zeta), J_j(\zeta)] = 0. \tag{4-18}$$

The functions $L(\zeta) = \sum a_jI_j(\zeta)$, $M(\zeta) = \sum a_jJ_j(\zeta)$
then constitute a solution of (4-9) and (4-10).

It will be convenient at this stage to make a change of notation. Write

$$T = u(t)\exp(\tfrac13t^3 + \alpha t), \tag{4-19}$$

* Square brackets are used to denote the difference of the values of the enclosed function
at the two ends of the contour.
† Cf. Watson (1944, p. 160).



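then from (4-16)

$$S = \beta^{-1}\frac{du}{dt}\exp(\tfrac13t^3 + \alpha t). \tag{4-20}$$

Thus

$$I(\zeta) = \beta^{-1}\int\frac{du}{dt}\exp[\tfrac13t^3 - (\zeta-\alpha)t]\,dt, \tag{4-21}$$

$$J(\zeta) = \int u(t)\exp[\tfrac13t^3 - (\zeta-\alpha)t]\,dt. \tag{4-22}$$

By substituting for $S$ and $T$ in (4-14) it may be shown that the equation satisfied
by $u(t)$ is

$$\frac{d^2u}{dt^2} + 2(\alpha+t^2)\frac{du}{dt} + \beta^2u = 0. \tag{4-23}$$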

Equation (4-23) will be solved in a later section; for the time being it will be
sufficient to say that it has an essential singularity at infinity, but that if attention
is confined to one of the shaded sectors (numbered 1, 2, 3) in figure 2, it is possible
to obtain a solution which is regular at all finite points of the sector, and which tends
to a constant value at infinity. There are three solutions of this kind, $u_1(t)$, $u_2(t)$, $u_3(t)$,
corresponding to the three sectors. It is not possible to obtain a solution which is
regular and tends to zero at infinity in two specified sectors at the same time and such
that $u$ and $du/dt$ are continuous at the origin. It is for this reason that $L(\zeta)$ and
$M(\zeta)$ cannot be expressed in terms of a single integral along a contour starting at
infinity in one sector and ending at infinity in another, as is possible in the case of
Airy's integral.



Now define the following functions (cf. equations (4-21) and (4-22)):

$$I_n(\zeta) = \beta^{-1}\int u_n'(t)\exp[\tfrac13t^3 - (\zeta-\alpha)t]\,dt, \tag{4-24}$$

$$J_n(\zeta) = \int u_n(t)\exp[\tfrac13t^3 - (\zeta-\alpha)t]\,dt, \tag{4-25}$$

where $n$ has the value of 1, 2 or 3. The contour of integration is any line, which, lying
wholly in the sector $n$, starts at the origin, and ends at infinity.

If use is made of equations (4-13), (4-15), (4-19) and (4-20) it follows that

$$R_M[I_n(\zeta), J_n(\zeta)] = -u_n(0), \tag{4-26}$$

$$R_L[I_n(\zeta), J_n(\zeta)] = \beta^{-1}u_n'(0). \tag{4-27}$$

It will be noted that these remainders are independent of $\zeta$.

By restricting attention to three sectors only of the complex plane, and by
imposing the condition that $u_n(t)$ tends to a constant value at infinity, some loss of
generality is incurred. This may be restored in the following manner. Note that
equations (4-9) and (4-10) are interchanged if $L$ and $M$ are interchanged and $-\zeta$
is written for $\zeta$. Define

$$I_n'(\zeta) = J_n(-\zeta), \tag{4-28}$$

$$J_n'(\zeta) = I_n(-\zeta). \tag{4-29}$$

It is easily shown that

$$R_M[I_n'(\zeta), J_n'(\zeta)] = \beta^{-1}u_n'(0), \tag{4-30}$$

$$R_L[I_n'(\zeta), J_n'(\zeta)] = -u_n(0). \tag{4-31}$$

It will be seen in §4(c), from a consideration of the asymptotic forms, that the
six pairs of functions

$$I_n(\zeta),\ J_n(\zeta); \qquad I_n'(\zeta),\ J_n'(\zeta)$$

are linearly independent. Now take a linear combination of the functions, and
write

$$L(\zeta) = \sum a_nI_n(\zeta) + \sum a_n'I_n'(\zeta), \tag{4-32}$$

$$M(\zeta) = \sum a_nJ_n(\zeta) + \sum a_n'J_n'(\zeta), \tag{4-33}$$

where $a_n$, $a_n'$ are independent of $\zeta$. A solution of equations (4-9) and (4-10) follows
provided

$$\sum a_n\beta^{-1}u_n'(0) - \sum a_n'u_n(0) = 0, \tag{4-34}$$

$$\sum a_nu_n(0) - \sum a_n'\beta^{-1}u_n'(0) = 0. \tag{4-35}$$

Moreover, since any four of the six quantities $a_n$, $a_n'$ may be specified arbitrarily, it
follows that the expressions (4-32) and (4-33) constitute the general solution of (4-9)
and (4-10).



A formal solution of the problem in terms of a solution of the second order differential
equation (4-23) for $u(t)$ has thus been obtained. The next section will be devoted
to the study of this equation.

(b) Solution of the equation for u(t)

In all cases to which this analysis is applied $\beta$ is a small quantity. It is therefore
natural to seek a solution of equation (4-23) in powers of $\beta^2$. Accordingly, take the
trial solution

$$u_n(t) = 1 + \beta^2f_{n,1}(t) + \beta^4f_{n,2}(t) + \dots, \tag{4-36}$$

where $f'_{n,1}(t)$, $f'_{n,2}(t)$, etc., all tend to zero as $|t|$ tends to infinity in the sector $n$, so
that $u_n(t)$ tends to a constant value. Substituting and equating coefficients of
powers of $\beta^2$, we have

$$f''_{n,1}(t) + 2(\alpha+t^2)f'_{n,1}(t) + 1 = 0, \tag{4-37}$$

$$f''_{n,2}(t) + 2(\alpha+t^2)f'_{n,2}(t) + f_{n,1}(t) = 0, \tag{4-38}$$

$$f''_{n,r}(t) + 2(\alpha+t^2)f'_{n,r}(t) + f_{n,r-1}(t) = 0, \tag{4-39}$$

where the dashes denote differentiation with respect to the argument. These are all
linear differential equations of the first order, and may be solved by means of an
integrating factor. Hence

$$f'_{n,1}(t) = -\exp[-2(\tfrac13t^3+\alpha t)]\int_{t_1}^{t}\exp[2(\tfrac13s^3+\alpha s)]\,ds, \tag{4-40}$$

$$f'_{n,r}(t) = -\exp[-2(\tfrac13t^3+\alpha t)]\int_{t_r}^{t}f_{n,r-1}(s)\exp[2(\tfrac13s^3+\alpha s)]\,ds, \tag{4-41}$$

where the integrals are taken along contours in the $s$ plane starting at the points
$t_1$, $t_r$ and ending at the point $t$.

As mentioned above it will be sufficient to confine attention to the domains of
the $t$ plane which are shaded in figure 2; it will be shown that if the point $t_1$ is
suitably chosen, $f'_{n,1}(t)$ exists and tends to zero as $|t|$ tends to infinity.

r /•-!<* ^ ~| 

~( — 18 )i|_Jo Jo ■^ten I iI is large, 

using the well-known asymptotic expansion for the incomplete Gamma function.
Take $t_1$ to be the point at infinity in the direction indicated by the arrow in the
particular shaded sector under discussion, i.e. $t_1 = \epsilon_n\infty$, where

$$\epsilon_n = \exp[\tfrac13(2n-1)\pi i],$$

S'nA*) - 


SO that 
and hence 


(4-42) 




It follows that as I ^ I tends to infinity in the shaded sector under consideration 

where $A_1$ is a constant, depending on the choice of the lower limit of integration. It
is convenient to take this to be the point $t = 0$, so that

$$f_{n,1}(t) = -\int_0^t\exp[-2(\tfrac13\tau^3+\alpha\tau)]\,d\tau\int_{\epsilon_n\infty}^{\tau}\exp[2(\tfrac13s^3+\alpha s)]\,ds. \tag{4-43}$$

There is no loss of generality in taking $t = 0$ for the lower limit of integration, since
it may be shown that the effect of taking any other value would be merely to multiply
the value of $u_n(t)$ given by (4-36) by a constant.
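The behaviour of the first-order term at large $|t|$ can be illustrated numerically. The sketch below is an illustration under stated assumptions, not part of the paper's argument: it takes the sector $n = 2$, where $\epsilon_2 = -1$ and the contour lies along the negative real axis, takes $\alpha$ real and positive, and assumes the form (4-40) with $t_1 = \epsilon_2\infty$. On putting $t = -\rho$ the derivative becomes $f'_{2,1} = -\int_\rho^\infty \exp[-(g(\sigma)-g(\rho))]\,d\sigma$ with $g(\sigma) = \tfrac23\sigma^3 + 2\alpha\sigma$. A Laplace-type estimate (derived here, not quoted from the paper) then suggests $f'_{2,1} \approx -1/\{2(\rho^2+\alpha)\}$ for large $\rho$, so that the derivative tends to zero.

```python
import math


def g(sigma, alpha):
    """Exponent (2/3) sigma^3 + 2 alpha sigma met along the negative real axis."""
    return (2.0 / 3.0) * sigma**3 + 2.0 * alpha * sigma


def f1_prime_on_negative_axis(rho, alpha, intervals=20000):
    """f'_{2,1}(t) at t = -rho, by composite Simpson quadrature.

    The integrand exp[-(g(sigma) - g(rho))] is formed directly so that no
    large exponentials are ever evaluated; `intervals` must be even.
    """
    g_rho = g(rho, alpha)
    upper = rho + 1.0
    while g(upper, alpha) - g_rho < 50.0:     # integrand below e^-50 beyond here
        upper += 1.0
    h = (upper - rho) / intervals
    total = 0.0
    for j in range(intervals + 1):
        sigma = rho + j * h
        if j == 0 or j == intervals:
            weight = 1.0
        elif j % 2 == 1:
            weight = 4.0
        else:
            weight = 2.0
        total += weight * math.exp(-(g(sigma, alpha) - g_rho))
    return -total * h / 3.0


ALPHA = 1.0                                   # illustrative value
results = {}
for rho in (5.0, 10.0):
    exact = f1_prime_on_negative_axis(rho, ALPHA)
    estimate = -1.0 / (2.0 * (rho**2 + ALPHA))
    results[rho] = (exact, estimate)
    print(f"rho = {rho:4.1f}  quadrature = {exact:.6f}  estimate = {estimate:.6f}")
```

The quadrature and the estimate agree more closely as $\rho$ increases, as expected of a leading-order asymptotic form.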

Return now to (4-41), take $t_r = \epsilon_n\infty$, and integrate along a contour
starting at the origin.

$$f_{n,r}(t) = -\int_0^t\exp[-2(\tfrac13\tau^3+\alpha\tau)]\,d\tau\int_{\epsilon_n\infty}^{\tau}f_{n,r-1}(s)\exp[2(\tfrac13s^3+\alpha s)]\,ds. \tag{4-44}$$

It may be shown by induction that,.(f) exists and tends to the form as 

1 1 1 tends to infinity. 

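In evaluating the integrals in equations (4-24), (4-25) values of $f_{n,r}(t)$ will only be
required for points lying on straight lines radiating from the origin at angles $\tfrac13\pi$, $\pi$
and $\tfrac53\pi$ to the real axis, i.e. in the direction of the arrows in figure 2, and it is therefore
convenient to write

$$\tau = \rho\,\epsilon_ne^{i\theta}, \qquad s = \sigma\,\epsilon_ne^{i\theta}, \qquad t = T\epsilon_ne^{i\theta}, \tag{4-45}$$

where $|\theta| < \tfrac16\pi$ and $\rho$, $\sigma$, $T$ are real and positive. Ultimately the values for
which $\theta = 0$ will be required.

It will now be proved that the series (4-36) is uniformly convergent with respect
to $t$ in a domain of the complex plane containing the line $\theta = 0$, provided that $\beta^2$
is less than a certain quantity.

$$f_{n,1}(T\epsilon_ne^{i\theta}) = \epsilon_n^2e^{2i\theta}\int_0^T\exp[\tfrac23\rho^3e^{3i\theta} - 2\alpha\rho\,\epsilon_ne^{i\theta}]\,d\rho\int_\rho^\infty\exp[-\tfrac23\sigma^3e^{3i\theta} + 2\alpha\sigma\,\epsilon_ne^{i\theta}]\,d\sigma.$$

Taking the moduli of the integrands, and replacing the upper limit of the integration
in $\rho$ by infinity, we have

$$\begin{aligned}
|f_{n,1}(T\epsilon_ne^{i\theta})| &< \int_0^\infty\exp[\tfrac23\rho^3\cos3\theta - 2\alpha\rho\cos(\tfrac13(2n-1)\pi+\theta)]\,d\rho\int_\rho^\infty\exp[-\tfrac23\sigma^3\cos3\theta + 2\alpha\sigma\cos(\tfrac13(2n-1)\pi+\theta)]\,d\sigma\\
&= \sec^{2/3}3\theta\int_0^\infty\exp[\tfrac23\rho^3 - 2\alpha\rho\sec^{1/3}3\theta\cos(\tfrac13(2n-1)\pi+\theta)]\,d\rho\int_\rho^\infty\exp[-\tfrac23\sigma^3 + 2\alpha\sigma\sec^{1/3}3\theta\cos(\tfrac13(2n-1)\pi+\theta)]\,d\sigma\\
&= K(\alpha,\theta).
\end{aligned}$$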



That $K(\alpha,\theta)$ exists may be proved by an argument similar to that used to prove the
existence of $f_{n,1}(t)$.

By replacing $f_{n,1}(s)$ in the integral for $f_{n,2}(t)$ by its upper bound it may be shown
that $|f_{n,2}(t)| < [K(\alpha,\theta)]^2$, and continuing the process that

$$|f_{n,r}(t)| < [K(\alpha,\theta)]^r.$$

It now follows that the series (4-36) converges uniformly with respect to $t$ in the
domain defined by $-\theta_0 < \theta < \theta_0$ provided that $\beta^2 < 1/K(\alpha,\theta_0)$, where $K(\alpha,\theta_0)$ = upper
bound of $K(\alpha,\theta)$ in the domain. $K(\alpha,\theta_0)$ is a continuous function of $\theta_0$, and if

where $A$ is a constant, there exists a domain, containing the line $\theta = 0$, in every
bounded subdomain of which the series (4-36) is convergent uniformly with respect
to $t$. Within this domain, differentiation of the series (4-36) term by term is legitimate,
and its sum therefore represents a solution of equation (4-23).

Thus by taking $n$ in equation (4-44) equal to 1, 2, 3 in turn and substituting for
$f_{n,r}(t)$ in (4-36), expressions are obtained for the quantities denoted in §4(a) by
$u_1(t)$, $u_2(t)$, $u_3(t)$.
These may now be substituted in equations (4-24), (4-25) to give $I_n(\zeta)$, $J_n(\zeta)$ from
which solutions of the differential equations (4-9) and (4-10), governing propagation
in the reflecting region, may be constructed in the manner explained at the end of
§4(a).


(c) Asymptotic forms 

The following asymptotic formulae valid when $\zeta$ is large may now be established:



(4-46) 


(4-47) 

‘4(C) ~ ■“aCco) V^(C- cc)-i exp [f (f - a)*], 

(4-48) 

■4(0 ~ ‘A(C - «)“* exp [|(^- a)t], 

(4-49) 

‘4(0~“®^^V“*^^^+ o (i), 

(4-50) 

^3(0 ~ + 0 (i) J, 

(4-51) 

‘4'(0 ~i^“^C'iVMC+a)~*exp [|(C+a)»i + J7ri], 

(4-52) 




liiZ) ~ %(oo) exp [f (C+a)* i + Ini], 




- + 0 


(?)]• 



(4-53) 

(4-54) 


/,-K)~-^+ “i(°)W<') ^0(i), (4-55) 

J3{0~^~^02^n{^+a)-iexpl-^{^+a)ii-lni], (4-56) 

4'(?) ~ « 3 (oo) fn{^ +a)-i exp [ - |(C+«)* i - (4-57) 

In these formulae, $u_n(\infty)$ denotes the limiting value of $u_n(t)$ as $|t|$ tends to infinity,
and $C_n$ is a constant defined by

$$u_n'(t) \sim C_n/t^2$$

when $|t|$ is large.

The formula for J^(^) may be proved as follows. Write 


9i{^) = % W {¥^+; 

then, by repeated integration by parts, 

/•ejco 

Ji(0 = jo e-SVi(<)* 

Since the term in 1 tends to zero as ^ tends to infinity, this is an asymptotic series 
for $J_1(\zeta)$. Expressed in terms of $u_1(t)$, the first two terms reduce to form (4-46).

A similar argument may be used to establish the validity of the expansions (4-47),
(4-50), (4-51), (4-54) and (4-55).

The remaining expressions may be obtained by the method of steepest descents.
Consider, for example, $J_2(\zeta)$.

The exponential function has saddle points at $t = \pm(\zeta-\alpha)^{1/2}$, of which one
(corresponding to the lower sign) lies in the sector 2; if the contour of integration is taken
to pass through this point along a line of steepest slope, the expression given in
(4-48) is easily obtained. The same method may be applied to obtain (4-49).

There is a slight complication in applying the method of steepest descents to the
other functions, for example $J_1(\zeta)$, because the saddle points of the
exponential function occur at the points $t = \pm(\zeta+\alpha)^{1/2}\,i$, which lie on the boundaries
of the sectors within which $u(t)$ is finite for large $t$. It may, however, be shown that
when $\zeta$ is large, the important contribution to each integral comes from those
portions of the contours which are remote from the origin, and that the leading





terms in the asymptotic expansions may be obtained by putting in the limiting
forms for $u_n(t)$ and $u_n'(t)$, i.e. $u_n(\infty)$ and $C_n/t^2$, and evaluating the resulting integrals
by the method of steepest descents in the usual manner.

A study of the asymptotic forms given in this section shows that, as already
stated in §4(a), the six pairs of functions $I_n(\zeta)$, $J_n(\zeta)$; $I_n'(\zeta)$, $J_n'(\zeta)$ are linearly
independent.


5. Application to the ionosphere 

A solution of (4·9) and (4·10) representing the most general wave which can be
propagated in the reflecting region has now been obtained. The next step in applying
the analysis to the ionosphere is to impose on this solution two conditions: (a) that it
should be finite at infinity, and (b) that it should be continuous, at the lower
boundary of the reflecting region, with expressions representing the waves (upgoing
and downgoing) propagated in the region immediately below. If the effect of the
transition region is neglected, the latter is equivalent to free space, and the analysis
follows the lines of that in Wilkes (1940, p. 149), but generalized to allow for oblique
incidence.

In order to carry out the procedure outlined above, it is necessary to evaluate the
functions $I_n(\zeta)$, $J_n(\zeta)$ and their derivatives at the lower boundary of the reflecting
region, where $\zeta$ is zero or has some given small value. If $\beta$ is sufficiently small for its
second and higher powers to be neglected, only two terms of the series for $u_n(t)$ are
needed, and it is found that $I_n(\zeta)$, $J_n(\zeta)$ may be expressed by means of power series
in a form suitable for computation. This approximation is sufficiently accurate for
the discussion of the experimental results on the propagation of 16 kc./sec. waves
referred to in the Introduction, since it will be found on substitution in (4·8) that
even when $\theta$ is as great as 60°, $\beta$ is only 0·13, while for smaller values of $\theta$ it is even
smaller. On higher frequencies the approximation obtained by neglecting $\beta^2$ will
correspond to reflexion at small angles of incidence.

The calculations will not be carried through in detail in the present paper, but it
is found that the four linearly independent expressions for $L(\zeta)$ and $M(\zeta)$ obtained
from (4-32) and (4-33) by assigning values to the six constants aj,,, subject to the two 
conditions (4·34) and (4·35), are equivalent to the solution for $\beta = 0$ plus a correcting
term of order $\beta$. It follows that the error in calculating the wave reflected from the
medium on the basis of a solution for vertical incidence is of the order $\beta$. One is
therefore justified in using the results of the analysis of Wilkes (1940) to give a first
approximation in the case of oblique reflexion at small angles of incidence.

I would like to express my thanks to Dr D. Smithies for several helpful discussions
and to Mr J. A. Ratcliffe for advice as to the final form the paper should take. I am
also indebted to Professor D. R. Hartree, F.R.S., Dr H. G. Booker and Mr G. Millington
for reading the manuscript and commenting on it, and to Dr and Mrs Jeffreys
for kindly allowing me to see in proof certain sections of their forthcoming book.





References 

Best, Ratcliffe & Wilkes 1936 Proc. Roy. Soc. A, 156, 614.

Booker 1934 Proc. Roy. Soc. A, 147, 352.

Booker 1935 Proc. Roy. Soc. A, 150, 267.

Booker 1936 Proc. Roy. Soc. A, 155, 235.

Booker 1938 Phil. Trans. A, 237, 411.

Budden, Ratcliffe & Wilkes 1939 Proc. Roy. Soc. A, 171, 188.

Chapman 1931a Proc. Phys. Soc. 43, 26.

Chapman 1931b Proc. Phys. Soc. 43, 483.

Chapman 1939 Proc. Phys. Soc. 51, 93.

Försterling & Lassen 1933 Ann. Phys., Lpz., 18, 26.

Hartree 1931 Proc. Roy. Soc. A, 131, 428.

Havelock 1914 Camb. Math. Tract, no. 17.

Jeffreys & Jeffreys 1946 Methods of Mathematical Physics. Camb. Univ. Press.

Miller, J. C. P. 1946 British Association Mathematical Tables, Part-volume B,
prepared by J. C. P. Miller.

Taylor 1934 Proc. Phys. Soc. 46, 408.

Watson 1944 Theory of Bessel Functions, 2nd ed. Camb. Univ. Press.

Wilkes 1940 Proc. Roy. Soc. A, 175, 143.




The fine structure of the spectral lines of H₂:
measurements with a reflexion echelon

By E. W. Foster and Sir Owen Richardson, F.R.S.

(Received 5 December 1944)

[Plate 4]

The fine structure of the lines of the H₂ spectrum was discovered by Richardson & Williams
in 1931, and a short account of their observations was published in a letter to Nature at
the time. Since then no further publication on the subject has appeared. The present paper
represents the results of experiments and observations made during the last six years with
improved apparatus, including the Royal Society’s 40-plate reflexion echelon. About 250
lines at the red end of the visible spectrum have been examined and their fine structures
photographed and measured. These include most of the important lines of the triplet systems 
3 p®i 7 -» 25 ®Z' and as well as important lines of the system 

and 3F->2s^i7, together with some lines of the triplet system The singlets are 

represented by about 20 lines coming from the states Zd^U^, 25^2*, Z^K, 

and and ending in 2p^N or belonging to the system Zp^II-^2s^E. There is also a small 
proportion of unclassified lines. 

The present paper is limited to a descriptive account of the observations and the analysis 
and tabulation of the data obtained. The analysis of these data has already been carried 
out and the results will be published separately. 

The plate which reproduces a photograph of the crossed spectra of the stronger lines dealt
with is the first published picture showing the fine structure under consideration.


1. Introduction

This fine structure was discovered by Richardson & Williams in 1931 and the only
published description of it is in a letter to Nature (Richardson & Williams 1931)
which is brief and provisional. The observations were made with a system consisting
of a large Hilger quartz spectrograph crossed with a reflexion echelon. Those now
to be described were made by the same method using a larger echelon (40 plates)
and other improvements. The apparatus was, in fact, the same as that used by
Drinkwater, Richardson & Williams (1940) to obtain the results published in the
paper on ‘Determinations of the Rydberg constants, e/m, and the fine structures of
Hα and Dα by means of a reflexion echelon’, apart from modifications adopted to
meet the changed conditions of excitation and character of the lines. Altogether
244 lines measured by Gale, Monk & Lee (1928) have been photographed and
measured and are here described and analysed. Actually the number of lines is
considerably more than 244, as many of these are blends. This often increased the
difficulty of interpretation. A photograph of the fine-structure patterns of the
strongest of them is reproduced in plate 4. They lie between the wave numbers
17741·92 and 15443·06 (λ 5634·807(8) to 6473·615(2)).

The photographing of the spectra was all done in 1938-39 and was brought to a 
sudden end by the events of September 1939, which included the evacuation of 


Vol. 189. A. (17 April 1947) 




King’s College, London. This accounts for the limited range of the spectrum 
represented. However, it is much the most interesting and important part, having 
regard to the resolution which can be brought to bear in practice on this spectrum 
at the present time. There were other regions which we should have looked into 
had there been time. 

The work subsequent to September 1939 was carried on mainly at Chandos Lodge, 
Alton, Hampshire. This included all measurements of the plates and their reduction, 
the enlarging and photometry and the analysis and interpretation of the resulting 
data. 

The scope of this paper is confined to a description of the experimental results. 
The delay in publication is due to the frustrations and interruptions which are 
inevitable under conditions of total war. The analysis of these results will be dealt 
with in subsequent papers. 


2. Spectrographic equipment and methods

The spectrographic equipment consisted of a reflexion echelon (40 plates, step
width 6·88 mm.) and a Hilger (E1 type) Littrow spectrograph fitted with a glass
optical train. The two instruments were crossed so that the high dispersion of the
echelon was in a vertical plane perpendicular to the auxiliary coarse dispersion of
the Littrow instrument. Since the particular dispersing apparatus used by us has
already been described in detail (Williams 1933; Williams & Middleton 1939) we
shall not here do more than briefly refer to it.

A mirror is mounted at each side of the echelon almost parallel to the reflecting
faces of the steps. The normals to these mirrors are slightly inclined on opposite
sides of the normal to the reflecting faces of the echelon so that the images of the
echelon collimator slit in the mirrors provide a series of reference marks (M₁, M₂,
plate 4) which are used for plate measurement. These marks also determine the
change in the magnification of the optical system with wave-length.

The echelon is mounted in a vacuum chamber and molecular lines were measured
directly (§6) in terms of the vacuum wave number of the standard Cd red line
λ_air 6438·4696 Å or λ_vac. 6440·2491 Å. Additional measurements were made with
the vacuum chamber filled with air at a pressure of a few centimetres of mercury.
The inter-order dispersion was obtained from measurements of the separation of
very sharp argon lines appearing in approximately the double-order position.
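
The two wave-lengths quoted for the Cd red line fix both the vacuum wave number used as the standard and the refractive index of air implied between them. The following Python is only an illustrative sketch; the function names are hypothetical and the only data used are the two wave-lengths given above.

```python
# Wave-lengths of the standard Cd red line quoted in the text (angstroms).
LAMBDA_AIR_A = 6438.4696
LAMBDA_VAC_A = 6440.2491


def vacuum_wavenumber_cm(lambda_vac_angstrom):
    """Vacuum wave number in cm^-1 from a vacuum wave-length in angstroms."""
    return 1.0e8 / lambda_vac_angstrom


def implied_refractive_index(lambda_vac_angstrom, lambda_air_angstrom):
    """Refractive index of air implied by the ratio of the two wave-lengths."""
    return lambda_vac_angstrom / lambda_air_angstrom


nu_cd = vacuum_wavenumber_cm(LAMBDA_VAC_A)
n_air = implied_refractive_index(LAMBDA_VAC_A, LAMBDA_AIR_A)

print(f"Cd red line: {nu_cd:.3f} cm^-1 in vacuum")
print(f"implied (n - 1) of air: {n_air - 1.0:.7f}")
```

The choice of air-dispersion formula enters through this refractive index, which is why it matters for the comparison with earlier measures discussed in §9.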

On each plate it was usual to record the standard Cd line several times with different
exposures both before and after an exposure of the molecular spectrum. The
argon spectrum also was usually exposed after the molecular spectrum. Unless
there was good reason to do otherwise a plate was discarded if the Cd exposures
before and after were not in agreement. This happened in only one case (when it is
believed that some mechanical shock had disturbed the relative orientation of the
echelon and its mirrors) after precautions to keep the laboratory temperature
constant (± 0·1° C.) had been taken.




The molecular spectrum was photographed under high dispersion between about
5000 and 6500 Å, using Ilford ‘Hypersensitive Panchromatic’ plates. The faintest
structures recorded required an exposure of about 30 hr.

The spectrum was measured between ν = 17859·76 and ν = 15443·06. The method
of computing the wave numbers of the rather broad molecular lines is dealt with
in §6.


3. Excitation of the molecular hydrogen spectrum

The spectrum was observed ‘end on’ through a window on one of the 30 cm. 
long vertical limbs of a U-shaped bend in the central part of a pyrex discharge tube, 
of overall length 180 cm. and uniform internal diameter about 1 cm. The ends of the 
tube widened to contain hollow cylindrical thin sheet-nickel electrodes, 12 cm. long,
2·5 cm. diameter.

Arrangements were provided for exhausting the tube by two-stage mercury 
diffusion pumps, one at each end of the tube. Liquid air-cooled traps were provided 
to prevent mercury vapour from entering the discharge. At one end of the discharge 
tube pure hydrogen could be admitted from a reservoir by diffusion through an 
electrically heated palladium tube. A small McLeod-type gauge was joined close to 
the pump connexion at the other end. A liquid air-cooled charcoal trap could also
be connected to the discharge tube to counteract air leakage; it had the disadvantage 
of slowly absorbing hydrogen, but it was connected to the discharge tube during a 
long exposure as a precaution. 

The discharge tube and electrodes were brought to a very clean state by the prolonged
passage of a discharge in hydrogen with repeated pumping and flushing out
with pure gas. In the early stages the discharge had the usual pinkish tinge, sometimes
with striations, and Hα was very strong compared with any molecular lines in
the visible spectrum. When the last traces of impurity were gone the positive column
became white everywhere, and in this condition, which was always used when photographing
the molecular spectrum, the strongest yellow and red molecular lines were
comparable in visual intensity with Hα. Subsequently no impurities of any description
were ever observed in the hydrogen discharge unless purposely admitted.

For photographing the molecular spectrum the tube contained hydrogen at a 
pressure of about 0·15 mm. Hg, which was excited to the white stage by a direct
current of about 20 mA. The U-shaped bend in the tube was conveniently cooled 
by immersion in a deep Dewar flask containing liquid oxygen. This sharpened the 
molecular lines very considerably and enabled the resolution of many hitherto 
unknown structures. 


4. Excitation of the argon spectrum

Very sharp argon lines were easily and conveniently excited in the hydrogen tube
described in §3. To this was attached a reservoir of spectroscopically pure argon,
and a few flushings with this gas while passing an alternating current discharge, at
about 100 mA, sufficed to remove residual hydrogen almost completely (Hα appeared
very faintly). The converse process more rigorously applied was used to remove
argon before an exposure. When photographing the argon spectrum the discharge
tube bend was cooled in liquid oxygen.

5. Excitation of the standard cadmium red line

The Cd red line was excited in a water-cooled hollow cathode tube very similar 
to that described by Schüler (1931). The iron cathode, about 3 cm. deep, 1 cm. bore,
contained a few pieces of metallic cadmium. It was found that the red line could be
strongly excited in such a tube (exposure time about 2 min.) at lower current (about
80 mA) using pure argon as the carrier gas instead of the more usual helium (about 
200 mA), presumably due to the greater sputtering of the cadmium in argon. This 
resulted in such an obviously sharper contour for the red line that argon was adopted 
in spite of the more elaborate gas purification system required. 

Pure argon, at a pressure of about 0·05 mm. Hg, was circulated continuously
through the Cd tube. The purification was effected by including in the circulator
system a second (hot-cathode) Schüler tube containing metallic calcium. This tube
passed a current of several amperes and effectively absorbed the inevitable residue 
of oxygen and nitrogen. Residual hydrogen was removed in the usual manner by 
passing the gas through a quartz tube containing red-hot copper oxide. Water 
vapour and remaining impurities were condensed in a liquid oxygen-cooled trap. 
The continuous circulation was effected by a two-stage mercury diffusion pump. 

The wave number of the Cd line as excited by this source was not compared with 
the wave number from a standard source. Williams & Gogate (1938), however, have
compared the wave number from a Cd-He Schüler tube indirectly with a standard
Cd source and have found no appreciable difference. We have assumed that the 
substitution of argon for helium carrier gas will make no difference to the wave 
number of the Cd line. 

6. Measurement and reduction of plates

As will be seen from plate 4 (the dispersion is 0·726 cm.⁻¹ per order) the molecular
lines are quite broad and, for reasons discussed below, it was not possible, in general,
to reduce measurements directly by the simple method devised by Williams (1931)
for sharp lines and thereby obtain results of the desired accuracy.

The equation of the echelon intensity envelope may be written

$$I = \frac{\sin^2 X}{X^2},$$

where

s being the echelon step width, λ the wave-length and α and β the (very small)
angles of incidence and reflexion, for the reflecting surface of the step. When X = 0
a line is said to be in single-order position; in which case, if the natural intensity
distribution in the line is symmetrical, it will remain symmetrical in the diffraction
pattern. For all other values of X between ±π the original symmetry will be distorted
by the envelope effect.

It appears that the position of the ‘optical centre of gravity’ of a distorted component
in the echelon pattern can be measured to a good approximation by the
method described in the following paragraph. It is not obvious why this should be
so, but we adopted the method in spite of its empirical nature since with the time
and apparatus available it would have been impossible to microphotometer more
than a small fraction of the lines actually measured, and owing to the considerable
blending in the coarse dispersion it is doubtful whether the accuracy of the visual
measures could have been improved much in very many cases.

Measurements were made with a specially constructed Hilger measuring microscope
(Williams & Middleton 1939), the unusual feature of which was a travelling
stage. On any one plate the position of a component was measured with respect to
the reference marks M₁ and M₂ associated with the line in question. As a rule, twelve
settings were made on the visually estimated centre of intensity of the component
and six on each of the (considerably sharper) associated reference marks. Six of
the settings on the component and three of those on the reference marks were made
traversing the plate in the direction of increasing order of interference, and corresponding
settings were made while traversing in the opposite direction. To change
from the first measuring position to the second, the plate was rotated through 180°
about a normal axis so that the microscope stage required moving in the same direction,
whether for measurements in the direction of increasing or decreasing order of
interference. The mean of the two sets of measures was taken. When a component
appeared in more than one order all orders were measured, but it seldom happened
that more than two orders were visible. The weakest lines were measured on only one
plate but the majority have been measured on five or six plates.

The problem remaining was to relate the position thus obtained for the ‘optical
centre of gravity’ of the component with the position which the component would 
assume if its width is imagined to become vanishingly small, i.e. with the limiting 
position of the line, as the temperature of the source tends to zero. When this 
is solved the calculation of wave number may be concluded by Williams’s method 
referred to above. 

The procedure adopted was as follows: 

(i) The characteristic linear inter-order dispersion was obtained by measurement
of several sharp argon lines appearing in almost symmetrical double-order position,*
X = ±½π (when the intensities in the two orders are equal). These dispersions
were reduced to the value for the Cd red line, and it was assumed that this was the
dispersion for infinitely sharp lines. Similar measurements were made on a number
of molecular lines also appearing in (or very nearly in) the double-order position
(the equality of intensity was checked microphotometrically), and the apparent
linear dispersion at λ Cd red was computed for these lines also. The value was less
than the value for the sharp argon lines by about 0·02 order. One-half the difference
between the argon and hydrogen order separations was taken as the displacement
of a typical hydrogen line towards the centre of the intensity envelope at X = ±½π.
We refer later to this quantity as the ‘half-order defect’.

* Cases where this is approximately realized for lines of the H₂ spectrum can be seen on
plate 4: for the following lines of 3p ³Π → 2s ³Σ: 15552·37 (10), 4→4, Q1; 15859·86 (5), 3→3, R0;
16121·52 (8), 2→2, R0; 16314·99 (4), 0→0, P4; 16440·72 (10), 1→1, P1; 16611·43 (10), 0→0,
Q1; for the 2→0, P1 line of 3p ³Σ → 2s ³Σ, 15669·39 (4); for the high-frequency component
of the following lines ending on 2p ³Π (the low-frequency components which lie to the left are
invisible in the left-hand orders owing to the envelope effect): 16458·03 (5), 3d ³Π → 2p ³Π,
2→3, R1; 16654·24 (9), 3d ³Σ → 2p ³Π, 0→0, Q3; and 16692·00 (5), 3d ³Π → 2p ³Π, 1→1, P3.

(ii) The position of the intensity maximum relative to the midpoint of the mirror
marks was found approximately for any plate by determination of the distance
between the position of the midpoint of the mirror marks and the midpoint of the
orders of a line in double-order position. A line was considered to be in double-order
position when microphotometer measures showed that the intensities in the two
orders were equal. Having found the approximate position of the intensity maximum
relative to the midpoint of the mirror marks it was a simple matter to find an approximate
value of the fractional part $f_1$ of every component referred to the intensity
maximum, from the fractional part $f_m$ derived directly from the measurements.

(iii) It was assumed that for a line having a fractional part $f_1$ the displacement D
towards the centre of the intensity curve would be proportional to the slope of that
curve, i.e.

$$D = \frac{2K\sin X}{X^3}\,(X\cos X - \sin X)$$

at the point where the line occurred. D was plotted against X with the value of K
adjusted so that the half-order defect was equal to D when X = ±½π.

(iv) The defect in the fractional part for a line in any position could then be found.
This was added to the observed fractional part $f_m$, giving the position which the line
would have occupied had it been infinitely sharp. The remainder of the calculation
was carried out by the method of Williams.

Results for the same line in different orders and in different positions (position 
varied by changing pressure in the echelon chamber) showed good agreement, so 
that for practical purposes the assumptions made were justified. 
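
Steps (i) to (iv) can be sketched numerically. The Python below is only an illustration under stated assumptions, not the authors' working: it takes the envelope as sin²X/X², assumes X = πf₁ (so that the double-order position X = ±½π corresponds to f₁ = ±½), and uses a half-order defect of 0·01 order, i.e. one-half of the 0·02 order difference quoted in (i). All function names are hypothetical.

```python
import math


def envelope_slope(x):
    """Slope of the envelope sin^2(X)/X^2: 2 sin X (X cos X - sin X) / X^3."""
    if x == 0.0:
        return 0.0  # the envelope has its maximum at X = 0
    return 2.0 * math.sin(x) * (x * math.cos(x) - math.sin(x)) / x ** 3


def calibrate_k(half_order_defect):
    """Choose K so that |D| equals the half-order defect at X = +-pi/2."""
    return half_order_defect / abs(envelope_slope(math.pi / 2.0))


def displacement(f1, k):
    """Displacement D (in orders) of a broad line at fractional order f1.

    Assumes X = pi * f1. D carries the sign of the slope, so it always
    points towards the centre of the envelope (X = 0).
    """
    return k * envelope_slope(math.pi * f1)


def sharp_line_fraction(f1, k):
    """Fractional order the line would occupy if infinitely sharp."""
    return f1 - displacement(f1, k)


HALF_ORDER_DEFECT = 0.01  # orders; one-half of the 0.02 order quoted in (i)
K = calibrate_k(HALF_ORDER_DEFECT)

for f1 in (-0.5, -0.25, 0.0, 0.25, 0.5):
    d = displacement(f1, K)
    print(f"f1 = {f1:+.2f}  D = {d:+.4f}  sharp position = {sharp_line_fraction(f1, K):+.4f}")
```

With this calibration the correction vanishes in single-order position and reaches the half-order defect in double-order position, which is the behaviour the procedure relies on.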


7. On the possibility of systematic error in the results from measurements
on particular plates and comparison of mean results with
Gale, Monk and Lee’s data

The interferometer measures of Gale et al. (1928) on eleven single, or substantially single,
lines are compared with our individual results from particular plates and with
the means of our results in table 1. The eleven lines are all members of the system
3p ³Π → 2s ³Σ. There is little doubt that these lines have some structure, but for
practical purposes they are observationally simple with the resolution that can be
brought to bear on them at present.



Wave numbers in italic type have been obtained from the measurement of one
order of a line only; in ordinary type they have all been obtained on measurements
on two orders.

The ‘weighted mean of all our results = ν_p’ may include results other than those
given in the table. For comparison of results on individual plates, only results from
measurements on the same number of orders for a given line on any one plate have
been included.

Table 1

For each line the upper row gives our result (decimal part only) and the lower row
gives (G.M.L. result) − (our result) = ‘defect’.

                                   plate number                     weighted mean    defect
G.M.L. = ν_G           40       83       84       85       86       of all our       ν_G − ν_p
                                                                    results = ν_p

15552·37* (10)         ·327     ·334     ·328     ·320     ·327     ·327             ·043
  4→4, Q1              ·043     ·036     ·042     ·050     ·043     ± ·008

15621·96* (10)         ·962     ·963     ·959     ·958     ·958     ·960             ·000
  3→3, P3              −·002    −·003    ·001     ·002     ·002     ± ·005

15767·10* (7)          ·062     ·071     ·067     ·070     ·068     ·067             ·033
  3→3, Q3              ·038     ·029     ·033     ·030     ·032     ± ·006

15787·23* (5)          ·218     ·216     ·213     ·219     ·206     ·212             ·018
  3→3, Q2 and ?†       ·012     ·014     ·017     ·011     ·024     ± ·008

15800·76* (10)         ·739     ·748     ·740     ·744     ·746     ·743             ·017
  3→3, Q1 and ?†       ·021     ·012     ·020     ·016     ·014     ± ·008

15870·10* (10)         ·066     ·067     ·068     ·057     ·058     ·063             ·037
  2→2, P3              ·034     ·033     ·032     ·043     ·042     ± ·006

16046·28* (7)          ·249     ·251     ·253     ·249     ·258     ·252             ·038
  2→2, Q2              ·031     ·029     ·027     ·031     ·022     ± ·006

16121·52* (8)          ·490     ·498     ·494     ·491     ·486     ·492             ·028
  2→2, R0              ·030     ·022     ·026     ·029     ·034     ± ·004

16168·95* (10)         ·915     ·916     ·914     ·912     ·918     ·915             ·035
  2→2, P1              ·035     ·034     ·036     ·038     ·032     ± ·004

16330·59* (10)         ·567     ·566     ·568     ·569     ·571     ·576             ·014
  1→1, Q1              ·023     ·024     ·022     ·021     ·019     ± ·006

16611·43* (10)         ·405     ·401     ·400     ·396     ·395     ·399             ·031
  0→0, Q1              ·025     ·029     ·030     ·034     ·035     ± ·003

mean defects           ·026     ·023     ·026     ·028     ·028     -                ·027


† See table 3 for possible weak blends.


The wave number of the line 15552·37 may be affected by the nearby 15549·47 (3),
3p ³Π → 2s ³Σ, 3→3, P4, which does not appear on our plates.

From table 1, and an expanded table of the same form which includes lines not
measured with the interferometer by Gale et al., it is concluded that our plates give
self-consistent results and that there is no systematic error relative to the mean
result from all others, in any one of them, greater than about 0·002 cm.⁻¹.
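
The plate-by-plate comparison can be repeated from the figures of table 1. The Python below is a sketch only: the decimal parts are transcribed from the table in thousandths of a cm.⁻¹ (so the check is only as good as that transcription), and the names used are hypothetical. It recomputes each defect as (G.M.L. result) − (our result) and averages over the eleven lines for each plate.

```python
# Decimal parts in thousandths of a cm^-1, transcribed from table 1.
GML = [370, 960, 100, 230, 760, 100, 280, 520, 950, 590, 430]
OURS = {
    40: [327, 962, 62, 218, 739, 66, 249, 490, 915, 567, 405],
    83: [334, 963, 71, 216, 748, 67, 251, 498, 916, 566, 401],
    84: [328, 959, 67, 213, 740, 68, 253, 494, 914, 568, 400],
    85: [320, 958, 70, 219, 744, 57, 249, 491, 912, 569, 396],
    86: [327, 958, 68, 206, 746, 58, 258, 486, 918, 571, 395],
}


def defects(plate):
    """Defect of each line on one plate: (G.M.L. result) - (our result)."""
    return [g - o for g, o in zip(GML, OURS[plate])]


def mean_defect(plate):
    """Mean defect over the eleven lines for one plate, in thousandths."""
    d = defects(plate)
    return sum(d) / len(d)


plate_means = {plate: mean_defect(plate) for plate in OURS}
overall = sum(plate_means.values()) / len(plate_means)

for plate, mean in plate_means.items():
    print(f"plate {plate}: mean defect {mean / 1000:.4f} cm^-1, "
          f"relative to all plates {(mean - overall) / 1000:+.4f} cm^-1")
```

The deviation of each plate from the average of all five is the quantity relevant to the claim of self-consistency; the common offset is the systematic difference from Gale et al. taken up in §9.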





8. On the possibility of our mean results being
subject to a systematic error

The H₂ lines were not measured directly in terms of the cadmium red line as
an internal standard in the discharge tube, but one plate (no. 39) was obtained
with a discharge atmosphere consisting of hydrogen and argon, and a few hydrogen
lines given in table 2 were measured in terms of an accurately known argon line
(ν = 15580·984₀ ± 0·0005 cm.⁻¹) which had been measured previously against the
cadmium line as internal standard (plate no. 79). The results should, for practical
purposes, be as significant as if the H₂ lines had been measured against cadmium
as an internal standard in a hydrogen discharge.


Table 2

G.M.L. result (ν_G)                       15552·37*   15767·10†   16126·19†   16440·72†   16611·43†
our mean result, decimal part‡            ·327        ·067        ·149        ·693        ·399
result on plate 39, decimal part          ·341        ·065§       ·144§       ·690        ·404
defect, (G.M.L.) − (our mean result)      ·043        ·033        ·041        ·027        ·031
defect, (G.M.L.) − (plate 39 result)      ·029        ·035        ·046        ·030        ·026

* See remark on this line on p. 155.
† Gale et al. interferometer measures.
‡ Figures in this row are taken from table 3.
§ Measured in one order only; the other figures in this row are from two orders.


From table 2 are obtained the mean defects: ·035 cm.⁻¹ for the mean results
(ν_G − ν_p) and ·033 cm.⁻¹ for the measures on plate 39.

Hence it is concluded that there is no important systematic error in our final (ν_p)
results of table 3, unless there is some fundamental error of method.


9. On the difference between our results and those
of Gale, Monk and Lee

From tables 1 to 3 it is clear that there is an appreciable difference between our
vacuum wave numbers and those of Gale et al. The irregular part of this difference
in some cases is mainly due to the difficulty of eliminating completely the interference
of neighbouring lines in this closely packed spectrum. But there is evidently
a systematic difference of roughly 0·03 cm.⁻¹, in the region 15,500 to 16,600 wave
numbers, our values being lower. For this region, about 0·015 cm.⁻¹ of the difference
is accounted for by the different air-dispersion formulae used in the two cases.*

There seems to be no reason to doubt the validity of our results within the limits
given. The residual difference of about 0·015 cm.⁻¹ (0·006 Å.) is difficult to
account for.
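
The equivalence of 0·015 cm.⁻¹ and 0·006 Å quoted above follows from λ = 1/ν, so that a small difference δν corresponds to |δλ| = δν/ν². A short Python sketch, with a hypothetical function name, evaluates this across the region concerned:

```python
def wavelength_shift_angstrom(wavenumber_cm, delta_wavenumber_cm):
    """|d(lambda)| = d(nu) / nu**2, converted from cm to angstroms."""
    return delta_wavenumber_cm / wavenumber_cm ** 2 * 1.0e8


RESIDUAL_CM = 0.015  # residual wave-number difference quoted in the text

for nu in (15500.0, 16000.0, 16600.0):
    shift = wavelength_shift_angstrom(nu, RESIDUAL_CM)
    print(f"nu = {nu:.0f} cm^-1: {RESIDUAL_CM} cm^-1 corresponds to {shift:.4f} A")
```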

* The measurements of Gale et al. were reduced with the formula of Meggers & Peters (1918),
whilst in our case the vacuum wave number of the Cd line was computed using the formula of
Sears & Barrell (1939).




10. On deducing the structures

In this, the minimum amount of attention was paid to what was expected of a
line, but in some of the more confused and badly blended lines it was often necessary
to have some guidance as to structure besides what was provided by our observations.
The structures given may not invariably be a unique interpretation of the
observations in such cases, but, coupled with making a tolerably good fit with the ν
of Gale et al. and a reasonable balance with their intensities and also a fair fit with
what might be expected from the structure and properties of related lines for which
the determinations are free from ambiguity, it is believed that the structures are
substantially correct in most, if not all, cases. In the relatively few cases where our
results show a substantial disagreement with those of Gale et al., it may sometimes
be taken as a sign that something is amiss, but this does not always apply, since the
poorest fits are mostly concerned with lines that were evidently not properly resolved
by Gale et al., and hence probably not accurately measured. Moreover, the
intensity distribution in our spectrum and that of Gale et al. or of Merton & Barratt
(1922) is quite appreciably different. On our plates, as a rule, lines with high
rotational quantum number (say K > 3) are relatively quite weak compared with
the Gale et al. value, so that in complex lines involving, for example, one such line
and a low rotational line, comparatively large differences in the composite ν may
be tolerable.


11. Example of analysis of a complex structure

We shall now describe an example of the treatment used in one of these, relatively
few, instances where the echelon observations together with the results of Gale et al.
are insufficient to provide a unique interpretation of complex patterns; so that we
are obliged to make use of some empirical and certain theoretical predictions to
analyse the patterns. No simple routine can be stated since every case is different,
but the following example gives some idea of the kind of treatment.

The example chosen is the following group of lines taken from table 3:

ν           intensity   classification                               reference   approx. 2νt
                                                                     letter      (order of interference)
16917·50    (3)         3s ³Σ → 2p ³Π, 1→1, Q1                       X           23273·61
16916·57    (0)         1→1, Q′2 and 3d ³Δ → 2p ³Π, 2→2, P6          Y           23272·34          (1)
16915·33    (3)         3s ³Σ → 2p ³Π, 1→1, Q3                       Z           23270·63


The path difference 2t corresponding to the position of the intensity maximum is
about 1·3757124 cm., so that the order of interference at the position of the intensity
maximum is as given in the right-hand column above. The range of wave numbers
in this group is only about 2·2 cm.⁻¹, so that the members appear as a single pattern.
Comparing the values of 2νt, it is clear that overlapping orders will be formed for the lines in X
and Y if both are single or if both have similar structures.
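
The orders of interference listed for X, Y and Z follow directly from the product 2νt. The Python below is a minimal sketch using only the path difference and wave numbers quoted in the text; the variable and function names are hypothetical.

```python
PATH_DIFFERENCE_CM = 1.3757124  # 2t at the intensity maximum, as quoted in the text

LINES_CM = {"X": 16917.50, "Y": 16916.57, "Z": 16915.33}  # wave numbers, cm^-1


def order_of_interference(wavenumber_cm):
    """Order of interference 2*nu*t for a line of the given wave number."""
    return PATH_DIFFERENCE_CM * wavenumber_cm


orders = {label: order_of_interference(nu) for label, nu in LINES_CM.items()}

for label, n in orders.items():
    print(f"{label}: 2vt = {n:.2f}, decimal part {n % 1.0:.2f}")

# One whole order corresponds to a wave-number interval of 1/(2t).
print(f"one order = {1.0 / PATH_DIFFERENCE_CM:.4f} cm^-1")
```

The decimal part of each order is what is compared with the measured fractional parts in the discussion that follows.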




Our observations on this group combined measurements on four lines designated
(A+), (?b), (B) and (A), where (A+) was evidently the next higher order of (A) and
need not be considered further for the present discussion. This leaves:

component                (?b)       (B)        (A)
observed intensity       1/3        3          5
fractional part f_m      −0·223     −0·438     −0·681


It will be noted that none of these fractional parts corresponds with the fractions
of the listed orders of interference (1). Now it is clear from the intensities (3) and (3)
assigned respectively to X and Z and from the assignments of intensity (3) and (5)
to (B) and (A) respectively that, as far as the bulk of the intensity is concerned,
(B) and (A) together somehow belong to the lines X and Z. Since the decimal parts
of the orders of interference for X and Z are about the same and since Gale et al.'s
intensities for these lines are also the same, this suggests that about one-half of (B)
and one-half of (A) should be assigned to X and the other halves of each to Z.
Assigning weights 3 and 5 to lines (B) and (A) respectively, then

\[ \frac{3 \times 0.438 + 5 \times 0.681}{8} = 0.590 \]

is obtained for the fractional part of the 'optical centre' of these components, in
satisfactory agreement with the decimal parts of the orders of interference for
X and Z.

In further support of the above is the fact that X and Z are classified lines, and
it is known from results obtained on many rather similar lines ending on 2p ³Π that
each is likely to appear as a doublet, with components a and b about 0.18 cm⁻¹

apart and intensity ratio ^. It therefore appears that X is composed of 


1^62 intensity 2 




KB) 3 

ratio should be = ^ * This suggests that B still contains some unassigned 

strength. 

Turning now to the line Y, one has the component (?b) and the unassigned
intensity of about 1 in (B). If it be supposed that these two constitute the line Y, then

\[ \frac{1 \times 0.223 + 3 \times 0.438}{4} = 0.38 \]

is obtained for the fractional part of the 'optical centre', which is in good enough
agreement with the decimal part of the corresponding 2νt. Thus to Y are assigned
components a₂, b₂, having intensities 1 and 1/3 respectively.
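The two 'optical centre' figures used in this section are intensity-weighted means of the observed fractional parts. A short sketch of that arithmetic follows, using the values quoted in the text; the function name `optical_centre` is an illustrative choice, not the authors' terminology for a procedure.

```python
def optical_centre(fractions, weights):
    """Intensity-weighted mean of fractional parts of the order of interference."""
    total_weight = sum(weights)
    return sum(f * w for f, w in zip(fractions, weights)) / total_weight


# Components (B) and (A), weights 3 and 5: compared with lines X and Z.
centre_xz = optical_centre([0.438, 0.681], [3, 5])

# Component (?b) with intensity 1/3 and the residual intensity of about 1 in (B):
# compared with line Y. Weights 1/3 and 1 are equivalent to weights 1 and 3.
centre_y = optical_centre([0.223, 0.438], [1 / 3, 1])

print(f"optical centre for X and Z: {centre_xz:.3f}")
print(f"optical centre for Y:       {centre_y:.2f}")
```

These values are to be compared with the decimal parts of the orders of interference listed for X, Y and Z (0.61, 0.34 and 0.63).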

It thus appears that the three observed components (?b), (B) and (A) are the
result of the blending of at least six lines, of which component (A) has two and
(B) has three. The effect on the pattern of the possible 2→2, P 6 in Y is assumed
to be negligible since it would be very weak in our spectrum.





The fine structure of the spectral lines of hydrogen 

A final check on these assignments is provided by the combinations between the
individual components a₁, b₁, a₂, b₂, a₃, b₃ with other lines having final levels of the
2p ³Π state. These will be considered in a subsequent paper dealing with the analysis
of the lines ending on that state.

In table 3 the observed Δν's for 16917.50 and 16915.33 are both 0.193 cm⁻¹, but
it is known from the behaviour of many lines ending on 2p ³Π that more probable
values are the alternatives also entered in column 3. Our incomplete resolution does
not show up these small differences.


12. General notes to table 3

Columns 1-3. Column 1 gives Gale et al.'s values of ν for the lines followed by their
intensity estimates. Column 2 gives the transitions of the classified lines, the
notation used being that of Richardson (1934). The abbreviation α is used for the
Fulcher system 3p ³Π → 2s ³Σ. In column 3, Δν gives the separation of the fine-
structure components of the lines ending on 2p ³Π of column 1 where this is great
enough to be measured.

Column 4. In column 4 the notation used indicates the fine-structure components.
Lines ending on 2p ³Π have two components, but as they usually differ in
intensity one may not appear on the photographs.† These are always referred to
as a and b, a being the higher frequency and also the stronger component; in some
cases numerical subscripts are added to these letters. Letters other than a and b are
used to designate structures which have not been allocated as components of a line
ending on 2p ³Π.

Column 5. The numbers in column 5 represent the intensities of the observed fine
structures. Where only one component is observed in any particular line and there is
no reason to suspect others of appreciable strength, the intensity I is always written
I = 1. If another component is suspected (for example, because our measurements
do not agree with those of Gale et al.) but has not been observed we write I = ~1.

† On plate 4 the a components are always to the right-hand side of the b component in the
same order of interference. The following examples show the doublet (a + b) near the single-order
position, so that the intensities of the images represent approximately the relative intensities
of the components: 16132-09 (5), 2d 2p 2 2 , Q2 (the lower part of the image of the h 

component is covered by 16126*19(10) in this picture): 16603-20(9), 2d^E-^2p^n, 1->1, 
Ql: 16710*63(6), 2d^E-^2p^n, l->-l, P2. (In this instance a and b are unresolved— 
Δν = 0.06 cm⁻¹, and their intensities are nearly equal. The direct evidence that the line is
a doublet is the abnormal width of the pattern.)

Cases in which the doublet is not close to the single-order position are numerous. Owing to
the intensity envelope effect the weaker (b) component is frequently not seen in the lower
(left-hand) of the two orders. Examples are; 16458-03(5),3d^i7<i->2p®i7,3-4-3, J? 1,16654-24(9), 
2d^E-^2pm, 0->0, Q3-, 16692-00(5), 2dm^-^2pm, 1->1, P3; 16764-12(4), U^E-^2pm, 
0^0, Q'2; 16854-86(10), 2d^E-^2pm, 0->0, Ql; 16978-52(10), 2dm^-^2pm, 1 -> 1 , El; 
17192-31(8), U^A,-^2pm, 2 -> 2 , ©3; 17261-34(4), 2dm,^2pm, 0->0, Q2; 17279-01(7), 
2d^Aa^2p^n, 2 -^ 2 , El (here the weaker b component also can be seen in the order on the 
right where the a component is missing): 17316-56(5), 3d2jo 1 - 4 - 1 , Q'2; 17426-25(10), 
2d^Aa^2pm, 1 ^ 1 , El. 




The symbol ~ is also used to denote intensities less accurate than usual when
more than one component is observed.

When two or more components of any single line are observed, the intensity
figures are intended to be rough values of intensity ratio. The figures are rough, since
(except where otherwise indicated) they are derived from ordinary visual estimates,
but a correction has always been applied for inaccuracy caused by components
occurring at different positions on the echelon intensity envelope.

The intensity-ratio system has been extended to include all components of two
or more different lines when an asterisk * is placed beside the intensity figure. If this
system is applied to two different groups of close lines which occur consecutively
in the table, the members of the second group are distinguished from those of
the first by having two asterisks **. The intensities of members of each group are
relatively correct, but the figures are not consistent in passing from one group to
another.

Column 6. Column 6 gives the decimal part of our ν values for the structures,
the integral parts being the same as for the corresponding figures of column 1
except where a change is indicated by the insertion of the units figures in column 6.
It should be remembered that our ν is usually about 0.03 cm⁻¹ less than that of
Gale et al.
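The column 6 convention can be made concrete with a small sketch. It assumes that a units figure shown in column 6 replaces the units digit of the column 1 value, with the higher digits chosen so that the result stays as close as possible to the column 1 value; that tie-breaking rule, the function names and the second example entry are assumptions made for illustration, not statements from the paper.

```python
import math

# Typical offset quoted in the text: the echelon values lie about 0.03 cm^-1
# below those of Gale et al. (an average figure, not an exact constant).
SCALE_OFFSET_CM1 = 0.03


def full_wavenumber(column1_value, column6_decimal, units_figure=None):
    """Rebuild a complete wave number from a column 1 value and a column 6 entry.

    column6_decimal is the decimal part (0 <= d < 1). If column 6 also shows a
    units figure, pass it as units_figure; the tens and higher digits are then
    chosen so that the result lies as close as possible to the column 1 value
    (an assumed rule).
    """
    if units_figure is None:
        return math.floor(column1_value) + column6_decimal
    tens_base = math.floor(column1_value / 10.0) * 10
    candidates = [tens_base + shift + units_figure + column6_decimal
                  for shift in (-10, 0, 10)]
    return min(candidates, key=lambda value: abs(value - column1_value))


def to_gale_scale(echelon_value):
    """Shift an echelon value to the Gale et al. scale using the typical offset."""
    return echelon_value + SCALE_OFFSET_CM1


# Note 1 of table 3: line 17572.31 with a component whose decimal part is 0.364.
component_a = full_wavenumber(17572.31, 0.364)

# Hypothetical column 6 entry "3.98" set against a column 1 value of 17004.40.
shifted = full_wavenumber(17004.40, 0.98, units_figure=3)
```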

Column 7. We believe that the figures given as tolerances in column 7 correspond
to about twice the probable error (which can seldom be evaluated owing to
indeterminate effects caused by blends).
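For an idealised, blend-free line measured several times, the relation between tolerance and probable error can be sketched as follows. This assumes normally distributed errors, for which the probable error is 0.6745 times the standard error of the mean; the repeated readings are hypothetical, and the authors stress that in practice the probable error can seldom be evaluated in this way.

```python
import math
import statistics

# For a normal error distribution, half of the deviations fall within
# 0.6745 standard deviations of the mean.
PROBABLE_ERROR_FACTOR = 0.6745


def tolerance_from_repeats(decimal_parts):
    """Return (mean, tolerance), taking the tolerance as twice the probable error."""
    mean_value = statistics.mean(decimal_parts)
    standard_error = statistics.stdev(decimal_parts) / math.sqrt(len(decimal_parts))
    probable_error = PROBABLE_ERROR_FACTOR * standard_error
    return mean_value, 2.0 * probable_error


# Hypothetical repeated measurements of the decimal part of one component (cm^-1).
hypothetical_repeats = [0.362, 0.366, 0.364, 0.365, 0.363]
mean_value, tolerance = tolerance_from_repeats(hypothetical_repeats)
print(f"mean = {mean_value:.3f} cm^-1, tolerance = {tolerance:.4f} cm^-1")
```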

In such a complex spectrum as H₂, using a wide slit, and with only the coarse
dispersion of a Hilger E1 spectrograph available to separate the lines, many cases
of overlapping patterns and overlapping orders of different lines necessarily occur
(quite apart from the numerous blends known to be present in the lines in Gale et al.'s
list in addition to those explicitly mentioned). Knowing the echelon step width it is
easy from Gale et al.'s table to predict when such overlapping orders are likely to
occur in neighbouring lines. To condense our notes we have not, therefore, always
drawn attention to these instances of overlap. However, where they are known to
occur or where they might be expected to occur we have allowed a greater tolerance
than would have been quoted otherwise.

Ordinarily, where there is no confusion, the tolerance figures depend on the number
of times a line has been measured and on the concordance of the results.

In general, the tolerance figures given are a kind of resultant of the effect of the 
concordance of the measured values and of the factors referred to in the original 
notes to table 3 (see below). 

Column 8. The numbers in column 8 refer to a series of notes which will be found 
at the end of table 3. 

Our original notes to table 3 are elaborate and are too bulky for publication.
They include remarks on (a) blending, including predicted wave numbers and
relative intensities of the lines of a blend, (b) details of our analysis of complex
structures, and (c) new observational data which could not be tabulated. The
residue of these notes referred to in column 8 is mainly concerned with (c) and a
few of the more outstanding of the other items.

(a) Wave numbers of blended lines are predicted by starting with a related line
whose wave number is known and whose combination difference with the blended
line is also known. The accuracy of this process can vary a great deal. There may be
as many as six or more observationally sharp lines all free from blends and all
connected with the predicted line by accurately known combinations.

In such a case there should be little difference between the six or more results,
and the mean should be of the same kind of accuracy as that of the measurement of
any good line in the spectrum.

At the other extreme the only related line available may itself be a blend. In this
case one has to calculate the frequency of the constituent one is interested in. This
requires a knowledge of the intensities of the two constituents, which can rarely be
obtained accurately. Finally, the known combination on which one depends may not
be very reliable.

There is obviously scope for much variation in the tolerance of the final result
given by these processes, and assessing its value requires delicate judgement.
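The two extremes described above can be sketched in a few lines. The first function averages predictions obtained from several related lines and their combination differences; the second recovers one constituent of a two-line blend, assuming (as in the 'optical centre' treatment of § 11) that the measured position of a blend is the intensity-weighted mean of its constituents. All numerical values and names below are hypothetical illustrations, not data from table 3.

```python
from statistics import mean


def predict_from_combinations(related_lines):
    """Predict a wave number from (known wave number, combination difference) pairs.

    Returns the mean prediction and the spread between the extreme predictions,
    which gives a rough idea of the tolerance to attach to the result.
    """
    predictions = [nu + difference for nu, difference in related_lines]
    return mean(predictions), max(predictions) - min(predictions)


def unblend(blend_centre, intensity_wanted, intensity_other, other_wavenumber):
    """Solve the intensity-weighted centroid of a two-line blend for one constituent."""
    total_intensity = intensity_wanted + intensity_other
    return (total_intensity * blend_centre
            - intensity_other * other_wavenumber) / intensity_wanted


# Hypothetical sharp, unblended related lines and their combination differences.
related = [(16000.10, 100.02), (16050.00, 50.11), (15900.30, 199.83)]
predicted, spread = predict_from_combinations(related)

# Hypothetical blend centred at 100.50 with constituents of intensity 3 and 1,
# the weaker one being known to lie at 100.20.
wanted = unblend(100.50, intensity_wanted=3, intensity_other=1, other_wavenumber=100.20)
```

The second function makes plain why uncertain intensities matter: any error in the assumed intensity ratio shifts the recovered wave number directly.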

Predicted wave numbers are always given in square brackets [ ], and unless
otherwise stated they are on our scale, i.e. ν_F. It should be remembered that
ν_G − ν_F ≈ 0.03. Predicted intensities are also given in [ ]. I(a)/I(b) is frequently
written I(a/b) for convenience of printing. Similarly, I(R0/Q2) means I(R0)/I(Q2),
etc. If, for example, I(R0/Q2) = [8K/3K], this means that I(R0) = 8K and
I(Q2) = 3K, K referring to the scale of Kapuscinski & Eymers (1929).

(b) To cut down additional notes we have frequently used the following scheme
to indicate briefly how the various components in a complex pattern (including one
or more of Gale et al.'s lines) have been assigned.

As already pointed out, intensities of members of any particular complex pattern
are usually asterisked, i.e. they are all on the same scale. In such a group, besides
the usual letter for each component of a line (column 4) another letter is quoted in
brackets. These bracketed letters indicate the observed components, and they may
occur more than once amongst the various members of the asterisked group, showing
how each observed component has been assigned to the different transitions. See,
for example, the lines 16915.33, 16916.57, 16917.50 in table 3. In such cases,
corresponding bracketed intensities are also given, the unbracketed intensities
indicating how the observed intensity has been distributed amongst the various
lines. In several such cases there are not enough data to distribute the intensities
uniquely from the purely observational material.



Table 3

[The entries of table 3 are not legible in this copy. Its columns, as described in § 12,
are: (1) ν and intensity of Gale et al.; (2) transition; (3) Δν; (4) fine-structure
component; (5) intensity; (6) decimal part of our ν; (7) tolerance; (8) note number.]

Notes referred to in table 3, column 8

note number, ν_G

1   17572.31   There may be a further component, in which case the alternative
               structure is:

               component   I(a/b)   ν_F     tolerance   notes
               a           2.5      0.364   0.006       -
               β₂          0.3      0.19    0.03        (unassigned)
               b           1.0      0.14    0.02        -
               β₁          0.5      0.10    0.02        (unassigned)

               All b components are incompletely resolved; their mean ν_F = 17572.127.

2   17461.59   Pl. 86. Discharge tube pressure 0.3₀ mm. Hg, current = 25 mA.
               Pl. 40. Discharge tube pressure ~0.16 mm. Hg, current = 15 mA.
               Lines considerably sharper with Pl. 40 conditions.

3   17441.32   P 2 = [17441.21]; this may be faint. There is no evidence for the bracketed
               intensity assignments except that I(a/b) = ~4/1 gives a mean ν which
               gives a good combination and is consistent with the 'regular and
               irregular' doublet scheme.

4   17433.50   Most of (x) might belong to 17436.24 (1 A).

5   17261.34   Gale et al.'s intensities for 17261.34 and 17260.69 apparently inverted by
               misprint.

6   17252.65   Line probably strengthens with increase of pressure and temperature.
               Bad combination with R 4.

7   17218.50   R 2 [17218.51], Q′4 [17218.47]. I(R2/Q′4) = ~[15K/8K]. a and b have
               each been assigned to R 2 and Q′4. Aq Q′4 : R′S interval seems
               satisfactory though this does not carry much weight because of poor
               measures on 17433.50. I(a/b) R 2 should be ~4/1 and I(a/b) Q′4 ~6/2.
               These do not combine to give the observed result. On our plates the
               line is weak for (8) (G).

8   17144.93   Some of (a) probably belongs to 17143.60 (0) G.

9   17125.07   Both components probably confused: see 17004.40. There is very little
               confirmatory evidence of ours for by combinations. ? ?6 component
               may be entirely due to 17127.12 (0) G.

10  17033.21   I(Q3/Q4) probably > 1. y assigned twice and it may also belong to
               17031.67, 17030.88 and 17030.19.

11  17023.67   (g) and (5) unresolved. (?0) may be partly b of 1→1, Q′2.

12  17004.40   z (q) does not appear to belong to 0→0, P′2; possibly x does. There
               may be other components masked by 17006.43.

13  16990.70   1→1, P 3 needs [3/4] of the total strength. Only 4/11 has been assigned.
               This analysis gives doublet separations larger than expected for 1→1,
               R 3 and the two lines of 16988.72, q.v., with which the pattern is badly
               mixed.

14  16988.72   See note 13. 1→1, P′2 needs [~1/3] total strength. Q 1 uncertain
               but expected to be < 2/3 total strength. It is assumed that there are
               no overlapping orders in 88.72 and 90.71, otherwise there are
               insufficient data for a more detailed analysis.

15  16917.50, 16916.57, 16915.33
               See § 11 for discussion of this group.

16 

16841-63 

or some of it, might be due to 16836*04 (1) » 

I(Q1 : Q′2 : Q3 : Q′4) = [3K : 1K : <3K : <0.3K].


There appear to be no lines between x and y with or 5) a 

possible interpretation is = + x = Q3, ^p = Q'4: with in¬ 

tensities 1;Q'2:Q3:Q'4) X [7:2:3:-|-]; but there are several other 
possibilities. 




[Plate 4. Foster and Richardson, Proc. Roy. Soc. A, volume 189. Wave numbers labelling
the patterns on the plate: 16611.43, 16603.20, 16696.31, 16314.99, 16297.25, 16294.37,
16208.92, 16203.93, 16168.95, 16132.09, 16126.19, 16060.31, 16046.28, 16039.18,
16032.26, 17261.34, 17218.50, 17199.28, 17192.31, 16240.81, 17451.59, 17441.32,
17431.57, 16393.70, 16488.15, 16479.06, 16476.10, 16469.93, 16440.72, 16413.37,
16410.14, 16399.79.]


17 

16834-27 

(Z), (m), (n) not clearly resolved. 

18 

16822*00 

Very bad combination. Our results for the (*) lines of this group rather 
doubtful: there may be an alternative a^lysis. 

19 

16802-37 

I{ajh) seems to tend towards lower values with our higher temperature 
and pressure discharge. 

20 

16774*97 

P2 [16775*06]: most of the strength in (a) component is due to the blend 
16774*42, vP% is probably a bit high. Only very feeble signs of a; ( 6 )^ 
(5') may be feebly blended respectively with the a and h components 



of 16772*74 (1). 

21 

16764*12 

Microphotometer gives I(a/b) = 2.85 ± 0.3.

22 

16753*44 

I{ajh):k’M^ unless there is a blend (’a;) with the low frequency com¬ 
ponents; there may be something in this belonging to 16752-69 (Oa). 

23 

1^596*37 

Surprisingly faint for a G (10) line on our plates.

24 

16517-89 

The centre of intensity seems to shift towards the low v end of this group 
-with a high temperature source and most of the strength then seems to 
be in or a^h-^ and dg, aU of which are blended by the increased line 

width. This may account for the different ν's of different observers.

25 

16488*15 

(?a 5 ) and ( 02 ) are resolved on only one plate. There may be only one 
component and the apparent resolution a spurious effect of grain. 
Mean a') = 16488*175±0-01. 

26 

16469*93 ' 

Mean v for a and h is 16469*93o ± 0-005 cm."*^. tar may be spurious. 

27 

? 16458*03 

ν may differ from the given value by n times (0.727) cm.⁻¹, where n is an
integer (or zero), probably about 5.

28 

16330*59 

The z component probably contains b′ of 16332.11 as a weak blend.

29 

16039*18 

The wave number may be variable (? blend). At the lowest pressures we
get 16039.167 cm.⁻¹.

' 30 

16032-09 

Poor fit with Gale at oZ. Perhaps there are some overlapping orders 
giving, for example, a component at wr—0*727 om.*“^. 

31 

16031*77 

Remarks of note 30 apply except that ν = ν_z + 0.727 cm.⁻¹.
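
Notes 27, 30 and 31 describe wave numbers that may be displaced by whole multiples of 0.727 cm.⁻¹, which note 30 associates with overlapping orders. The following is a minimal sketch, not taken from the paper, of how the candidate values for such a line could be listed. The function name `order_candidates` and the range of n are assumptions made for illustration.

```python
ORDER_STEP = 0.727  # cm^-1, the step quoted in notes 27, 30 and 31


def order_candidates(nu, n_max):
    """Return (n, nu + n * ORDER_STEP) for n = -n_max ... +n_max.

    Hypothetical helper: the paper only states that nu may differ from the
    given value by n times 0.727 cm^-1, with n an integer or zero.
    """
    return [(n, round(nu + n * ORDER_STEP, 3)) for n in range(-n_max, n_max + 1)]


# Note 27: the line given as 16458.03, with n "probably about 5".
candidates = order_candidates(16458.03, 5)
for n, value in candidates:
    print(f"n = {n:+d}: {value:.3f} cm^-1")
```

Listing both signs of n is a choice made here because note 27 does not say in which direction the displacement lies.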


13. Comparison with Richardson and Williams' results for bands ending on 2p ³Π

This paper will end by comparing the splittings (Δν) and the intensity ratio
I(a/b) of the fine-structure components of lines of the bands ending on the 2p ³Π
state as now found (table 3) with the values estimated by Richardson & Williams
(1931). There are only six lines, all belonging to the system 3d ³Σ → 2p ³Π, and the
data are set out in table 4.


                                  Table 4

                          R. & W. (1931)         F. & R. (1944)
  allocation and          Δν        I(a/b)
  wave number             about     about        Δν        I(a/b)     remarks

  0→0 bands
  Q1  16854.86 (10)       0.21      3            0.194     2.5        U means insufficiently
  Q3  16654.24 (9)        0.17      U            0.173     2.5        resolved for a reliable
  R2  16834.27 (10)       0.11      U            0.114     4, U       estimate to be made
  R4  16710.63 (6)        U         U            0.06 U    ca. 1

  1→1 bands
  Q1  16603.20 (9)        0.21      5/3          0.196     2.5
  R2  16584.72 (10)       0.08      U            0.118     5





The values in table 4 represent complete agreement within the limits for which
reliance could be placed in the estimates made from the 1931 observations. Too
much confidence should not be placed in the last decimal figure in the 1944 results
whether for Δν or I(a/b), but some lines are better than others. Some guidance can
be got from the figures in the tolerance column of table 3 with regard to this.

We wish to express our thanks to the Department of Scientific and Industrial 
Research and to the Royal Society for the provision of financial assistance which 
enabled this work to be carried out. 


References 

Barrell, H. & Sears, J. E. 1939 Phil. Trans. A, 238, 32.

Drinkwater, J. W., Richardson, Sir Owen & Williams, W. Ewart 1940 Proc. Roy. Soc. A,
174, 184.

Gale, H. G., Monk, G. S. & Lee, K. O. 1928 Astrophys. J. 67, 89.

Kapuscinski, W. & Eymers, J. G. 1929 Proc. Roy. Soc. A, 122, 58.

Meggers, W. F. & Peters, C. G. 1918 Bull. U.S. Bur. Stand. 14, 697.

Merton, T. R. & Barratt, S. 1922 Phil. Trans. A, 222, 369.

Richardson, O. W. 1934 Molecular hydrogen and its spectrum. New Haven: Yale University
Press; London: Oxford University Press.

Richardson, O. W. & Williams, W. E. 1931 Nature, 127, 481.

Schuler, H. 1931 Z. Phys. 72, 423.

Williams, W. E. 1931 Nature, 127, 816.

Williams, W. E. 1933 Proc. Phys. Soc. 45, 699.

Williams, W. E. & Gogate, D. V. 1938 Proc. Roy. Soc. A, 167, 509.

Williams, W. E. & Middleton, A. 1939 Proc. Roy. Soc. A, 172, 159.



The fine structure of the lines of the H₂ spectrum
which go down to the 2p ³Π states

By E. W. Foster and Sir Owen Richardson, F.R.S. 

{Received 5 December 1944) 

The echelon data tabulated and described in the preceding paper, referred to here as Paper I,
are used to investigate the fine structure of the H₂ band lines discovered by Richardson &
Williams in 1931. Practically all the lines with resolved fine structure have 2p ³Π as their
lower states and the number of such resolved lines is now very greatly increased.

The new information is applied in the first instance to the critical examination and improvement
of the existing tables of the (composite) 2p ³Π level differences. These are a basic tool
in the elucidation of this spectrum. One result of this part of the investigation is that the
pairs of lines from which each composite level difference is derived divide themselves into two
groups which have slightly different values.

The fine structure shows itself visually as a resolution of the lines hitherto regarded as
single into two components which we denote by a and b, a having the higher frequency and
intensity. The differences for a or b are considered separately and shown to be different from
each other and from the differences of the composite lines. A number of regularities among
these new differences are pointed out.

The largest doublet separation Δν observed is about 0.22 cm.⁻¹ and the smallest 0.06 cm.⁻¹.
The lines (which go down from and II, A to are divided into a ‘regular* 

and an 'irregular' group. Lines of the regular group have a larger Δν and the intensity
ratio of a to b is nearly constant and close to 2.4, whereas for the irregular group this ratio
falls from about 5 to about 1 as K″ increases from 1 to 4.

The regular group consists of all the lines which go down to and the irregular group

of all which go down to 2p^n^, 

The greater part (about 90 %) of the doublet separations arises from fine structure splitting
of the lower 2p ³Π levels. When effects arising from the upper levels are eliminated it is found
that the splitting of both sets of 2p ³Π levels diminishes as K″ increases and at an increasing
rate at higher K″.

In general these fine structures are closely parallel with those of He₂ discovered by Monk &
Mulliken in 1929. The data cover a wider range of states and fill in the gaps left by the
'missing lines' in the He₂ spectrum. There are, however, some differences outstanding which
seem to call for further investigation.


1. The rotational wave-number differences

The bands of the ^ctrum which have the lower triplet states 2 p (or 2 p ®i7^, 
an alternative notation) and %p were discovered by Richardson in 

1929 (see Richardson 1929, 1930; Richardson & Davidson 1931; and Sandeman 
1932). They are distinguished by the fact that in the really strong systems in which 
the transitions start from the complexes. 3^ and 3, 4 d ®iTthe intensity is 

almost entirely confined to the transitions of the axial vibrational sequence 0′→0″,
etc. For this reason progress in the analysis and classification of the
lines of these bands has been governed almost entirely by an accurate knowledge
of the rotational wave-number differences and particularly of those such as
Q1 − P2 = R1 − Q2 which are determined solely by the final 2p ³Π states.




This analysis is due to Richardson & Davidson (1931, 1933), Sandeman (1932),
Richardson, Davidson, Marsden & Evans (1933) and to Richardson & Rymer
(1934a, b). It is based almost entirely on the wave-number and intensity tables of
Gale, Monk & Lee (1928), the only ones which are both complete and accurate enough
for the purpose. Now the present experimental work has introduced a new factor
into this business. We find that all the structures in the bands ending on 2p ³Π which
have hitherto been regarded as single lines are, in fact, multiples with at least two
members. The separation of these 'doublets' is not constant, but in most cases the
components are clearly resolved on our plates. Examples illustrating these statements
can be recognized on plate 4 of Foster & Richardson entitled 'The fine structure
of the spectral lines of H₂: measurements with a reflexion echelon', and
lists of such examples are given in the footnotes on p. 159 of that paper which we
shall refer to subsequently as Paper I. It follows that the wave-number differences
such as Q1 − P2, etc., which we have been using, are not really numbers characteristic
of the differences of certain energy levels of the states 2p ³Π but are differences of
sums of the energies of at least two such states, weighted in proportions determined
by intensity factors in a way which is irrelevant and disturbing to the elucidation
of the energy-level differences which should be the really constant quantities.
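
The constancy of differences of the Q1 − P2 = R1 − Q2 type can be checked directly against tabulated wave numbers. The sketch below is an illustration only: it takes three 0→0 pairs from column 4 of table 1 (Gale's wave numbers) and compares their differences. The third wave number is read as 17572.31, consistent with the tabulated mean 17572.303; the dictionary labels are hypothetical names introduced here.

```python
# Gale wave numbers (cm^-1) for three 0->0 line pairs, as read from table 1, column 4.
# Per the text, differences of this type are determined solely by the final state.
pairs = {
    "Q1 - P2 (first pair)": (16854.86, 16734.20),
    "Q1 - P2 (second pair)": (17212.01, 17091.29),
    "R1 - Q2": (17572.31, 17451.59),
}

# Difference of each pair, rounded to the two decimals of the source data.
differences = {label: round(upper - lower, 2) for label, (upper, lower) in pairs.items()}

# Spread between the largest and smallest difference of the unresolved lines.
spread = round(max(differences.values()) - min(differences.values()), 2)

for label, value in differences.items():
    print(f"{label}: {value:.2f} cm^-1")
print(f"spread: {spread:.2f} cm^-1")
```

A spread of a few hundredths of a wave number between nominally equal differences is the kind of residual that the composite, intensity-weighted nature of the unresolved lines would be expected to produce.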

The logical course to pursue in these circumstances would be to abandon the
existing Q1 − P2, etc., differences and to replace them by similar differences of
corresponding components. Apart from the difficulty at this stage of being quite
certain which are the corresponding components, it would seem that the results would
be of no immediate use in furthering the elucidation of the spectrum, as even
now all the available experimental data relate to what are very probably incompletely
resolved lines. For these reasons we have postponed this task for the time
being and adopted a course more likely to be of immediate service in the reduction
of the spectrum.

The work of Richardson & Rymer (1934a, b) shows that there are band systems
ending on 2p ³Π in the extreme violet as well as in the red and near-red part of the
spectrum here dealt with, and it is likely that there are numerous band systems ending
on 2p ³Π which are still undiscovered in various parts of the spectrum. Even in the
near-red, some of the few still unclassified lines look like 2p ³Π 'doublets'. If

^ A resolved, are constant enough for reducing 

data obtained with the same resolution, even if they are not strictly constants at all

systematic variations of an ascertainable nature, that fact might well prove an
important clue to the interpretation of the fine structure of the lines

thf 3 “^bers of the doublet constituents and measured 

dlXZ bne with tTe 

doublets unresolved can be calculated. 

However, we cannot strictly claim to have measured our intensity ratios directly.
In few instances the strengths of the a and b components were photographically
the same. In these cases the equality of the blackening was checked using a
microphotometer (designed by W. H. J. Childs). Knowing the approximate position of
the components relative to the position of the maximum of the intensity envelope
and knowing the form of the envelope, the intensity ratio could be calculated. In
the majority of cases the photographic intensities of a and b were not equal. In
these cases the intensity ratio was estimated visually (with or without
microphotometer traces) and a correction was applied for the envelope effect. In several
such cases where photographic intensities were nearly the same the final corrected
intensity ratios are probably fairly close to absolute values. In other cases the
determinations suffer more from the shortcomings of visual intensity estimation.
However, since the separation of all the doublets is of the same order the error in
the wave number of the 'equivalent line' is largely offset (in these cases where the
intensity ratio is comparatively large) by the comparatively small weight to be
attached to the weaker component.

In view of all these considerations we decided to redetermine the 'constant
differences' of the Q1 − P2 type by building up the composite lines from our intensity
and wave-number data for the resolved constituents. In doing this, we expected to
get more accurate determinations not only on account of our more accurate measures
but also because our superior resolution enabled us to eliminate the effects of numerous
blends. In many cases their actual presence had been only a theoretical inference,
not an observational fact, and the extent of the interference was often uncertain
as the relative intensities of the blended lines were only roughly calculable.
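
The building up of a composite line from its resolved constituents can be sketched as an intensity-weighted mean of the a and b wave numbers. The example below is an illustration under that assumption; the function name `composite_wave_number` is hypothetical. It uses the first 0→0 pair of table 1: Q1 with components 16854.870 and 16854.676 taken in the ratio 5/2, and P2 with components 16734.155 and 16734.015 taken in the ratio 4/1 (the ratio listed for calculation).

```python
def composite_wave_number(nu_a, nu_b, weight_a, weight_b):
    """Intensity-weighted mean of the two components of a doublet (cm^-1)."""
    return (weight_a * nu_a + weight_b * nu_b) / (weight_a + weight_b)


# Q1: a = 16854.870, b = 16854.676, intensity ratio a:b = 5:2.
q1 = composite_wave_number(16854.870, 16854.676, 5, 2)

# P2: a = 16734.155, b = 16734.015, intensity ratio a:b = 4:1.
p2 = composite_wave_number(16734.155, 16734.015, 4, 1)

# The 'constant difference' rebuilt from the resolved constituents.
difference = q1 - p2

print(f"Q1 composite: {q1:.3f} cm^-1")
print(f"P2 composite: {p2:.3f} cm^-1")
print(f"Q1 - P2:      {difference:.3f} cm^-1")
```

These values agree to within 0.001 cm.⁻¹ with the mean wave numbers 16854.814 and 16734.127 and the difference 120.687 listed in table 1, which suggests, without proving, that the tabulated means were formed in this way.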

The reason for introducing this examination and revision of the existing differences
at the outset is not because we believe it is the most important result of the
paper but because it is also a convenient, compact and critical way of assembling
data necessary to establish other conclusions arrived at.

A knowledge of the notation used in connexion with the bands ending on 2p ³Π
wiU be assumed. It is given in Richardson & Davidson (i93^> P* ^ 

Richardson (1934, chapter VIII). The alternate weaker lines, a characteristic feature
of all the bands which arise from transitions between the s levels, are distinguished 
by dashes, as Q' 1, P' 2, etc. 

The data for the undashed lines, such as R0, Q1, P2, etc., are given in table 1
and those for the dashed lines in table 2. There are no combinations between dashed 
and undashed lines. The following notes will probably be helpful in interpreting the 
tables. 

As already pointed out above, the first use we shall make of the material in 
tables 1 and 2 is to recompute the rotational differences of the 2 p levels using the 
new echelon data. The results for the a terms are given in table 3 and those for the 
s terms in table 4. 

It is difficult to make general statements about the accuracy of the values given
in these tables. It is doubtful if it is as high as the agreement of many of the figures
might suggest. In the first place, it often seems higher than the tolerance figures
in tables 1 and 2 would lead us to anticipate. In the second place, individual
agreements are often surprisingly good and in other cases unexpectedly poor. However,
it seems certain that they are better in general than the old values got from the
unresolved composite lines, and that it will be safe to substitute tables 3 and 4 for
the corresponding parts of tables LXXVI and LXXVII of Richardson (1934).
In doubtful cases, probably the best course would be to compromise by averaging
the two values.

Fewer term differences are represented in tables 3 and 4 than in the old tables.
This is mainly because many of the weaker lines do not appear in the echelon
patterns.

An examination of table 3 suggests quite definitely that the frequency differences
Q(m) − P(m+1) are not exactly equal to the differences R(m) − Q(m+1).
The differences between the mean values of the Q(m) − P(m+1) and those of the
R(m) − Q(m+1) are larger than what one would expect to arise from the effect of
the chance errors which operate within the individual groups. There is also evidence
of a systematic change in the differences of the weighted means with K values in
each band and with vibrational frequency on passing from one band to another.
On the whole, the evidence for the reality of these differences is greatest where the
observational data seem to be most satisfactory.

Apart from inferior resolution which might obscure them, there is no reason why 
these differences should not be found in the data in the older tables, such as Richardson
(1934). The means from these tables are set alongside those of δν_F from tables
3 and 4 in table 5. The close resemblance between the differences of the and 
throughout table 5 can hardly be entirely accidental. 

Having compiled these figures, it is of interest to see whether there is any evidence 
of systematic difference between 8v;p, and From the data in table 5 we have 
deduced the following information about the values of for the 

Q(m) − P(m+1) and R(m) − Q(m+1)

differences there are fifteen negative values, total −0.4255 cm.⁻¹, average
−0.0284 cm.⁻¹, and seventeen positive values, total +0.6858 cm.⁻¹, average
+0.0403 cm.⁻¹; the average of all thirty-two of these values is +0.00813. For the
Q′(m) − P′(m+1) and R′(m) − Q′(m+1) differences there are nine negative values,
total −0.314, average −0.0349, and ten positive values, total +0.231, average
+0.0231, the average of all nineteen values being −0.00437. The average of all the
fifty-one values is +0.00347.
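
The averages quoted above follow from the counts and totals given in the same paragraph, and the arithmetic can be verified as follows. This is a checking sketch only; the variable and function names are hypothetical, and the input figures are those stated in the text.

```python
# (number of values, total in cm^-1) for the negative and positive groups in the text.
undashed = [(15, -0.4255), (17, 0.6858)]
dashed = [(9, -0.314), (10, 0.231)]


def count_and_mean(groups):
    """Combine (count, total) groups into an overall count and mean."""
    count = sum(n for n, _ in groups)
    total = sum(t for _, t in groups)
    return count, total / count


n_undashed, mean_undashed = count_and_mean(undashed)
n_dashed, mean_dashed = count_and_mean(dashed)
n_all, mean_all = count_and_mean(undashed + dashed)

print(f"undashed: {n_undashed} values, mean {mean_undashed:+.5f} cm^-1")
print(f"dashed:   {n_dashed} values, mean {mean_dashed:+.5f} cm^-1")
print(f"all:      {n_all} values, mean {mean_all:+.5f} cm^-1")
```

The recomputed means agree with the quoted +0.00813, −0.00437 and +0.00347 to within about one unit in the last figure.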

The only evidence of systematic variation in these figures is the positive average 
value for the undashed and the negative average value for the dashed line differences. 
This is probably caused by the fact that the undashed lines are on the average three 
times as intense as the dashed lines. The small positive residue for the whole fifty-one 
pairs of lines is about what would be expected from the larger number of undashed 
pairs. 

If the apparent difference between the final terms of the lines coming from the 
P, Af. levels on the one hand and the levels on the other is real, it should 




throw valuable light on the selection rules governing the fine-structure components
of 2p ³Π. If there were no fine structure such a difference would appear to be impossible,
as has been assumed hitherto in the analysis of the levels. However,
the observational evidence we have so far presented cannot be considered as conclusive,
so we shall postpone the further consideration of this issue until more evidence
bearing on it has accumulated.

2. The final (Q(m) − P(m+1), etc.) a and b differences considered separately

So far we have been concerned principally with the unresolved lines as measured 
by previous observers or with the corresponding equivalents built up from our 
measures of the wave numbers of the resolved constituents and our estimates of 
their relative intensities. Our main object in this has been to eliminate the effects 
of overlapping lines which it has not hitherto been possible to allow for. This has
been done largely with a view to increasing the reliability of existing tables as tools 
for future use. 

When facing the ultimate interpretation of the data, one would expect to reach
the goal more rapidly by operating with the a and b constituents separately rather
than with the composite lines out of which they are built up. In any event, one can
hardly hope to obtain a satisfactory comprehension of the whole without a good
understanding of the operation of the parts. We have, therefore, extracted from
tables 1 and 2 the differences Q(m) − P(m+1) for the a (higher frequency, more
intense) and b components respectively. These are set out in table 6 (undashed lines)
and table 7 (dashed lines). Only the decimal parts of the wave-number differences
are given; the integral parts are the same as those for the composite lines (tables
1 and 2).

In assessing the data in tables 6 and 7, it is to be remembered that they refer to
pairs of lines which are not arbitrary, but the choice of one determines that of the
other. This limitation greatly reduces both the number and accuracy of the data.
A line measured very accurately may be quite useless because its partner is, for one
of a variety of reasons, incapable of measurement. On the other hand, any 'bad'
line destroys nearly all the virtue of a 'good' line with which it is associated. The
reliability of any pair is little better than that of the less reliable component.
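
The remark that a pair is little better than its less reliable member can be illustrated numerically. The sketch below assumes that the tolerances of two lines behave as independent errors and combine in quadrature; this rule is an assumption made here, since the paper states no formula. The tolerances are taken from table 1 (±0.004 and ±0.005 for the a and b components of Q1 16854.86, ±0.04 for the a component of P2 16734.20), and the function name is hypothetical.

```python
import math


def pair_tolerance(tol_first, tol_second):
    """Tolerance of a difference of two lines.

    Assumes independent errors added in quadrature (an assumption for
    illustration; the paper gives no combination rule).
    """
    return math.hypot(tol_first, tol_second)


# Two well-measured components: the pair is nearly as good as either one.
good_with_good = pair_tolerance(0.004, 0.005)

# A well-measured component paired with a poorly measured one.
good_with_bad = pair_tolerance(0.004, 0.04)

print(f"good with good: +/- {good_with_good:.4f} cm^-1")
print(f"good with bad:  +/- {good_with_bad:.4f} cm^-1")
```

On this assumption the second pair is scarcely distinguishable from the ±0.04 of its weaker member, whatever the accuracy of the better line.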

From examination of tables 6 and 7, the following conclusions may be drawn:

(1) The a differences are in general not the same as the b differences.

(2) The Q(m) − P(m+1) differences are in general not the same as the R(m) − Q(m+1)
differences, either for the a or b components or for the combination of them which
makes up the composite line hitherto observed.

This statement about the combination also confirms the conclusion already drawn
from the data for the unresolved lines.
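
Conclusion (1) can be illustrated with a single pair. The sketch below forms the Q1 − P2 difference separately for the a and b components of the first 0→0 pair in table 1 (Q1: 16854.870 and 16854.676; P2: 16734.155 and 16734.015). The variable names are hypothetical, and one pair is of course only an example, not a test of the general statement.

```python
# Resolved components (cm^-1) of the first 0->0 pair, as read from table 1.
q1 = {"a": 16854.870, "b": 16854.676}
p2 = {"a": 16734.155, "b": 16734.015}

# Q1 - P2 formed separately for each component.
diff_a = q1["a"] - p2["a"]
diff_b = q1["b"] - p2["b"]

# Amount by which the a difference exceeds the b difference.
gap = diff_a - diff_b

print(f"a difference: {diff_a:.3f} cm^-1")
print(f"b difference: {diff_b:.3f} cm^-1")
print(f"a minus b:    {gap:.3f} cm^-1")
```

The tolerances quoted in table 1 for the two P2 components (±0.04 and ±0.05) are comparable with this gap, so the pair shows how the comparison is made rather than establishing the effect by itself.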

(continued on p. 199)





1 2 3 


transition 

to 2^3 ®iT branch K' 

Q I 

P 2 

R 1 

P 3 

Q 1 

Bd^Uc 

P 2 

R 1 

Bd^A^ 

Q 2 

R 2 

Bd^P 

Q 3 

R 2 

Bdm^ 

Q 3 

Q 2 

P 3 

R 2 

Q 3 

R B 

Bdm^ 

Q 4 

Q 3 

7P 4 

P 3 

<9 4 

Q 3 

Bd^Ac 

P 4 

P 4 

scz^r 

Q 5 


4 

wav© 
number 
V(? 

16854-86 
16734-20 
17199-28 
16898-48 
17212-01 1 

17091-29 8 

17672-31 10 

17451-59 10 

16834-27 10 

16654-24 9 

17441-32 8 

17261-34 4 

17461-59 10 

17271-56 4 

17717-06 Sa 

17537-02 4 

17199-28 10 

16961-69 0 

17261-34 4 

17023-67 1 

17765-17 3 

17527-44 4 

17537-02 4 

17299-30 4 

16710-63 6 

16416-35 2 


K 

component 

37-5 

a 

b 


11-0 



44 

as 

K 


16-3 

&2 

{B) 

(a) 


?a 



?6 


17-0 

a 

b 


34-6 

a 

b 


30-7 

a 

b 


37-7 

a 

b 

(m) 

18-1 

a 

b 


19*6 

a 

b 


17-1 

a 

b 


30-7 

a 

b 


13-9 

a 

b 


5-9 

a 

b 


9-6 

a 

b 


44 

ai 


17-1 

a 

b 




(? 0 ) 


h 

(?m) 

5 

a 

b 


11-1 

a 

b 


9-6 

a 

b 


14-6 

a 

b 


21-8 

a 

b 



8 9 


decimal 
of wave 
number 

tolerance 

Vjp 

± 

•870 

0-004 

•676 

0-005 

•155 

0-04 

•OI 5 

0-05 

•228 

0-008 

•012 

0-005 

•432 

0-03 

•255 

0-04 

•012 

0-04 

1-827 

0-04 

•277 

0-01 

•105 

0-01 

•364 

0-006 

•151 

0-015 

•593 

_ 

•444 

0-006 

•273 

0-015 

•159 

0-03 

•253 

0-006 

•080 

0-006 

•347 

0-006 

•178 

0-01 

•310 

0-015 

*159 

0-015 

•593 

_ 

•444 

0-006 

•552 ‘ 

0-01 

•408 

0-02 

abs. 

_ 

abs. 

— 

7-023 

0-03 

6-807 

0-03 

•388 

0-005 

•228 

0-008 

abs. 

_ 

abs. 

— 

•310 

0-015 

•159 

0-015 

•63 

0-02 

•54 

0-04 

abs. 

_ 

abs. 

— 

U. 


7-023 

0-03 

6-807 

0-03 

U. 

— 

U, 

0-006 

abs. 

_ 

abs. 

— 


Table 1. Accuracy of combinations of lines

0→0 BANDS

5 6 7 

intensity 
G 
10 

4 
10 

5 



ending on 2p ³Π. (a) 'Strong lines' (undashed)


0→0 BANDS


10 

11 

12 

13 

14 

15 

16 

17 

18 



weight 

intensity- 

weight 




intensity 



of 

ratio 

of 




ratio (for 

mean wave 

/iv 

Jp 

(observed) 

Ha/b) 



dvs 

calculation 

number 

(splitting) 

figure 

/(«/ 6 ) 

figure 

dVB 

Svjp 

standard 

ofmeanvjy) 

16854*814 

0-194 

6 

5/2 

3 




obs. 






120-66 

120-687 

120-741 


16734-127 

0-14o 

1 

?2/l 

0 




4/1 

17199-166 

0-216 

4 

[6/2] 

2 




5/2 






300*80 

300-789 

300*777 


16898-382 

0-177 

3 

6/2 

2 




6/2 

17211*959 

?0-186 

1 

?2/l 

1 




5/2 






120*72 

120*716 

120*741 


17091*243 

0-172 

3 

4/1 

3 




obs. 

17572*303 

0-213 

3 

5/2 

2 




obs. 






120*72 

120-737 

120*741 


17451*566 

0-149 

3 

?4-6/l 

1 




obs. 

16834-250 

0-114 

3 

4/1 

3 




obs. 






180-03 

180-046 

180*036 


16654-204 

0*173 

6 

6/2 

2 




obs. 

17441*313 

0*169 

2 . 

5/2 

0 




4/1 






179-98 

180*046' 

180*036 


17261*267 

0*161 

3 

6/2 

2 




obs. 

17461*666 

0-149 

3 

?4-6/l 

1 




obs. 






180-08 

180*066 

180-036 


17271*611 

0-144 

3 

5/2 

1 




obs. ^ 

Ovi'17717’04: 

_ 

_ 

_ 

— 




— 






180*04 

180*08 

180-036 


17636*961 

0-216 

2 

5/2 

1 




obs. 

17199*342 

0-160 

4 

[6/2] 

1 




— 






237-69 

237-672 

237*674 


€^vj.l6961*67 

— 

— 

— 

— 





17261-267 

0-151 

3 

6/2 

2 




— 






237*67 

237-682 

237-674 


17023-585 

0-09 

1 

?1/1 

1 




obs. 

Gfvj.17765-15 

__ 

_ 

— 

— 




— 






237-73 

237-737 

237-674 


17527-413 

?0-08 

1 

?1*5/1 

1 




o.c. 

17636-961 

0-216 

2 

5/2 

1 










237-72 

237-694 

237-674 


17299-267 

0-08 

3 

?1/1*5 

2 




o.c. 

16710*599 

0-06 

3 

1/1 

3 




o.c. 






294-28 

294*269 

294-239 


Gfi;j.l6416*33 


— 

— 

— 




— 



1 

2 

0→0 BANDS

3 4 6 6 

7 

transition 
to 2p^II 

branch 

wave intensity 

K" ve G K 

component 


Q 

4 16961-69 0 — 

z 


P 

6 16667-33 1 — 

— 


Table 1 


8 9 

decimal 
of wave 

number tolerance 

VJS> + 

abs. — 

abs. — 

abs. — 

abs. — 


zdm. 

R 

4 

17647-00 

2 a 


5 

17252-66 

2 


Q 

4 

17527-44 

4 


p 

6 

17233-19 

Zh 


R 

4 

17986-10 

0 

Q 

5 

17691-96 

0 


R 

5 

17179-60 

3 

Q 

6 

16831-97 

0 


R 

5 

17940-62 

1 

Q 

6 

17593-01 

3 


Q 

6 

16831-97 

0 

P 

7 

16431-36 

0 


Q 

6 

17593-01 

3 

P 

7 

17192-31 

6 


60 


11-0 

a 

-639 

0-03 

?6 

*49 

0-05 

11-1 

a 

h 

XJ. 

0-02 

10-3 


abs. 

___ 

— 

abs. 

— 



abs. 

_ 

— 

— 

abs. 

— 

9-1 

_ 

abs. 

_ 


— 

abs. 

— 

_ 

„ 

abs. 

— 

3-1 

— 

abs. 

■— 

_ 

___ 

abs. 

_ 

— 

— 

abs. 

— 

3-1 

—— 

abs. 

— 

22-9 

— 

abs. 

— 


Zs^L 

3d»27 


1→1 BANDS



1 

16917-50 

3 

10*9 

ai (A) 

h (J5) 

•639 

•346 

0*02 

0*02 

P 

2 

16802-37 

10 

52*0 

a 

h 

■361 

-211 

0-016 

0-02 

Q 

1 

16603-20 

9 

27*0 

a 

h 

-247 

■051 

0-006 

0-008 

P 

2 

16488-15 

8 

30*6 

6 i 

8-056 

7-95 

0-03 

0-04 

R 

1 

16978-52 

10 

28-0 

a 

b 

-548 

-343 

0-006 

0-006 

P 

3 

16692-00 

5 

14-0 

a 

b 

2-032 

1-886 

0-01 

0*015 

Q 

1 

16988-72 

8 

22-7 

a* (A) 

(B) 

■636 

•415 

0-02 

0-02 

P 

2 

16873-46 

9 

39-4 

a 

b 

-453 

-271 

0*016 

0*01 

R 

1 

17426-25 

10 

40-3 

a 

b 

•276 

-086 

0*004 

0-006 

Q 

2 

17311*08 

9 

37-0 

a 

1b 

1-078 

•940 

0*02 

R 

2 

17086-67 

2 

9-2 

la 

b 

•70 

-475 

0*06 

0*02 

Q 

3 

16915-33 

3 

13-0 

tta (A) 

62 (B) 

-368 

-165 

0*02 

0-02 

R 

2 

16584*72 

10 

34-7 

a 

b 

•729 

• 6 U 

0-01 

0-02 

Q 

3 

16413*37 

7 

26-1 

a 

b 

-425 

-243 

0-025 

0-02 


3d»ir 



(continued)


10 

11 

12 

weight 

of 

mean wave 

Av 

Av 

number vf 

(spbtting) 

figure 

(?i;,16961-67 

— 

“““ 

©vj'16667‘31 

— 

— 

0ii;,17646-98 

— 

— 

17252-60 

?0-15 

0 

17627-413 

— 

— 

Gii-,17233-17 

— 

— 









— 

— 

— 



_ 


0→0 BANDS

13 14 16 

intensity weight 

ratio of 

(observed) I(a/&) 

Ha/b) figure Svb 

294-36 


294-35 

?2/l 1 


294-25 


294-14 

347-53 

347-61 

400-61 

400-70 


16 

17 

18 



intensity 



ratio (for 


Svr 

calculation 

dVff 

standard 

of mean yj) 

— 

294-239 

— 

294-38 

294-239 

— 



6/2 

294-24 

294-239 

_ 

— 

294-239 

t 

— 

347-60 

— 

— 

347-60 

— 

— 

J400-63 

— 

_ 

1400-63 

— 


1→1 BANDS


16917-484 

0-20* 

1 

26/2 

1 

116-13 

116-144 

116-140 

16802-340 

0-160 

3 

4r-6/l 

4 




16603-191 

0-196 

5 

6/2 

2 

116-05 

116-161 

115-140 

16488-030 

0-106 

2 

3/1 

2 




16978-489 

0-206 

6 

6/2 

2 

286-62 

286-499 

286-479 

16691-990 

0-146 

4 

6/2 

1 




16988-572 

0-220 

2 

22/1 

0 

115-26 

116-149 

116-140 

16873-423 

0*182 

3 

6/1 

3 




17426-222 

0-190 

6 

1-9/1 

5 

115-17 

116-163 

116-140 

17311-069 

0-138 

2 

26/1 

1 




17086-662 

20-225 

0 

23/1 

0 

171-34 

171-369 

171-339 

16916-303 

0-18* 

1 

26/2 

1 




16684-709 

0-118 

3 

5/1 

4 

171-36 

171-336 

171-339 

16413-373 

0-182 

2 

6/2 

2 





obs. 

6/1 

obs. 

obs. 

obs. 

obs. 

5/2 

obs. 

6/2 

obs. 

5/1 

obs. 

obs. 

obs. 




1 

transition 

to 

Zd^Aa 

Zd^Ac 

3s®r 

Zd^a 

sd^n, 

Zd^A^ 

Zd^H 

Zdm^ 

Zd^A^ 

3i»Z 




Table 1 


1→1 BANDS


2 

3 

4 

5 

6 

7 

8 

decimal 

9 

branch 

K'^ 

wave 

number 

V0 

intensity 

Q K 

component 

of wave 
number 

tolerance 

± 

R 

2 

17218-50 

8 

23-0 

a 

h 

•494 

•376 

0-015 

0-03 

Q 

3 

17047-15 

4 

15-0 

a 

h 

•189 

•019 

0-005 

0-01 

Q 

2 

17311-08 

9 

37-0 

a 

?6 

1*078. 

0-940 

0-02 

P 

3 

17139-77 

5 

17-6 

o {») 

6 (y) 

771 

•599 

0-01 

0-03 

R 

2 

17529-00 

5 

12-0 

a 

1b 

8-992 

abs. 

0-02 

Q 

3 

17357-64 

6 

15-6 

a 

b 

•693 

■493 

0-02 

0-02 

Q 

3 

16916-33 

3 

13-0 

Oa (-4) 

&a (S) 

•358 

•165 

0 02 
0-02 

P 

4 

16688-40 

4 

19-0 

— (r) 

•373 

0-02 

R 

3 

16990-71 

6 

21-3 

% (6) 

6i (?*) 

-711 

•50 

0-05 

0*05 

P 

5 

16484-00 

1 

— 


abs. 

abs. 

— 

Q 

3 

17047-15 

4 

15-0 

a 

b 

•189 

•019 

0-005 

0-01 

P 

4 

16820-29 

2 

14-0 

ti "" 

? broad 

0-06 

R 

3 

17591-22 

4 

7-1 

(6) 

h 

■222 
? -07 

0-03 

0-06 

Q 

4 

17364-30 

5 

11-7 

a 

b 

-350 

•216 

0-02 

0-02 

Q 

3 

17357-64 

6 

15-6 

a 

b 

•693 

-493 

0 02 
0-02 

P 

4 

17130-78 

Sot 

— 


[ *78] 

[ *66] 

— 

R 

4 

16469-93 

8 

22-6 

a 

b 

•997 

•897 

0-02 

0-02 

Q 

5 

16190-17 

3 

— 

_ 

abs. 

abs. 


R 

Q 

4 

5 

17335-25 

17055-50 

2 

1 

8-9 


abs. 

abs. 

_ 

Q 

4 

17364-30 

5 

11-7 

a 

b 

•350 

•216 

0-02 

0-02 

P 

5 

17084-56 

2 

9-8 

?a 
• b 

•56 

•48 

0-06 

0-03 

R 

Q 

4 

5 

17739-54 

17459*76 

1 

2 

— 

— 

abs. 

abs. 

— 




2→2 BANDS




Q 

1 

16297-25 

5 

22-3 

a (r) 

h {q) 

•294 

•137 

0-03 

0-05 

P 

2 

16187-50 

1 

— 

^2 (? 2 /) 

62 («) 

•556 

•383 

0-03 

0-04 

R 

1 

16730-55 

10 

12-0 

a 

b 

•709 

•520 

0-01 

0-04 

P 

3 

16458-03 

5 

20-8 

a 

b 

8-040 

7-886 

0-004 

0-006 






(continued)












1→1 BANDS




10 

11 

12 

13 

14 

15 

16 

17 

18 



weight 

intensity 

weight 




intensity 



of 

ratio 

of 




ratio (for 

mean wave 

Av 

Av 

(observed) 

I(a/6) 



Svr 

calculation 

nmnber vp 

(splitting) 

figure 

/(a/6) 

figure 


dvp 

standard 

of mean vj*) 

17218-455 

0-118 

2 

2/1 

0 

171-35 

171-315 

171*339 

obs. 

17047*140 

0-170 

4 

5/2 

2 




obs. 

17311-059 

?0-138 

2 

?6/l 

1 

171-31 

171-337 

171-339 

obs. 

17139-722 

0-172 

2 

?5/2 

0 




5/2 

17528-967 

[0-15] 

0 

>4/1 

0 

171*36 

171-331 

171-339 

6/1 

17357-636 

0-200 

2 

6/2 

1 




obs. 

16915-303 

0*18* 

1 

?6/2 

1 

226-93 

226-930 

226-909 

— 

16688-373 

— 

— 

— 

— 




— 

16990-658 

?0-21 

0 

3/1 

0 

606*71 

606-676 

506-679 

obs. 

(?yj.l6483-975 

— 

— 

— 

.— 




— 

17047*140 

0*170 

4 

6/2 

2 

226*86 

226*90 

226-909 

— 

16820*24 

?0-12 

1 

?1/1 

0 




o.c. 

17691*184 

?0-152 

1 

?3/l 

0 

226*92 

226*901 

226-909 

obs. 

17364-283 

0*134 

3 

?1/1 

0 




obs. 

. 17357-636 

0-200 

2 

6/2 

1 

226*86 


226-909 

— 

[17130-727] 

(?i;j-17130-76 

[0-121 







— 

16469-942 

0*10o 

3 

4/5 

5 

279*76 

279-792 

279*770 

obs. 

6?M6910-15 

— 

— 

— 

— 




— 

— 

— 

— 

— 

— 

279*76 

— 

279*770 

— 

17364-283 

0-134 

3 

?1/1 

0 

279*74 

279-763 

279*770 

obs. 

17084-52 

?0-08 

1 

?1/1 

0 




obs. 

— 

— 

— ' 

— 

— 

279-78 

— 

279*770 

— 




2→2 BANDS




16297*249 

0*157 

1 

?3/l 

0 

109*75 

109-728 

109*770 

5/2 

16187-521 

0*173 

1 

?4/l 

0 




obs. 

16730-655 

0*189 

3 

?5/2 

0 

272-52 

272*659 

272*59o 

6/2 

16457-996 

0*154 

5 

’ 5/2 

2 




obs. 





2→2 BANDS


1 

2 

3 

4 

6 

6 

transition 

to 

branch 

K'^ 

wave 

number 

VO 

intensity 

Q K 


Q 

1 

16684*88 

1 

— 

3d»il, 

P 

2 

16676*12 

5 

24*5 


R 

1 

17279*01 

7 

46*4 


Q 

2 

17169*22 

10 

37*8 


R 

2 

16295*02 

3 

— 



Q 

3 

16132*09 

5 

18*4 


B 

2 

16897*19 

6 

10*9 



Q 

3 

16734-20 

4 

11-0 


B 

2 

17365-14 

6 

16-2 



Q 

3 

17192-31 

6 

22*9 


Q 

2 

17169-22 

10 

37*8 

Zd^A^ 


F 

3 

17006-43 

10 

30*9 


Q 

3 

16734-20 

4 

11*0 

CO 


P 

4 

16617*89 

6 

16*8 


Q 

3 

17192*31 

6 

22*9 



P 

4 

16976*03 

5 

17*6 


B 

3 

16763*44 

4 

15*0 

Zdm^ 


P 

6 

16271*37 

2 

— 


R 

4 

17013*50 

1 

_ 

Zd ^JIq 

Q 

5 

?16747*70 

1 

— 

Zd^Aa 

B 

3 

17416*51 

3 

_ 

Q 

4 

17200*19 

1 

— 


B 

4 

16192*25 

6 

15*8 

Zd^J: 


Q 

6 

15926*62 

3^ 

— 


Q 

4 

17200*19 

— 

— 

Zd^A. 


P 

5 

16934*41 

1 

9*8 

Zdm, 

Q 

5 

?16747*70 

1 

_ 

P 

6 

?16431*36 

0 

— 

Zd^A^ 

R 

5 

17541*45 

2 

_ 

Q 

6 

17224*94 

1 

— 

Zd^A, 

Q 

5 

?17233*19 

2h 

10*3 

P 

6 

16916*67 

0 

— 

zdm^ 

R 

5 

16746*17 

1 

_ 

P 

7 

16068*21 

2 

— 


Table 1 


7 

8 

9 


decimal 



of wave 



number 

tolerance 

component 


± 

la (x) 

•912 

0*01 

76 (J 3 ) 

•738 

0-04 

a 

5-137 

0-02 

b 

4*97 

0-05 

a 

9*016 

0-007 

b 

8*830 

0-004 

a 

•232 

0-01 

b 

•087 

0*03 

(ff) 

5*03 

0*05 

h (p) 

4*87 

0-05 

a 

2*134 

0-01 

b 

1*968 

0-015 


*226 

0-02 

6. 

abs. 

— 

1ai 

*326 

0*05 

W (r) 

•165 

0-04 

a 

•17 

0-06 

b 

•04 

0-04 

a 

-280 

0-006 

b 

•130 

0-01 

a 

•232 

0*01 

b 

•087 

0*03 


-360 

0*02 

h 

•184 

0-02 

7ai 

•325 

0*06 

h (r) 

•156 

0*04 

a (6>) 

8*024 

0*02 

6 (a,) 

7-906 

0-02 

a 

•280 

0-006 

b 

•130 

0*01 

?}(») 

? broad 

0*02 

a 

•460 

0*005 

b 

•296 

0*016 

_^ 

abs. 

— 

— ’ 

abs. 

— 

_ 

abs. 

— 

— 

abs. 

— 

_ 

abs. 

— 

— 

abs. 

— 


•312 

0*03 

h 

•191 

0*01 


? broad 

— 


abs. 

— 

-^ 

abs. 

— 

(y) 

•463 

0*04 

7b (X) 

*323 

0*04 

_ 

abs. 

— 

— 

abs. 

— 

—— 

abs. 

— 

— 

abs. 

— 

_ 

abs. 

— 

— 

abs. 

— 

. 

abs. 

— 

— 

abs. 

— 



The fine structure of the lines of the hydrogen spectrum 


187 


(continued)

10 

mean wave
number ν_F

16684-862 

16676-104 

17278-962 

17169-208 

16294-99s 

16132-087 

16897-196 

16734-277 

17366-14« 

17192-237 

17169-208 

17006-302 

16734-277 

16617-960 

17192-237 

16976-932 

16763-388 

(5i»j.l6271-35 

16192-264 

?15926-479 

Gi>jl7200-17 

16934-423 


2→2 BANDS


11 

12 

13 

14 

15 

16 

17 

18 

weight 

of 

intensity- 

ratio 

weight 

of 




intensity- 
ratio (for 

Av 

(splitting) 

Av 

figure 

(observed) 

Kalb) 

J(a/6) 

figure 

dvR 

dvp 

Svr 

standard 

calculation 
of mean vj) 

?0-174 

1 

?3/l 

0 

109*76 

109*758 

109*770 

5/2 

obs. 

0*167 

1 

4/1 

2 




0*185 

5 

6/2 

2 

109*79 

109*754 

109*770 

obs. 

obs» 

0*145 

2 

6/1 

1 




0*160 

1 

?4/l 

0 

162*93 

162*911 

162*8o 

4/1 

0*166 

3 

6/2 

2 



obs. 

[0*15] 

0 

>4/1 

0 

162*99 

162*919 

162*88 

4/1 

6/2 

0*170 

1 

[6/2] 

0 




0*13 

1 

4/1 

1 

162*83 

162*907 

162*8o 

obs. 

obs. 

0*150 

4 

6/2 

2 



0*145 

2 

6/1 

1 

162*79 

162*906 

162*8o 

—* 

0*166 

3 

1512 

1 





0*170 

1 

[6/2] 

0 

216*31 

216*316 

216*308 

5/2 

obs. 

0*118 

3 

6/6 

2 



0*150 

4 

6/2 

2 

216*28 

216*305 

216*308 

obs. 

— 

— 

>3/6 

0 




o.c* 

0*154 

3 

3/2, 

2 



482*068 

3/2: 





482*07 

482*038 

— 


_ 

_ 

— 

— 

— 

265*76s 

— 

_ 

— 

— 

— 

— 

— 

216*308 

— 

0*121 

2 

3/2 

1 



265*76s 

obs. 



_ 

— 

265*73 

265*785 

— 

— 

— 

— 

— 

265*78 

266*75 

265*765 

5/2 

0*140 

1 

?2/l 

1 




— 

— 

— 

— 

316*34 

— 

316*55 

— 

_ 

— 

— 

— 

316*51 

1 * 

316*5s 

— 




____ 

316*62 

— 

316*65 

— 


677-96 



rss 

1 

transition
to 2p ³Π

Zdm^ 

U^S 

sdm^ 

um, 

M^Aa 

M^A, 

sd^n^ 

zdm, 

Zd^A^ 

U^E 

Zd^A^ 

Zd^A^ 


2 


branch


P 

R 

P 

Q 

P 

B 

Q 

R 

Q 

Q 

P 

R 

Q 

Q 

P 

R 

Q 

R 

P 

Q 

P 

R 

Q 

Q 

P 

R 

Q 

Q 

P 

R 

Q 




3->3 BANDS 


3 

4 

5 

6 


wave 

intensity 


number 


-A.——^ 

K" 

vg 

0 

K 

1 

15931*94 

2 

*— 

2 

15827*48 

1 

— 

1 

16445*80 

3 

14;6 

3 

16186*63 

3 

15*5 

1 

16388*32 

1 

— 

2 

16283*83 

2 

8*6 

1 

17129*89 

8 

45*2 

2 

17025*41 

6 

25*6 

2 

15949*72 

3 

—* 

3 

15795*05 

2 

— 

2 

16341*29 

0 

— 

3 

16186*63 

3 

15*5 

2 

16579*03 

2 

10*8 

3 

16424*30 

00 

— 

2 

17025*41 

6 

25*6 

3 

16870*70 

2 

10*5 

2 

17187*90 

4 

13*7 

3 

17033*21 

4 

20*2 

3 

16476*10 

7 

32*3 

6 

16018*16 

1 

— 

3 

16424*30 

00 

_ 

4 

16218*29 

— 

— 

3 

17239*70 

2 

9*9 

4 

[17033*66] 

— 

— 

3 

17033*21 

4 

20*2 

4 

16827*14 

2 

7*9 

4 

15862*86 

1^ 

— 

5 

?15610*91 

1 

— 

4 

[17033*66] 

— 

— 

5 

16781*72 

la 

— 

4 

U7282*98 

1 

— 

5 

?17030*88 

1 

_ 


Table 1 


7 

8 

9 


decimal 



of wave 



number 

tolerance 

component 

VjF 

± 

a 

2*015 

0*008 

h 

1*849 

0*02 

a 

*495 

0*03 

b 

abs. 

— 

a 

•823 

0*05 

b 

*699 

0*04 

% (“) 

*656 

0*03 

6a (6) 

*624 

0*03 

a 

*366 

0*02 

?6 

*155 

0*03 

— 

u.d. 

— 

— 

u.d. 

— 

a 

*917 

0*008 

b 

*735 

0*005 

©2 

•405 

0*05 

hz ip) 

*239 

0*08 

a 

•767 

0*03 

b 

•632 

0*03 

a 

5*088 

0*015 

h 

4*947 

0*05 

_ 

abs. 

_ 

— 

abs. 

— 

ag (a) 

*656 

0*03 

(Z>) 

*524 

0*03 

— 

u.d. 

— 

— 

u.d. 

— 

_ 

abs. 

— 

— 

abs. 

— 

ctz 

•406 

0*06 

bz ip) 

•239 

0*08 

a 

*742 

0*016 

b (60 

•691 

0*02 

— 

u.d. 

— 

— 

u.d. 

— 

a (z) 

•217 

0*02 

b (y) 

•067 

0*04 

a (x) 

*226 

0*03 

b (P) 

•067 

0*05 

— 

abs. 

— 

— 

abs. 

— 

_ 

abs. 

_ 

— 

abs. 

— 

_ 

abs. 

— 

— 

abs. 

— 

a (y) 

•783 

0*05 

b (0^) 

*616 

0*04 

a 

•217 

0*02 

b 

•057 

0*04 

— 

u.d. 

— 

— 

u.d. 

— 

_ 

abs. 

— 

— 

abs. 

— 

a 

*885 

0*04 

b 

*771 

0*04 

« ky) 

*783 

0*06 

b kp) 

*616 

0*04 

— 

abs. 

— 

— 

abs. 

— 

_ 

abs. 

_ 

— 

* abs. 

— 

— 

abs. 

— 

— 

abs. 

— 






(continued)



3→3 BANDS




10 

11 

12 

13 

14 

15 

16 

17 

18 



weight 

intensity- 

weight 




intensity 



of 

ratio 

of 




ratio (for 

mean wave 

Av 

Av 

(observed) 

Ka/b) 



Svjt 

calculation 

number vf 

(splitting). 

figure 

I(alb) 

figure 

8vb 

dvp 

standard 

of mean vj) 

15931*968 

0*166 

3 

5/2 

1 




obs. 






104*46 

104*503 

104*482 


15827*465 

[0*15] 

0 

[4/1] 

0 




4/1 

16445*787 

0*124 

1 

5/2 

1 




obs. 






259*17 

259*164 

259*182 


16186*623 

0*132 

2 

?3/l 

1 




3/1 

16388*306 

0*211 

2 

5/2 

0 




obs. 






104*49 

104*496 

104*482 


16283*81 

— 

— 

— 

— 




*““ 

17129*865 

0*182 

6 

5/2 

2 




obs. 






104*48 

104*488 

104*482 


17025*377 

0*168 

1 

[?5/l] 

0 




5/1 

15949*713 

0*135 

2 

3/2 

2 




obs. 






154*67 

164*665 

154*70o 


15795*048 

0*141 

2 

?2/l 

1 




6/2 

G^yi.l6341*27 

_ 

_ 

— 

— 




— 






154*66 

154*647 

154*70o 


16186*623 

0*132 

2 

?3/l 

1 




3/1 






154*73 

— 

154*70o 


17025*377 

0*168 

1 

[?5/l] 

0 




6/1 






164*71 

154*678 

154*70o 


16870*699 

0*151 

3 

2/1 

1 




6/2 

G^i/^17187*88 

_ 

_ 

— 

— 




— 






154*69 

164*71 

164*70a 


17033*171 

0*160 

2 

[5/2] 

0 





16476*178 

0*169 

1 

[6/2] 

0 




6/2 






457*94 

458*038 

458*03 


Gfyj.16018-14 

— 

- — 

— 

— 





16424*28 

— 

— 

— 

— 

206*01 


206*04 

_ 

' 16218*27 

— 

— 

— 

— 





17239*68 

_ 

_ 

— 

— 




— 






[206*04] 

205*98 

206*04 


17033*699 

0*167 

1 

?1/1 

0 




obs« 

17033*171 

0*160 

2 

[6/2] 

0 




— 






206*07 

206*05i 

206*04 


(?vj.l6827*12 

— 

— 

— 

— 





<?i;j.l5862*84 

___ 

_ 

_ 

— 




— 






251*95 

251*988 

251*990 


15610*852 

0*114 

1 

?3/l 

1 




5/2 

17033*699 

0*167 

1 

?1/1 

0 




obs. 






251*94 

262*00 

251*990 


(?yj.l6781*70 

— 

•— 

— 

— 





n7282*96 

■■ 


_ 

— 




— 






252*10 

— 

251*990 


(?vjr? 17030*86 

— 

— 

— 

— 







190 

1 

transition
to 2p ³Π

2d 

Zd^n^ 

Zd^A^ 

Zdm, 

Zd^Aa 

Zd^A^ 

Zd^S 

Zdm^ 

Zd^A^ 

Zd^A^ 

Zd^A, 

Zd^Z 

Zd^Uc 

Zd^A^ 



Table 2. Accuracy of combinations of lines

0→0 BANDS


2 

Z 

4 

5 

6 

7 

8 

decimal 

9 



wave

intensity


of wave

tolerance



number



number

branch 


vd 

G 

K 

component 

vf 

+ 





22-3 

a (y) 

?b 

•896 

0-01 

R' 

1 

16884-94 

4 

? -75 

0-04 





12-0 

Cb 

4-148 

0-008 

Q' 

2 

16764-12 

4 

h 

3-959 

0-005 




2 


CL 

5-035 

0-05 

Q' 

1 

17125-07 


??6 

4-836 

0-05 

P' 

2 

17004-40 

3a 

14-9 


_ 

__ 



17369-32 



a 

•307 

0-006 

R' 

1 

5 

— 

b 

abs. 







CL 

•522 

0-02 

Q' 

2 

17248-55 

1 

— 

b 

abs. 


R' 

1 

17588-00 

5 

22-0 

Oj (d) 

6. (c) 

7-962 

7-833 

0*02 

0-04 

Q' 

2 

17467*13 

6 

10-0 

a 

b 

•220 

060 

0-007 

0-02 



17248-55 



CL 

•522 

0-02 

Q' 

2 

1 

— 

b 

abs. 


P' 

3 

17068-60 

1 

— 

1] 

U. 

— 

R' 

2 

17669-81 

2 

3-3 

— 

abs. 

abs. 

— ' 





9-6 


abs. 

__ 

Q' 

3 

17489-90 

3 

— 

abs. 

— 

Q' 

2 

17467-13 

6 

10-0 

a 

b 

•220 

•060 

0-007 

0-02 

P' 

3 

17287-27 

2 

9-3 

?a, z 
b 

-252 

abs. 

0-03 

R' 

3 

16774-97 

4 

TJ. 

ay (&) 

6i 

•870 

‘75 

0-02 

0-05 

Q' 

4 

16536-89 

i 

8-7 

a 

b 

— 

- 

R' 

3 

17498-65 

2U. 

_ 

_ 

abs. 

_ 

Q' 

4 

17260-69 

0 

— 

— 

abs. 

— 

Q' 

3 

17489-90 

3 

9-6 

_ 

abs. 

— 

P' 

4 

17251-99 

Oa 

— 

— 

abs. 

— 

R' 

4 

17855-86 

0 

_ 

_ 

abs. 

— 

Q' 

5 

17562-23 

3 

— 

— 

abs. 

— 

Q' 

4 

17612-63 

1 

_ 

_ 

abs. 

_ 

P' 

5 

17318-92 

1 

— 

— 

abs. 

— 

R' 

5 

1664^-08 

1 

_ 

_ 

abs. 

. _ 

Q' 

6 

16295-02 

3 

— 

— 

abs. 

— 

R' 

5 

17581-42? 

1 

_ 

_ 

abs. 

_ 

Q' 

6 

17233-19? 

2h 

10-3 

— 

abs. 

— 

Q' 

5 

17562-23 

3 

_ 

_ 

abs. 

_ 

P' 

6 

17213-82 

0 

— 

— 

abs. 

— . 



ending on 2p ³Π. (s) ‘Weak lines’ (dashed)


0→0 BANDS


10 

11 

12 

weight 

13 

intensity- 

14 

weight 

15 

16 

17 

18 

intensity- 
ratio (for 

mean wave 
number vp 

Av 

(splitting) 

of 

Av 

figure 

ratio 

(observed) 

l{alb) 

of 

!(«/&) 

figure 

SVJR 

dvjp 

$Vs 

standard 

calculation 
of mean yj-) 

16884-869 

0*146 

1 

?4*5/l 

0 

120*82 

120*775 

120*79 

obs. 

5/2 

16764-094 

0*189 

5 

2*95/1 

?5/l 

6 




obs. 

17126-002 

??0*20 

1 ’ 

0 

120*67 

120*62 

120*79 

(?vjl7004-38 

— 

— 

— 

— 





[17004-21] 








5/1 

5/2 

17369-282 

[0*16] 

— 

>5/1 

0 

120*77 

120*812 

120*79 

17248-470 

[0*18] 

0 • 

[5/2] 

0 




17587-936 

0*129 

2 

4/1 

1 

120*87 

120*762 

120*79 

5/1 

5/2 

17467-174 

0*160 

3 

2*5/1 

1 




17248-470 

[0*18] 

0 

[5/2] 

0 

179*95 

179*913 

179*905 

5/2 

17068-567 

?0*10 

1 

?1/1 

0 





— 

— 

—r- 



179*91 

— 

179*905 

_ 

17467-174 

0*160 

3 

5/2 

2 

179*86 

179*962 

179*905 

5/2 

3/1 

17287-212 

[0*16] 

0 









[4/1] 





4/1 

16774-86 

0*12 

1 

0 

238*08 

237*98 

237*93 


16636-87 

— 

— 

— 

— 







_ 

— 

— 

237*96 

— 

237*93 

— 



_ 

_ 

— 

237*91 

— 

237*93 

— 



_ 

_ 

— 

293*63 

— 

293*72 

— 





_ 

293*71 

— 

293*72 

— 



_ 

— 

— 

348*06 

— 

348*42 

— 




_ 

— 

348*23 

— 

348*42 

— 






348*41 

— 

348*42 

— 





1 

2 

z 

4 

transition
to 2p ³Π

branch 

K" 

wave
number
ν_G


E' 

1 

17031-67 



Q' 

2 

16916-57 


R' 

1 

16630-47 



Q' 

2 

16515-45 


Q' 

1 

n6899-76 



F' 

2 

?16784-56 


R' 

1 

17138-90 

zdm. 


Q' 

2 

217023-67 


R' 

1 

17431-57 


Q' 

2 

17316-56 


Q' 

2 

16916-57 

Zs^S 


F' 

3 

16744-92 


R' 

1 

17138-90 

'3d»17„ 


p' 

3 

16852-22 

Zd^Aa 

R' 

2 

17508-99 

Q' 

3 

17337-39 


Q' 

2 

17316-56 

3dM« 


F' 

3 

17144-93 

Za^E 

R' 

3 

17139-77 

Q' 

4 

16913-48 


R' 

2 

16988-72 

Zdm. 


F' 

4 

16590-80 


R' 

3 

16630-23 

Zd^E 


Q' 

4 

16303-84 

Zd^n, 

i?' 

3 

17282-98 

Q' 

4 

17056-70 

Zd^Aa 

Q' 

3 

17337-39 

F' 

4 

17111-01 

Zd^Ac 

R' 

3 

17633-88 

Q' 

4 

17407-61 

Zs^E 

Q' 

4 

16913-48 

P' 

5 

16633-03 


R' 

4 

17669-81 


Q' 

5 

17389-47 


Q' 

4 

17407-61 


P' 

5 

17127-12 

Zd^E 

R' 

5 

16405-53 

Q' 

6 

16074-39 


1→1 BANDS


5 

6 

7 

8 




decimal 

intensity 


of wave 

/.. 



number 

a 

K ' 

component 


la 



abs. 

~~ 

— 

abs. 

A 


2 a (J?) 

•628 

u 


b 

-466 

3 

8-4 

a 

b 

•468 

abs. 

5 

15-0 

a 

b 

-488 

-309 

A 


a, (b) 

•53i 

V 


ht 

abs. 

1 


_ 

abs. 

1 


— 

abs. 

4ch 

— 

« iy) 

b 

•872 

abs. 

1 


^4 

[ - 86 ] 



[ *70] 

6 

17-1 

a^ 

62 

•559 , 
abs. 

5 

15-6 

a 

b 

*579 

•418 

0 


2 a (B) 

•628 



b 

•466 

5 

16-7 

— 

abs. 

— 

abs. 

4^ 


a (y) 

•872 


b 

abs. 

3 

10-2 

a 

226 

•174 

•008 

5 

12-1 

_ 

abs. 

3 

11-5 


abs. 

5 

16-6 

a 

-579 


h 

•418 

4 

10-2 

a 

26 

•863 

•75 

5 

17-6 

— 

abs. 

0 

— 

— 

abs. 

8 

22-7 

oi (y) 

•864 

h (A) 

•635 

1 

8-0 

— 

abs. 


— 

abs. 

5 

14-2 

a 

•234 

1 


26 (p) 

•11 

1 



abs. 

1 

9-3 

— 

abs. 

3 

11-5 

— 

abs. 

2 

— 

— 

abs. 

1 

— 

— 

abs. 

1 

— 

— 

abs. 

0 

— 

— 

abs. 

0 

— 

— 

abs. 

2 

3*3 

— 

abs. 

2 

— 

— 

abs. 

1 

— 

— 

abs. 

0 

— 

— 

abs. 

Oa 

— 

— 

abs. 

0 

— 

— 

abs. 


Table 2 

9 


tolerance 


0-04 

0-02 

0-015 

0-006 

0-008 

0-04 


0*02 


0-006 


0-008 

0-015 

0-04 

0-02 


0-02 


0-005 

0-03 


0-008 

0-015 

0-02 

0-05 


0-03 

0-04 


0-03 

0-04 



(continued) 1→1 BANDS


10 

11 

12 

13 

14 

15 

16 

17 

18 



weight 

intensity 

weight 




intensity 



of 

ratio 

of 




ratio (for 

mean wave 

Av 

Av 

(observed) 

I(alh) 



SVB 

calculation 

number 

(splitting) 

figure 

l{alb) 

figure 

^Vs 

Svjp 

standard 

ofmeanvjp) 

G«i;pl7031*65 

— 

— 

— 

— 




— 






115*10 

115*06 

116-03 


16916*587 

?0*162 

1 

?3/l 

0 




obs. 

16630*443 

[0*15] 

0 

>6/1 

2 




6/1 






115*02 

115*006 

116-03 


16515*437 

0*179 

5 

[5/2] 

1 




obs. 

16899*506 

[0*15] 

0 

— 

— 




6/1 






115*20 

114*966 

115-03 


(?j;jrl6784*64 

— 

— 

— 

— 





17138*847 

[0*15] 

0 

[5/1] 

0 




6/1 






115*23 

— 

116-03 


<?j;j.l7023*65 

[0*16] 

0 

[5/2] 

0 




— 

[17023*817] 









17431*534 

[0*15] 

0 

[6/1] 

0 




5/1 






115*01 

115*001 

116-03 


17316*533 

0*161 

4 

6/2 

2 




obs. 

16916*587 

?0*162 

1 

?3/l 

0 




— 






171*65 

171*687 

171-66 


G'i;j.l6744‘90 

— 

— 

— 

— 





17138*847 

- 

_ 

_ 

— 




—* 






286*68 

286*710 

286-68 


16852*137 

0*166 

3 

3*5/1 

2 




' obs. 

— 

— 

— 

— 

_ 

171*60 

— 

171-66 


17316*533 

0*161 

4 

5/2 

2 




— 






171*63 

171*701 

171-65 


17144*832 

0*0l3 

2 

4/1 

2 





0*^2.17139*75 

— 

— 

— 

— 

226*29 


226-34, 

— 

01/2.16913*46 

— 

— 

— 






16988*80 

0*229 

0 

[5/2] 

0 




5/2 






397*92 

398*02 

307-89, 


Oi/j.16590-78 

— 

— 

— 

— 



- 


16530*208 

0*12 

1 

3*5/1 

1 




3*5/1 






226*39 

226*388 

226-342 


Oi/j.l6303*82 

— 

— 

— 

— 





— 

— 

— 

— 

— 

226*28 

_ 

226-342 

— 

— 

— 

— 

— 

— 





— 

— 

— 

— 

— 

226*38 

— 

226-342 

— 

— 

— 

— 

— 

— 

226*37 

— 

226-342 

— 

— 

— 

— 

— 

— 

280*45 

— 

280-400 

- 

— 

— 

— 

— 

. — 

280*34 

— 

280-400 


— 

— 

— 

— 

— 

280*39 

— 

280-400 

— 


331*14 


331*20 





1 

2 

3 

4 




wave 

transition 



number 

to 

branch 

K" 

G.M.L. 



5 

17389*47 

P' 

6 

17058*20 


B' 

1 

16332*11 

3d^i: 

0' 

2 

16222*72 


Q' 

1 

16648*40 


P' 

2 

16538*98 


B' 

1 

17280*68 

3dM^ 

Q' 

2 

17171*27 


B' 

1 

16822*00 

sdm. 

P' 

3 

16548*90 


B' 

2 

17348*12 

Sd^Ja 

Q' 

3 

17184*46 


Q' 

2 

17171*27 


P' 

3 

17007*59 


B' . 

3 

17247*26 

3d^i: 

Q' 

4 

16032*09 

3dm, 

B' 

3 

16961*25 

«' 

4 

16746*17 


Q' 

3 

17184*46 

sd^^a 

P' 

4 

16969*39 


B' 

3 

17433*60 


Q' 

4 

17218*50 

Sd^a 

B' 

4 

17481*61 

Q' 

6 

17214*05 


Q' 

4 

17218*60 

Bd^J, 

P' 

5 

16950*98 

Sd^Ja 

Q' 

5 

17214*05 

P' 

6 

16899*76 

3dm 

P' 

1 

16827*69 

Bdm^ 

P' 

2 

16746*56 

3dma 

B' 

4 

16752*69 

3dm^ 

P' 

4 

16367*90 


2→2 BANDS


5 

6 

7 

8 




decimal 

intensity 


of wave 
number 

Q 

K 

component 

Vf 

2 



abs. 

0 

_ 

_ 

abs. 




*141 

1 



abs. 


10*0 

a 

•743 

2 

h 

*575 



a 

*411 

2 


h 

abs. 


7*4 

a 

•99 

2 

h 

*88 

3 

17*5 

a 

1h 

*661 

•565 

4 

13*9 

a 

b 

*290 

*115 

2 

6*7 

fas (r) 

(g) 

1*96 

1*78 

3 

11*1 

— 

abs. 


9*4 

a 

•144 

3 

?6 

•006 

3 

10*7 

a 

b 

•428 

•320 

4 

13*9 

a 

b 

•290 

•115 

1 

— 

— 

abs. 

2 

— 

— 

abs. 

la 

— 

It] (»)} 

•134 

0^ 

_ 

_ 

abs. 

1 

— 

— 

abs. 

3 

10*7 

a 

b 

•428 

•320 

1 

— 

— 

abs. 

1 


(?y) 

•690 

X 


?6i (a?) 

*378 

8 

23*0 

a 

b 

•494 

•376 

1 

_ 

_ 

abs. 

U. 

— 

— 

abs. 

8 

23*0 

a 

b 

•494 

•376 

Oa 

- 

— 


U. 

_ 


abs. 

0 

— 

— 

abs. 

1 

— 

— 

abs. 

2 

— 

— 

abs. 

Oa 

— 

— 

abs. 

1 

8*4 

— 

abs. 


Table 2 


9 


tolerance 

± 


0-04 


0-026 

O-Olg 

0-02 


0-04 

0-06 

0-02 

0-05 

0-03 

0-01 

0-08 

0-08 


0-02 

0-03 

0-06 

0*06 

0-03 

0-01 


0-05 

0-06 


0-05 

0-02 

O-Olg 

0-03 


0*01s 

0-03 



(continued)

10 

mean wave 
number 

16332*116 

16222*695 

16648*386 

16538*96 

17280*647 

17171*246 

16821*92 

(?i^jl6548*88 

17348*105 

17184*401 

17171*246 

Gvj^l7007-57 

16032-134 

17184*401 

16969*37 

17433*537 

17218*460 

17218*460 


2->2 BANDS 


11 

12 

weight 

of 

13 

intensity 

ratio 

14 

weight 

of 

Jv 


(observed) 

J(a/6) 

(splitting) 

figure 

I(a/6) 

figure 

— 

— 

— 

— 

— 

— 

— 

— 

[0*15] 

0 

— 

0 

0*168 

3 

5/2 

2 

[0*15] 

0 

^4/1 

0 

0*11 

0 

3/1 

1 

0*096 

2 

m 

5 

0*175 

3 

3/1 

2 

?0*17 

1 

?5/2 

0 

— 

— 

— 

— 

0*138 

2 

?5/2 

1 

?0*108 

1 

?3/l 

0 

' — 

— 

— 

— 

— 

— 


— 

— 

— 

— 

— 

— 

— 

— 

— 

— 

— 

— 

— 

— 

— 

— 

— 

— 

— 

— 

— 


— 

— 

— 

?0*21, 

0 

?3/l 

0 

0*118 

1 

2/1 

0 


15 

16 

17 

A'v 

.18 

intensity 
ratio (for 

A'v 


G.M.L. 

calculation 

G.M.L. 


standard 

of mean v) 

331*27 

— 

331*20 

5/1 

109*39 

109*421 

109*40o 

obs. 

5/1 

109*42 

109*426 

109*40o 

obs. 

109*41 

109*407 

109*40a 

5/2 

4/1 

273*10 

273*04 

273‘09a 

obs. 

163*66 

163*704 

163*692 

3/1 

163*68 

163*676 

163*69j 

— 

215*00 

215* 10« 

215*0, 

— 

215*08 

— 

215*0, 

— 

215*07 

215*03i 

215*0, 

obs. 

215*00 

215*077 

215*0, 

5/2 

267*56 

— 

267-5* 


267*52 

267*50o 

267-5* 

: 

314*29 

_ 

314** 

— 


16950*96 





1 

transition
to 2p ³Π

uma 

3d®J7, 

Sd^A^ 

Zd^Aa 

Zd^A, 

3d»ir 

Zdm, 

Zd^A^ 

zdm^ 

Zd^A^ 

Zd^E 


Table 2 


3->3 BANDS 


2 

3 

4 

5 

6 

7 

8 

decimal 

9 



wave 

intensity 


of wave 




number 


-^ 


number 

tolerance 

branch 

K" 

Vo 

Q 

K 

component 

vf 

+ 






_ 

abs. 

_ 

B' 

1 

?15976-14 

0 

— 

_ 

abs. 

— 

Q' 

2 

?15872*20 

1 

— 

a 

h (‘ix) 

[ -271] 
•101 

0*02 

Q' 

1 

16363*49 

1 

— 

— 

abs. 

. — 

P' 

2 

16269*53 

1 

— 

— 

abs. 

— 



16511*46 



Cb 

•470 

* 0*03 

R' 

1 

1 

— 

?6 

*32 

0*05 




4 

12*0 

Obt 

•820 

0*02 

P' 

3 

16251*79 

?6i 

•679 

0*05 



[17130*30?] 



a 

•276 

0*02 

R' 

1 

— 

— 

b 

•10 

0*06 

Q' 

2 

17026*36? 

2 

— 

% M 
b, (6) 

•368 

•189 

0*02 

0*06 




2 

8*9 

?ct 

?6*018 

0*08 

R' 

2 

17186*00 

?6 

?5*882 

0*08 

Q' 

3 

17030*19 

1 

— 

— 

abs. 

abs. 

— 

Q' 

2 

17026-36? 

2 

— 

<h. (“) 

6i (&) 

-368 

*189 

0*02 

0*06 

P' 

3 

16870*70? 
(0*36 M.B.) 

2 

10*5 

% (6') 

0l 

•591 

abs. 

0*02 

R' 

3 

16910*77 

0 

_ 

_ 

abs. 

— 

Q' 

4 

15706*57 

1 

— 

— 

abs. 

— 

R' 

3 

16633*03 

0 

_ 

_ 

abs. 

— 

Q' 

4: 

16428*77 

0 

— 

— 

abs. 

— 

Q' 

3 

17030*19 

1 

— 

z 

abs. 

abs. 

. 

P' 

4 

16826*98 

1 

— 

?aa (a?) 

??52 

6*027 

5*917 

0*03 

0*05 

R' 

3 

17244*91 

0 

— 

— 

abs. 

abs. 

— 

Q' 

4 

17040*75 , 

1 

— 

a 

b 

*803 

*675 

0*02 

0*03 

R' 

2 

16465*55 

2 

— 

?a 

?6 

*654 

•480 

0*04 

0*02 

P' 

4 

16105*59 

1 

— 

_ 

abs. 

abs. 


R' 

4 

17287*27 

2 

9*3 

?a {z) 

•252 

0*03 

Q' 

5 

?17036*21 

ab 

— 

— 

abs. 

— 

P' 

1 

15913*37 

00 

— 

— 

abs. 

— 


Notes on tables 1 and 2 

1. The figures given for ‘tolerances’ in column 9 are those given in column 7,
table 3, Paper I; the method of deriving them is explained in the general notes to
table 3, Paper I.

2. The figures of column 12 for weighting the Δν’s should be a simple function of
the corresponding figures of column 9, but in actual fact they were not compiled
from those figures but were based on a second review of the data and thus form an
independent set of estimates. These figures have been chosen for the final weighting
in computing the mean splittings of the different levels (table 3).



(continued)

3→3 BANDS


10 

11 

12 

13 

14 

16 

16 

17 

18 



weight 

intensity 

weight 




intensity 



of 

ratio 

of 




ratio (for 

mean wave 

/Iv 


(observed) 

Ha/b) 




calculation 

number vj> 

(i^litting) 

figure 

I(ajb) 

figure 

$Vs 

dvj> 

standard 

of mean vj') 

€?i^j.l6976-12 


— 

— 

— 

103-94 

103-90 

103-944 

— 

715872-222 

[0-17] 

0 

— 

0 




5/2 




_ 

_ 

103-96 

— 

103-944 

— 

16511-483 

0-16 

1 

?3/l 

0 

269-67 

259-641 

259-769 

obs. 

16261-792 

0-141 

1 

4/1 

0 




obs. 

17130-247 

0-176 

1 

6/1 

0 

103-94 

103-930 

103-944 

obs* 

17026-317 

0-179 

1 

?6/2 

0 




obs* 

17186-98 

??0-136 

1 

??6/2 

0 

165-81 

155-81 

. 165-8i6 

5/2 

G'v^l7030-17 

— 

— 

— 

— 



— 

17026-317 

0-179 

1 

?6/2 

0 

166-66 

166-77 

165-8i5 

obs. 

16870-66 

[0-16] 

0 

— 

0 



3/1 

— 

— 

— 

— 

— 

204-20 

— 

204-228 

— 

— 

. 

— 

— 

— 

204-26 

— 

204*228 

— 

Ofyj.17030-17 

— 

— 

— 

— 

204-21 

204-17s 

204-228 


16826-996 

?0-ll 

1 

^4/1 

0 




6/2 

avjpl7244-89 

— 

— 

— 

— 

204-16 

204-12 

204-228 

— 

17040-766 

70-128 

2 

72/1 

0 




6/2 

16466-604 

0-174 

1 

71/1 

1 

369-96 

360-03 

360-043 

6/2 

(?vjp16106-67 

— 

— 

— 

— 




— 

17287-262 

Gfvj.17036-19 

— 

_ 

_ 

— 

251-06 

251-66 

7264-06 

— 

_ 

_ 

___ 

_ 

_ 

_ 

— 

— 

— 


3. When figures are given in square brackets, either they are entirely predictions
or they are predictions with only very feeble direct experimental evidence.

4. The wave numbers ν_G in column 4 are from the tables of Gale et al. (1928), and
so are the eye estimates of intensity in column 5. The intensities under K (column 6)
are the energy measures of Kapuscinsky & Eymers (1929).

5. The figures (column 8) under ‘decimal of wave number ν_F’ are our measures for
the fine-structure components a, b (column 7) of the unresolved composite lines
measured by Gale et al. (column 4). The doublet splittings in column 11 are the
differences of the a and b values of ν_F in column 8. The weights 0-6 in column 12
divide the Δν’s of column 11 into seven groups such that we should expect those
with the same number to have about the same accuracy, those with the highest
accuracy having the highest number (6).

6. Column 13 gives the ratio of the intensity of the component a in column 8 to
that of b. These are mostly eye estimates, but some were based on photometric
measures. We believe that their relative accuracy is fairly assessed by the series of
numbers 0-6 in column 14, in the same way as those in column 12 represent the
accuracy of the splittings.

7. The mean wave numbers ν_F in column 10 are the values which we calculate
for the wave numbers of the composite lines from our measurements of the ν_F of
the components in column 8, using the intensity ratio I(a/b) given in column 18.
These ratios are not always the same as our observed ratios given in column 13. Some of
the observed ratios are known to be wrong owing to various causes. We have made
an attempt to eliminate the effects of these in the values given in column 18 which
differ from those given in column 13. Unfortunately, it is often difficult to find out
to what extent we have been successful in this.

8. The values in column 15 are the wave-number differences δν_R
of consecutive rotational terms found by Richardson & Davidson (1933). In most
cases, these are based entirely on the measures of Gale et al. and are just the differences
of successive pairs of ν_G’s in column 4. The values in column 16 are the same
differences when the wave-lengths of the composite lines have been constructed
from our measured wave numbers of the constituent doublets, and are the differences
of the successive pairs of values in column 10.

The ‘standard’ values in column 17 are the weighted means of all available
values for the same pairs of final rotational levels published in Richardson &
Davidson (1933).

9. When the letters u.d. occur in these tables, they replace one of the following
expressions in the table in its original form: ‘unreliable data’, ‘not measured’,
‘probably double’, ‘no useful data’, ‘not observed’, ‘nothing due to this line observed’,
‘too weak to have visible effect on pattern’, ‘not measured, probably
double’, ‘only one component observed, too faint to be sure of detail’. These phrases
have different meanings, but the main point for the purpose of this paper is the
effect of what they express on our measured or estimated numbers, and this has been
taken into account in estimating the weights and tolerances in the relevant columns.

10. The letter ‘U.’ in these tables means ‘unresolved’ and ‘o.c.’ means ‘setting
on optical centre’.

11. In a few cases tolerances are not quoted in column 9. In these cases reference 
should be made to table 3, Paper I. 

12. For discussion of the asterisked Δν’s of column 11, table 1, 1→1 bands, see
Paper I, p. 159.

(End of notes on tables 1 and 2)
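
The arithmetic described in notes 5, 7 and 8 can be sketched in a few lines of Python. This is only an illustration and not necessarily the authors' procedure in detail: the function names are invented here, and it is assumed that the mean in column 10 is the mean of the two components weighted by the intensity ratio of column 18. The inputs are the first two lines of the 3→3 bands in table 1.

```python
# Illustrative sketch of notes 5, 7 and 8; all function names are hypothetical.

def splitting(nu_a, nu_b):
    """Doublet splitting (column 11): difference of the a and b components."""
    return nu_a - nu_b


def composite_mean(nu_a, nu_b, i_a, i_b):
    """Wave number of the composite line (column 10), assumed to be the mean
    of the components weighted by the intensity ratio I(a/b) = i_a / i_b."""
    return (i_a * nu_a + i_b * nu_b) / (i_a + i_b)


def term_difference(nu_first, nu_second):
    """Difference of a successive pair of lines (columns 15 and 16)."""
    return nu_first - nu_second


# First line: components a and b as measured, observed ratio 5/2.
nu_a, nu_b = 15932.015, 15931.849
delta = splitting(nu_a, nu_b)
mean_1 = composite_mean(nu_a, nu_b, 5, 2)

# Second line: b was not observed, so a is combined with the predicted
# splitting [0.15] and the ratio 4/1 quoted in the table.
mean_2 = composite_mean(15827.495, 15827.495 - 0.15, 4, 1)

diff_f = term_difference(mean_1, mean_2)      # from our means (column 16)
diff_g = term_difference(15931.94, 15827.48)  # from Gale et al. (column 15)

print(round(delta, 3), round(mean_1, 3), round(mean_2, 3))
print(round(diff_f, 3), round(diff_g, 2))
```

With these inputs the sketch gives means of 15931.968 and 15827.465 and differences of 104.503 and 104.46, which agree with the corresponding entries of columns 10, 16 and 15.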



(continued from p. 179)

(3) There is only one case when values of R'(m) − Q'(m+1) and Q'(m) − P'(m+1)
are available for the same m (for m = 1 in the 2-2 bands), but in this case the corresponding
statement to (2) for the dashed transitions is applicable.

(4) If by Q(m) − P(m+1) a and Q(m) − P(m+1) b one denotes the Q(m) − P(m+1)
a and b differences and so on, then

Q1−P2 a > Q1−P2 b,    R1−Q2 a > R1−Q2 b,

Q2−P3 a < Q2−P3 b,    R2−Q3 a < R2−Q3 b,

Q3−P4 a > Q3−P4 b,    R3−Q4 a > R3−Q4 b.

There are three cases in the 2-2 and 3-3 bands which show small differences on
the wrong side of these inequalities, but they are all doubtful; otherwise, they apply
without exception to the results for all the bands, 0-0, 1-1, 2-2 and 3-3, for which
data are available.

(5) For the dashed transitions the data are much less numerous and also less
representative. There are seven examples of R'1−Q'2 a and b, for all of which
R'1−Q'2 a is < R'1−Q'2 b. The rest consist of one example each of Q'1−P'2, for
which Q'1−P'2 a is > Q'1−P'2 b, of R'2−Q'3, for which R'2−Q'3 a is > R'2−Q'3 b,
of Q'2−P'3, for which Q'2−P'3 a is > Q'2−P'3 b, and of R'3−Q'4, for which
R'3−Q'4 a is > R'3−Q'4 b. Thus in nine out of the eleven examples the inequalities
are in the opposite sense to those for the corresponding undashed transitions. It
looks, therefore, as if there is in the main a reversal of sign of these inequalities in
the dashed as compared with the undashed transitions. The two exceptions both occur
in the 2-2 bands. There were three rather dubious exceptions to the general statement
in the undashed transitions, all in the 2-2 and 3-3 bands. It may be that the
rule only applies strictly to the lower vibrational levels and is tending to break
down at the higher levels. The evidence is too scanty to say more than this.
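
The count of nine reversals in eleven examples can be checked mechanically from the inequalities quoted in (4) and (5). The following Python sketch does only that bookkeeping; the senses are transcribed from the text as read here (R'1−Q'2 seven times with a < b, and one example each of Q'1−P'2, R'2−Q'3, Q'2−P'3 and R'3−Q'4 with a > b), and the variable names are assumptions made for illustration.

```python
# Sense of the a-versus-b inequality for the undashed differences that
# start at rotational number m, as listed in statement (4).
UNDASHED_SENSE = {1: ">", 2: "<", 3: ">"}

# Dashed examples from statement (5): (difference, m, observed sense).
dashed_examples = (
    [("R'1-Q'2", 1, "<")] * 7
    + [
        ("Q'1-P'2", 1, ">"),
        ("R'2-Q'3", 2, ">"),
        ("Q'2-P'3", 2, ">"),
        ("R'3-Q'4", 3, ">"),
    ]
)

# Examples whose sense is opposite to the undashed rule, and those that keep it.
reversed_cases = [name for name, m, sense in dashed_examples
                  if sense != UNDASHED_SENSE[m]]
same_sense = [name for name, m, sense in dashed_examples
              if sense == UNDASHED_SENSE[m]]

print(len(dashed_examples), len(reversed_cases), same_sense)
```

On this reading the two examples that keep the undashed sense are Q'1−P'2 and R'3−Q'4, and the remaining nine are reversed.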

(6) There is also evidence of another regularity when we compare the

R, Q(m) − Q, P(m+1)

a and b differences which come from the 3d ®Zr^, bands with the 

Q, P(m) − R, Q(m+1)

a and b differences which come from 11^, A(,-^2p^II. Taking the mean values 
given in table 6 we have: 

0→0 bands   R1−Q2 a − Q1−P2 a = +0·038,   R1−Q2 b − Q1−P2 b = +0·027

            Q2−P3 a − R2−Q3 a = +0·008,   Q2−P3 b − R2−Q3 b = −0·012

1→1 bands   R1−Q2 a − Q1−P2 a = +0·017,   R1−Q2 b − Q1−P2 b = +0·007

            Q2−P3 a − R2−Q3 a = +0·009,   Q2−P3 b − R2−Q3 b = −0·036

2→2 bands   R1−Q2 a − Q1−P2 a = +0·031,   R1−Q2 b − Q1−P2 b = −0·021

            Q2−P3 a − R2−Q3 a = +0·008,   Q2−P3 b − R2−Q3 b = −0·009

3→3 bands   R1−Q2 a − Q1−P2 a = −0·012

            Q2−P3 a − R2−Q3 a = −0·018,   Q2−P3 b − R2−Q3 b = −0·0[?]


Table 3. Final antisymmetric (a) rotational terms


0->0 BANDS 


initial state 

Q1−P2

R1−Q2

R2−Q3

Q2−P3

Q3−P4

R3−Q4

R4−Q5

Q4−P5


120*68, 

_ 

180-046 

— 

— 

— 

294*276 

— 


— 

120-7305* 

— 

180-052t 

— 

237-683 

— 

— 

zdm^ 

120-716 

— 

180*046 

— 

237-682 

— 

294*37 

— 

Zd^Aa 

— 

120-737 

— 

180*056 

— 

237-72, 

— 

294*263 

ZdA^ 

— 

— 

180-07 

— 

237*694 

— 

— 

.— 

weighted 

means 

120-716 

120*734 

180-046 

180*0635 

237-688 

237*705 

294-325 

294*263 

differences 

— 

0-018 

— 

0-007s 

- 

0-01, 

+ 0*0 

*72 


* Got by subtracting R1−Q2 = 120·737 from 300·784 = R1−P3 of 3d ³Π_a → 2p ³Π.

† Got by subtracting 3d ³Δ_a Q2−P3 = 180·053₅ from 300·784 = R1−P3 of 3d ³Π_a → 2p ³Π.


1→1 BANDS


3«®P 

115*144 

— 

171-359 

— 

226-930 

— 

— 

— 

3d»i; 

116-161 

— 

171*336 

— 

— 

— 

279*782 

— 


— 

116-164* 

— 

171*336t 

— 

226*91 It 

— 

279*773§ 

3d»ir, 

116-149 

— 

171*316 

— 

226-90 

— 

— 

— 

3d»Ja 

— 

116*163 

— 

171*337 

— 

226-901 

— 

279*763 

3d»A, 

— 

— 

171-331 

— 

— 

— 


— 

weighted 

116-161 

116*1635 

171*335 

171*336 

226*915 

226-906 

279*782 

279*765 


means 


differences — 0*0125 — 0-001 ^O-OO^ +0-016 

* Got by subtracting Q2−P3 from R1−P3 = 286·499.

† Got by subtracting R1−Q2 from R1−P3 = 286·499.

‡ Got by subtracting Q4−P5 from R3−P5 = 506·676.

§ Got by subtracting R3−Q4 from R3−P5 = 506·676.


2→2 BANDS


3d»r 

109*728 

— 

162*911 

— 

— 

— 

266*786 

— 


— 

109*753* 

— 

162-906t 

— 

216-308t 

— 

— 

3dm, 

109*768 

— 

162-919 

— 

216-316 

— 

— 

— 

3d»zla 

— 

109-764 

— 

162*919 

— 

— 

— 

265*74 

3d»zl, 

— 

— 

162*907 

— 

216*305 

— 


— 

weighted

109*748 

109-7635 

162*9113 

162-9065 

216*3105 

216-308 

266-785 

265*74 


means 


differences — 0*01os +0*0058 +0*0025 +0*045 

* Got by subtracting Q2−P3 from R1−P3 = 272·659.

† Got by subtracting R1−Q2 from 3d ³Π_a, R1−P3 = 272·659.
‡ Got by subtracting Q4−P5 from R3−P5 = 482·048.


3→3 BANDS


3d®P 

104-603 

— 

154-666 

— 

— 

— 

251-978 

— 

Zdm ^ 


104-486* 

— 

164-647 

164-677t 

— 

206-038t 

— 

252*07,§ 

3 dm , 

104*60e 

— 

— 

_ 

_ 

_ 

— 

3d®zla 

— 

104*488 

— 

164-678 

— 

205*97i 

— 

262-008 

3d>.l„ 

— 

— 

154-698 


2O6-O61 



— 

weighted 

104-504 

104*487 

154*682 

154*678 

2O6-O61 

206-005 

251*978 

262*043 


means 


differences +0-017 +0-004 +0-056 -O-O65 

* Got by subtracting Q2-P3 from P1-P3 ^ 259*164. 

t Got by subtracting P1—Q 2 from 3d®il^, P1 —P3 = 259*164. 
i Got by subtracting Q4-P5 from Zdm^^ P3-P5 = 458-048. 

§ Got by subtracting P3—Q4 from 3d^iT^, P3-P5 « 468*048. 



The fine structure of the lines of the hydrogen spectrum 




Table 4. Final symmetric (s) rotational terms 


0->0 BANDS 


initial state 

O'l-P'2 

ie'i-Q'2 

i2'2-e'3 Q'2^P'Z 


P'4-^'5 

0'4-P' 


— 

120-775 

— — 

— 

237-98 

— 

— 


120-63 

— 

— — 

— 

— 

— 

— 


— 

120-812 

— 179-913 

— 

— 

— 

— 


— 

— 

— — 

— 

— 

— 


U^A^ 

— 

120-762 

— 179-962 

— 

— 

— 

— 

weighted 

120-63 

120-783 

~ 179-937 

— 

237-93 

— 

— 

means 

differences 

- 

O-ls 

— 

— 



— 

Zs^E 


115-05 

1->1 BANDS 

-- 171-697 





Zd^E 

— 

115-003 

— — 

— 

— 

— 

— 

Zdma 

114-98 

— 

—, — 

— 

— 

— 

— 

Zd^n^ 

— 

— 

— 171-706* 

— 

— 

— 

— 

Zd^Aa 

—. 

— 

— — 

— 

— 

.— 

— 

Zd^A, 

— 

115-001 

— 171-701 

— 

— 

•— 


weighted 

114-98 

115-004 

— 171-703 

— 

— 

— 


means 

differences 

— 

0-024 

— 

— 



— 

Zd^E 


♦ 

109-42 

From 3d>iT„, E'l-P'3 aad R'\-Q'2. 

2->2 BANDS 

— 163-64t — 

215-096 



WUa 

109-42e 

— 

- - 

— 

— 

— 

— 

um. 

— 

— 

-- - 

— 

— 

— 

— 

Zd^A^ 

— 

— 

163-704 — 

216-04i 

— 

— 

— 

UU, 

— 

109-407 

— 163-68e 

— 

215-077 

— 

— 

weighted 

109-423 

109-407 

163-704 163-665 

216-04i 

216-077 


— 

means 

differences 

+ 0-01, 

. +0-033 

-0-03, 

- 



t From E'l-P'S aad Zd?A„ B'l-Q'2. 


3→3 BANDS


3d*i7 

um^ 

— 103-91 

155-88t — 

204-225§ -- 



~ 103-930 

165-82 — 

— 165-77 

204-165 — 

— 204*114 

261-073 — 

weighted 

— 103-930 

155-85 155-77 

204-195 204-II4 

251-073 — 

meaoB 





differeaces 

— 

+0-08 

+0-08^ 

— 


t From 3d®JT<^ iJ'2-P'4 and U’‘Aa. Q'S-F'4. 

§ From 3d»ni, P'2-P'4 and B'2-Q'3. 






E. W. Foster and Sir Owen Richardson 


Table 5. Compabison of dv/s ajstd Sv^‘s. 
(The new and old final rotational terms)


Q1-P2 111-Q2 B2-QZ Q%-PZ QZ-P4. RZ-Qi R4-Q5 Q4-PS 

v‘' = 0 


mean 

120*716 120*734 

180*046 180*0635 

237*688 237*705 

294*825 294*253 

differences 

-0*018 

-0*007s 

-0*01, 

+ 0*672 

mean 

120*732 120*760 

180*035 180*037 

237*695 237*653 

294*240 294*237 

differences 

-0*018 

-0*002 

+ 0*042 

+ 0*003 


mean 

115*161 116*1635 

171*335 171*336 

226*915 226*906 

279*782 279*766 

differences 

o 

o 

1—I 

-0*001 

+ 0*009 

+0*018 

mean dyjt 

115*110 116*140 

171*338 171*340 

226*918 226*90s 

279*768 279*845 

differences 

-0*030 

-0*002 

+ 0*003 

-0*077 

mean 

109*74s 109*763fi 

162*9113 162*9065 

f 

216*3^05 216*308 

266*785 266*74 

differences 

o 

© 

1 

+ 0*0063 

+ 0*0026 

+ 0*045 

mean Svji 

109-776 109-76* 

162*85 162*76 

216*295 216*32 

266*763 265*78 

differences 

-fO-Ol* 

+ 0*09 

— 0*026 

-0*0i7 

p" = S 

mean 

104*604 104-487 

154*683 164*678 

206*06i 206*005 

261*978 262*043 

differences 

+ 0*017 

+ 0*004 

+ 0*058 

- 0*065 

mean Svjg 

104*473 104*493 

164*690 164*710 

206*040 206*14 

252*005 251*975 

differences 

-0*020 

-0*020 

, +0*080 

+ 0*03o 


e'l-P'2 P'l-Q'2 

B'2-Q'Z Q'2-P'3 

P'3-.g'4 

P'4-e'6 §'4-P'6 

o 

ii 

mean 

120*63 120*783 

— 179*937 

— 237*98 

— — 

differences 

-0*l53 

— 

— 

— 

mean Svjt 

120*73 120*80 

179*906 179*905 

237*91 237*96 

293*76 293*69 

differences 

-0*07 

0*000 

-0*05 

+ 0*06 

mean Spjk 

114*98 116*004 

— 171*703 



differences 

-0*024 

— 

— 

*— 

mean Spji 

— 115*03 

171*6o 171*663 

226*38 226*338 

280*34 280*42o 

differences 

— 

6 

1 

+ 0*047 

0 

0 

1 


mean Spj^ 

109*423 109*407 

163*704 163*665 

216*04i 215*077 

267*56 267*51o 

differences 

+ 0*013 

+ 0-039 

-0*038 

+ 0-05o 

mean Spjr 

109*42 109*395 

163*70o 163*68o 

215*07 215*08 

267'56 267-52 * 

differences 

+ 0*025 

+ 0*02o 

0 ^ 

6 

1 

+ 0*04 

v^==3 

mean ^Pp 

— 103*930 

155*85 155*77 

204*195 204*114 

25 I-O 72 

differences 

—* 

+ 0*08 

+ 0*08i 

— 

mean ^Pp 

103*965 103*930 

166*815 166*73 

204*2i 204*238 

261*03 — 

differences 

+ 0*035 

+ 0*0g5 

-0-0^ 

— 



Table 6. Summary of Q(m) − P(m+1) and R(m) − Q(m+1).

a AND b DIFFERENCES TAKEN SEPARATELY


3d»i:, (31-P2 
3d*ir„ Q1-P2 
means 


3d»-S, P2-03 
P2-Q3 
means 


Q3-P4 


0-»0 BANDS 


a 

b 


a 

h 

0*715 

0*661 

3d»n„Bl-Q2 

0*755* 

0*721* 

0*735 

0*722 

3d»A„Bl-Q2 

0*771 

0*717 

0^725 

0^692 


0-763 

0-719 

+ 0*033 


+ 0*044 

0*020 

0*079 

Zdmt, Q2-P3 

0*033t 

0*038t 

0*037 

0*019 

3d^A„ Q2-PZ 

0*041 

0*036 

0-029 

0-049 

• 

0-037 

0-037 


0*020 


0*000 


0’68, 

0-62o 





+ 0’06o 

* From 3d*iTi, P1-P3 and 3dMi, Q2-P3. 

i From3ci:»iTi, Pl-P3and3d®di,Pl-(32. 


1→1 BANDS


3s»2:, Q1-P2 
3d»P, Q1-P2 
3d»i7„ (31-P2 
means 


3s*2', B2-QZ 
U^2,Bi-qZ 
3d»iTo, P2-Q3 
3d«do, B2-Q3 
means 


3«»i7, QS-Pi 
Um„ Q3-P4: 
Zd^A„ Q3-P4 
means 


0*178 

0*135 

0*191 

0*10x 

0*182 

0*144 

0-184 

0-134 

+ 0-060 

0*342 

0*310 

0*304 

0*368 

0*305 

0*357 

0*299 

— 

0-303\\ 

0-362\\ 


-0-069 


3d»iT<„Pl-Q2 
3dMtf, P1-Q2 


3d»J7«, e2-P3 
3d®d4, Q2-PS 

•\ 


0-204t 0-136 

0-198 0-146 

0-201 0-141 

+0-060 

0-318§ 0-311§ 

0-307 0-341 

0-312 0-326 

-0-009 


0-872 0-864 

+0-018 

P3. 


0-986 — 

0-949 — 

[0-913] [0-833] 3d^4, P3-Q4 

0-913 0-833 

+ O-O 3 

{ From 3d*!!* P1-P3 and3d*!7d* Q2- 
§ From 3d*!I*Pl-P3 and P1-G2. 

II Excluding 3s *P, P 2—Q 3. 


3d®Z', 01-P2 
3d*JT„181-P2 
means 


3d»P, P2-Q3 
3d»!T„P2-<33 
3d*^„ B2-Q3 
means 


2->2 BANDS 

0-738 0-764 3d ®!I* P1 - Q 2 

0-776? 0-768 3d®J*Pl —Q2 

0-7571 0-761 

-0-004 

0-896 0-902 3d»!7* Q2-P3 

0-901 d.i. 3d®d*Q2—P3 

0-890 0-91o 

0-896 0-90, 

-O-Olo 


0-806f 0-73711 

0-783 0-743 

0-794 0-740 

+ 0-064 

0-926** 0-891** 

0-882 0-903 

0-90t 0-897 

+ 0-00^ 


0-301 0-249 

+0-062 

^ From3d»!I* Pl-PSandmeaniZad* C2-P3. 
»* From3d»ir*Pl-P8and3d»d*Pl-Q2. 


3d»!T« Q3-P4 





Table 6 (continued) 
3→3 BANDS


3d»P, 01-P2 

a 

0*520 

6 

di. 

3d«iT* Pl-02 

means 

0*320 — 

3d»/(* Pl-02 

3d*P, P2-03 

0-679 

0-686 

3d»JT* 02-P3 

means 

0-679 

0-685 

3d*/l* 02-P3 


-0*006 



a b 

0-604tt 0-627tt 

0-612 0-496 

0-508 0-611 

-0-003 

0-669tJ 0-664JJ 

0-663 0-648 

0-661 0-656 

+ 0-005 


tt B'rom3d*Zra, 2Jl-P3aiid3i»4*Q2-P3. 

From 3d! “Hi, Pi—P3, and mean IJ^Ai,, R1 — Q2. 


Table 7. Summary of Q′(m) − P′(m+1) and R′(m) − Q′(m+1)

a AND b DIFFERENCES TAKEN SEPARATELY


0->0 BANDS 


■3d»P, P'1-0'2 

a 

0-748 

h 

0-791 

3dsil„ P'1-0'2 

?0-786 


3d»A„R'l~Q'2 

0-742 

0-773 

means 

0-745 

0-782 

3d*/l* 0'2-P'3 

0-968 

0-037 

3d»P, P'1-0'2 

0-980 

BANDS 

3d»17„ P'1-0'2 

0-972* 

0-046$ 

3d>.d„P'l-0'2 

0-980 

means 

0-977 

0-046 


-0*069 


a 


* 

t 


^om 3(i *i7„ P' 1 — P' 3 = 0-698 and 3d ^A„ Q'2 — P'B — 0-726 

FromM.il. s-l-p-8 = «.714»d mm: i S-eS 


b 


2->2 BANDS 


3d»P, R'I-Q'2 
3d»A„ R'I-Q'2 
means 


3d^A„ R'Z-Q'i 


0-398 [0-416] 

0-371 0-460 

0-384 \0-433) 

-[0-049] 


O-OOet 0-002 i 
+ 0-094? 


3d»17s, C'l-P'2 


3d*/?* R'2-Q’Z 


0-42^ [0-88^) 

+ [0-04o] 

0-716 0-686 

+ 0-030 


3d *17* P'1-0'2 
3d*/)„ P'1-0'2 
means 


3d»/f* 0'2-P'3 


3->3 BANDS 
0-872$ 0-893§ 

0-908 0-911 

0-890 0-902 

- 0-012 

0-778 0-748 

+0-030 


$ J’rom 3d®d7„ P'l —p'3 — 0.660 and 3d*/( 0 ' 2 — P'^ — n -rna 

§ From 3d*JI* P'l-P '3 = o-64i and 3d*/?”o'2-P'3 = 0-748. 




In each band the value is highest for (R1 − Q2)a − (Q1 − P2)a and falls steadily as we proceed through (R1 − Q2)b − (Q1 − P2)b and (Q2 − P3)a − (R2 − Q3)a to (Q2 − P3)b − (R2 − Q3)b. The average of all four values is highest for the 0→0 bands and lowest for the 3→3 bands, where it becomes negative. In the 0→0 bands three values are positive and one negative, in the 1→1 and 2→2 bands two are positive and two negative, and in the 3→3 bands three are negative and one unknown.

In the dashed transitions it is likely that something like this occurs with the sign reversed, but the data are too scanty to test it satisfactorily.

(7) The data are very restricted when one wishes to use them to combine the results for the dashed transitions with those for the undashed. There is, however, one dashed rotational difference which follows right through the vibrational sequence 0→0 to 3→3, namely, that of the R′1 − Q′2 values. If these are compared with the next mean R2 − Q3 values for the same v′, v″ the results shown in table 8 are obtained. This shows that, at least so far as this one available example is concerned, when the data are combined in this way: (a) the alternations with successive vibrational quantum numbers disappear, (b) the gap between the a and b differences is much reduced, and (c) it tends to disappear with increase in the vibrational quantum number.

Table 8

              0→0              1→1              2→2              3→3
              a      b         a      b         a      b         a      b
R′1 − Q′2    .745   .782      .977   .046      .384   [.433]    .890   .902
R2 − Q3      .029   .049      .303   .362      .896   .90       .679   .685
difference   0.716  0.733     0.674  0.684     0.488  [0.52]    0.211  0.217
b − a            0.017            0.010            [0.03]           0.006


The discrepant value from the 2→2 band may arise from the b value, for which the calculated R′1 − Q′2 difference and the observed R2 − Q3 are probably less exact than the others.
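The arithmetic behind table 8 can be checked with a short sketch. It assumes that only the decimal parts of the rotational differences are tabulated, so that each difference is taken modulo 1 cm⁻¹ (this appears to be how 0.046 − 0.362 yields 0.684). Values are held as integers in units of 0.001 cm⁻¹; the dictionary layout and the name `decimal_difference` are illustrative, and the 2→2 b entries are left out because they are bracketed or only partly legible.

```python
# Decimal parts in units of 0.001 cm^-1, as read from table 8.
# Keys 0..3 label the 0->0 ... 3->3 bands.
r1q2_dashed = {"a": {0: 745, 1: 977, 2: 384, 3: 890},
               "b": {0: 782, 1: 46, 3: 902}}
r2q3 = {"a": {0: 29, 1: 303, 2: 896, 3: 679},
        "b": {0: 49, 1: 362, 3: 685}}


def decimal_difference(x, y):
    """Difference of two decimal parts, taken modulo 1 (1000 units)."""
    return (x - y) % 1000


# (R'1 - Q'2) minus (R2 - Q3) for each component and band.
diff = {c: {v: decimal_difference(r1q2_dashed[c][v], r2q3[c][v])
            for v in r1q2_dashed[c]}
        for c in ("a", "b")}

# Gap between the b and a differences where both are available.
gap = {v: diff["b"][v] - diff["a"][v] for v in diff["b"]}

for v in sorted(diff["a"]):
    print(v, diff["a"][v], diff["b"].get(v), gap.get(v))
```

The gap between the a and b differences shrinks from 17 to 6 units between the 0→0 and 3→3 bands, which is the trend described under (b) and (c).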

(8) The value .079 of the decimal part of the 3d ³Σ, 0→0, R2 − Q3 b difference (table 6) seems high. It is just possible that the b component of the R2 line is split by a perturbation and that the unassigned structure (n) (wave number .052, intensity ratio 1: see table 3 of Paper I) is really part of it. In that case the value 0.079 would be reduced, possibly to as little as 0.043, depending on the exact value of the intensity ratios of the components (l), (m) and (n) of 16834.27 (10). If this interpretation of the pattern could be established, important results would follow.

Having completed what may be termed the empirical analysis of the data, we 
shall now proceed to attack the problem in another direction. 





3. The doublet separations

It is evident from tables 1 and 2 that the majority of the lines starting from 3s ³Σ and the 3d ³Σ, Π, Δ complex and ending on 2p ³Π are each split with the resolution of the echelon into two components a and b, a having the higher frequency and also the greater intensity. The evidence is complete enough to be convincing that this statement applies to all lines ending on 2p ³Π. Where only one component appears, this is always due either to the conditions not being sensitive enough to make the weaker component visible, or to the components not being clearly separated with our resolution: since the doublet separation is different for different lines, in a relatively small number of cases the doublets appear as single lines of abnormal width.

The largest doublet separation we have observed is about 0.22 cm⁻¹, the smallest about one-third of this amount.

From an observational point of view, the lines (and doublets) divide themselves into two groups according to the properties of the components. We have called these the regular and irregular group respectively. There is no very sharp rule of demarcation for every line, but most of them can be picked out by inspection. The regular group has a considerably greater average doublet separation. There are several well-authenticated examples with Δν not much under 0.22 cm⁻¹, very few have less than about 0.14, and it is doubtful if the measures for any of these are to be relied on. In the irregular group there are no reliable Δν's greater than 0.18, but values as low as 0.06 and 0.08 seem well secured. The two groups also differentiate themselves by the values of the intensities of the components. In the regular group, the estimated values of I(a/b) practically all lie within the limits 3.0 and 1.9, most of them close to 2.4. The four values outside the limits 3.0 and 1.9 are probably all doubtful. The range of I(a/b) in the irregular group is much greater. There are several well-secured values at or close to 6.0 and 0.8 and a few cases, not very secure, of values outside these limits.

It will be shown below that the division into the regular and irregular group can be related to the symmetry properties of the states 2p ³Π.

Before going further, it will be convenient to have before us a table of all the splittings and intensity ratios which have been observed. These, followed by our estimated weights for Δν and I(a/b), together with the classified transitions and ν₀ values, are assembled in table 9.

The following discussion is mainly a first-order treatment which neglects any fine structure that may exist in the upper states 3s ³Σ and 3d ³Σ, Π, Δ, and attributes all the effects so far observed to the fine-structure components of the lower states 2p ³Π. This mode of treatment is justified as a first approximation by the following considerations:

(1) Almost all the lines which so far have been found to have any directly perceptible fine structure are lines which end on 2p ³Π. The number of others is no greater than might be expected to arise from unidentified blends.





Table 9. Splittings and intensity ratios for 3s ³Σ and 3d ³Σ, Π, Δ complex → 2p ³Π

0′→0″ REGULAR


line 

Ql 


-OcQl 

A^, JK1 



Av (weight) 

I(alh) (weight) 

Vq 

0*194 (6) 
2*5 (3) 
16854*86 

0*216 (4) 
2*5 (2) 
17199*28 

?0*185 (1) 
?2*0 (1) 
17212*01 

0*213 (3) 
2*6 (2) 
17672*31 



Line 

dsp, C'2 

i7c, ^'2 

«'2 




Av (weight) 
l(ajh) (weight) 

0*189 (5) 
2*96 (6) 
16764*12 

0*18 (0) 
2*5 (0) 
17248*55 

0*160 (3) 
2*5 (1) 
17467*13 




line 

QZ 



n„Qz 

^i,pz 

A>Q^ 

Av (weight) 

J(o/6) (weight) 
vq 

0*173 (6) 
2*6 (2) 
16654*24 

0*177 (3) 
2*5 (2) 
16898*48 

0*160 (4) 
2*6 (1) 
17199*28 

0*151 (3) 
2*5 (2) 
17261*34 

0-144: (3) 
2-5 (1) 
17271-56 

0-216 (2) 
2-6 (1) 
17637-02 


no = 4 levels 


line TTg, Q6 

Av (weight) ’ ?0*15 (0) 

J(a/6) (weight) ?2 (1) 

Vo 17262*66 

0′→0″ IRREGULAR


line 

d^Z, R'l 


n^R'i 






Av (weight) 
J(a/6) (weight) 

Vg 

0*146 (1) 
4*5 (0) 
16884*94 

no*2o (1) 

?5 (0) 
17125*07 

>5(0) 

17369-32 

0*129 (2) 
4*0 (1) 
17588*00 





line 

d^Z, P2 

d^Z,E2 

n„P2 

i/e,P2 





Av (weight) 
I(a/6) (weight) 
vg 

0*140 (1) 
?2*0 (0) 
16734*20 

0*114 (3) 
4*0 (3) 
16834*27 

0-172 (3) 
4-0 (3) 
17091-29 

0*169 (2) 
4*0 (0)* 
17441*32 

0*149 (3) 
?4*6 (1) 
17451*59 




line 

d^Z, R'Z 

iJc. *P'3 

A. P'S 






Av (weight) 
/(a/6) (weight) 
vq 

0*12 (1) 
4*0 (0) 
16774*97 

?0*10 (1) 
?1*0 (0) 
17068*60 

?0-16 (0) 

17287-27 






line 

^d^Z, P4 

n,,F4: 

4itf.e4 






Av (weight) 
/(a/6) (weight) 

0*06 (3) 
1*0 (3) 
16710*63 

0*09 (1) 
?1*0 (1) 
17023*67 

?0-08 (1) 
?l-5 (1) 
17627-44 

0*08 (3) 
?0*60 (2) 
17299*30 






* This might be ^2*5; 

1 if g (Part I, table 3) is 

of negligible intensity. 





1′→1″ REGULAR





line 

s^Z,Ql 

d^Z, Ql 

iTd,i^l 

n„Qi 

Aa,Bl 




Av (weight) 
/(a/6) (weight) 

0*20 (1) 
?2*5 (1) 
16917*50 

0*196 (5) 
2*5 (2) 
16603*20 

0*205 (6) 
2*6 (2) 
16978*52 

0*220 (2) 
?2*0 (0) 
16988*72 

0*190 (6) 
1*9 (5) 
17426*25 




line 

s^Z, Q'2 

d^Z, Q'2 

JJd, B'2 

A,, Q'2 





Av (weight) 
/(a/6) (weight) 

?0*162 (1) 
?3*0 (0) 
16916*57 

0*179 (6) 
2*5 (1) 
16515*45 

0*229 (0) 
2*5 (0) 
16988*72 

0*161 (4) 
2*5 (2) 
17316*56 





line 

s^Z, QZ 

d^Z,QZ 

IIa,BZ 

iTd, P3 

JIoQ^ 

A^ FZ 

A^f P3 

4»„<23 

Av (weight) 
/{a/6) (weight) 

0*18 (1) 
?2*5 (1) 
16916*33 

0*182 (2) 
2*5 (2) 
16414*37 

?0*21 (0) 
3*0 (0) 
16990*71 

0*146 (4) 
2*6 (1) 
16692*00 

0*170 (4) 
2*5 (2) 
17047*15 

0*172 (2) 
?2-5 (0) 
17139*77 

?0*162(1) 
?3*0 (0) 
17591*22 

0-200 (2) 
2-6 (1) 
17367-64 


no = 4 lines 
line P6 

Av (weight) ?0*08 (1) 

J{a/6) (wei^t) Il-O (0) 

Vo 17084-66 





Table 9 {continued) 


line 

d»2.R'l 

Hi, Q'l 

Av (weight) 

— 

_ 

J(a/ 6 ) (weight) 

> 6-0 ( 2 ) 

— 

Vq 

16630-47 

16899*76 

line 

«»i7, B2 

8^2, P 2 

Av (weight) 

70-226 ( 0 ) 

0-160 (3) 

J(a/ 6 ) (weight) 

73-0 (0) 

6*0 (4) 


17086-67 

16802*37 

line 

d^Z, R'S 

He. P'a 

Av (weight) 

0-12 ( 1 ) 

0*166 ( 3 ) 

Hajb) (weight) 

3-6 ( 1 ) 

3*6 (2) 


16630-23 

16862-22 

line 

d»Z, Bi 

H,, P4: 

Av (weight) 

0-100 ( 3 ) 

? 0*12 ( 1 ) 

lia/h) (weight) 

0-8 ( 6 ) 

? 1*0 ( 0 ) 


16469-93 

16820*29 


line 

Av (weight) 
J(a/ 6 ) (weight) 

d»Z, Q 1 
0-167 ( 1 ) 
73-0 (0) 
16297-26 

ni,Ri 
0-189 (3) 
72-6 ( 0 ) 
16730-66 

line 

Av (weight) 
I(a/ 6 ) (weight) 

d»Z, Q'2 
0-168 (3) 
2-6 ( 2 ) 
16222-72 

Hi, P'2 
0-11 ( 0 ) 
3-0 (1) 
16638-98 

line 

Av (weight) 
I(< 2 /Z») (weight) 

Vg 

d^Z, QZ 
0-166 ( 3 ) 
2-5 ( 2 ) 
16132-09 

Hi,PZ 
0-164 (5) 
2-6 ( 2 ) 
16468-03 

line 

Av (weight) 
Hajb) (weight) 
vg 

dcQ'4, 
0-118 ( 1 ) 
2-0 ( 0 ) 
17218-60 


line 

Av (weight) 

1 ( 0 / 6 ) (weight) 

Vg 

0-15 ( 0 ) 

16332-11 

Hi. Q'l 
0-15 ( 0 ) 
-t:4(0) 
16648-40 

line 

Av (weight) 

J(o/ 6 ) (weight) 

% 

d^Z, P 2 
0-173 (1) 
74-0 (0) 
16187-50 

d^Z, B2 
0-160 ( 1 ) 
74-0 (0) 
16296-02 

line 

Av (weight) 

/(a/ 6 ) (weight) 

Vg 

d* Q'Z 
70-108 ( 1 ) 
73-0 ( 0 ) 
17184-46 

A, S'Z 
70-212 (0) 
73-0 (0) 
17433-60 

line 

Av (weight) 

/(a/ 6 ) (weight) 

Vg 

d^Z. P4 
0-121 ( 2 ) 
1-6 ( 1 ) 
16192-26 

H„P4, 
0-118 (3) 
0-9 (2) 
16617-89 


1′→1″ IRREGULAR






0*15 (0) 
6*0 ( 0 ) 
17138*90 




R2 

P2 

n,,p2 


0*118 (3) 
5*0 (4) 
16584*72 

0*106 ( 2 ) 
3*0 ( 2 ) 
16488*15 

0*182 ( 3 ) 
5*0 (3) 
16873*46 

0*118 ( 2 ) 
2 ( 0 ) 
17218*50 

Ae. P'3 




0*103 (2) 
4*0 ( 2 ) 
17144*93 




04 




0*134 (3) 
? 1*0 ( 0 ) 
17364*30 




2′→2″ REGULAR



H^Ql 

di.Bl 



70-124 (1) 
73-0 (0) 
16684-88 

0-186 ( 6 ) 
2-6 ( 2 ) 
17279-01 



^o» Q'^ 

A* B'2 



0-176 (3) 
3-0 ( 2 ) 
17171-27 

0-138 ( 2 ) 
72-6 ( 1 ) 
17348-12 



Hi, P3 
0-164 (3) 
1-6 ( 2 ) 
16763-44 

Hg, Q 3 
0-170 ( 1 ) 
72-6 (0) 
16734-20 

0*166 ( 3 ) 
?2*5 ( 1 ) 
17006*43 

Ae,QZ 
0*150 (4) 
2*5 (2) 
17192*31 


2′→2″ IRREGULAR


H„B'1 
70-17 (1) 
72-6 ( 0 ) 
16822-00 

d„B'l 
0-096 (2) 
76-0 ( 6 ) 
17280-68 



H„P2 
0-167 ( 1 ) 
4-0 ( 2 ) 
16576-12 

H„B2 
0-16 ( 0 ) 
>4-0 (0) 
16897-19 

di, Q2 
0-146 (2) 
6-0 ( 1 ) 
17169-22 

4 . Rs 
0-130 (1) 
4-0 ( 1 ) 
17365-14 

4.-P4 




> 0-6 ( 0 ) 
16976-03 





0-138 (2) 
? 6-0 ( 1 ) 
17311*08 


1762M9< 


0-140 (1) 
? 2-0 ( 1 ) 
16934-41 






Table 9 {continued) 






REGULAR 



line 


na,Rl 


A^yRl 



Av (weight) 

0-166 (3) 

0-124 (1) 

0-211 ( 2 ) 

0-182 (6) 



liajh) (weight) 

2-6 (1) 

2-6 (1) 

2-5 (0) 

2-5 (2) 



vq 

16931-94 

16445-80 

16388*32 

17129*89 



line 


As,B'2 

A^y Q'2 




Av (weight) 

0*174 (1) ??0-136 (1) 

0-179 (1) 




J(a/6) (weight) 

?l-0 (1) 

??2-5 (0) 

?2-5 (0) 




Vq 

16466-65 

17186-00 

17026-36 




line 


HayPZ 

27^, 223 


A^, QZ 


Av (weight) 

0-141 (2) 

0-132 (2) 

0-169 (1) 

0-161 (3) 

0-160 (2) 


I{ajh) (weight) 

?2-0 (1) 

?3-0 (0) 

2-5 (0) 

2-0 (1) 

2-6 (0) 


Vg 

16796-05 

16186-63 

16476-10 

16870-70 

17033-21 


line 


A,y Q'4: 




d^HyQb 

Av (weight) 

?0-110 (0) 

?0-128 (2) 




0-114 (1) 

I(alb) (weight) 

^4-0 (0) 

?2-0 (0) 




?3-0 (1) 

Vg 

16825-98 

17040-75 




15610-91 


IRREGULAR

line JIc,R'l JS'l 

Av (weight) 0-150 (1) 0-176 (1) 

I{alb) {vreight) J3-0 (0) 6-0 (>0) 

Vs 16611-46 [17130-30] 

iae d^S,M2 d»S. P2 /I* <3 2 

Av (weight) 0-136 (2) 0-16 (0) 0-166 (1) 

J(<»/6) (weight) 1-6 (2) 4-0 (0) ?6-0 (0) 

Vg 16949-72 16827-48 17026-41 

line n„ P'S A„ P'S 

Av (weight) 0-141 (1) — 

I(a/h) (weight) 4-0 (0) — 

v& 16261-79 16870-70 

line Aa,Q4: 

Av (weight) 0-167 (1) 

Jiajh) (weight) 1-0 (0) 

Vg [17033-66] 


(2) Tke doublet separations are roughly the same for aU lines going down to all 

levels of having the same v" and K''. The relative intensities of the two 

components are also characteristic of the same variables to the same Mnd of approxi¬ 
mation. 

(3) The same statements can be made regarding ^,11 lines going down to the same 
and of 2p The values for the 2p levels are, however, not the same as 

for the 2^ levels, but the values are characteristic for the respeciave levels. 

(4) We have some evidence, to be described in subsequent papers, of the existence of fine structure in states of H₂ other than 2p ³Π. In none of the cases so far examined is it, however, on a scale exceeding one-tenth of that of the fine structure in 2p ³Π.




In this connexion it is as well to remember that evidence of the absence of fine 
structure is often more convincing than evidence of its existence. 

In view of ( 4 ) we shall, however, pay some regard to the possibility of fine structure 
in the upper levels where this can be done fairly simply and in a reasonable compass. 

4. The two groups

While the division into a ‘regular’ and an ‘irregular’ group has an observational 
basis, this does not, as we have pointed out, sharply define the contents of the groups. 
To do this, it is necessary to* consider the transitions from Zs^2 and the complex 
U^2, 3 d to and 23 ) 317^. A diagram of these is shown 

in figure 1 (so far as this discussion is concerned the state labelled d ^2 will do equally 
wen for both Bd^2 and Bs^2). Both Bd^2 and Z$^2 are even {2g) states; 

Bd^n^ and Bd^A^ also have this symmetry. Bd^IIa and 3^3^^ are odd (u) 

states. The transitions which go down to 2p are limited to those which satisfy 
the selection rules and AK = 0 or u-^g and AK. = ± 1, those to 2 p 3 / 7 ^ satisfy 

u-^u and d* = 0 or g^u and AK = ±l. Omitting the prefix 3d3, the lines which 
go down.to the three lowest rotational levels of 2p^n^ are 2Ql, 

A^Rl, 2Q% n^Q% naP% M% A,Q% A^B% 2QB, n^QB, H^PB, BB, A^QB and 
A^PZ, BB, whilst those going down to 2^^!!^ are 2P'l, B'l, n^Q'l, A^B'l, 

2P2, R2, noP2, B2, n^Q2, A^B2, AaQ2,2P% B% H.P’B, B% n^Q'B, A^P'B, B'B 
and dj Q'B. It will be seen from table 9 that the transitions down to 2p are just 
those which form the regular group, and the transitions down to 2p the irregular 
group. 



Figure 1. The rotational structure of the 3d ³Σ, Π, Δ and 2p ³Π levels and the transitions between them.














To the extent that the separation of the upper levels can be considered negligible 
all the measured separations will be due to the lower final levels. Within this limita¬ 
tion the properties of the regular group are those of 2p and of the irregular group 
they are those of 2p^n^, 

It will be seen from table 9 that the separations Δν are fairly constant for the regular doublets. In the 0→0, 1→1 and 2→2 bands, however, there is evidence of a tendency of Δν to fall slowly as K″ increases from 0 to 3 and this is followed by a sharp drop at K″ about 4 or 5. In the 3→3 bands this tendency of Δν to change with K″ seems less marked. For the irregular doublets the Δν's suggest similar tendencies but the average separations are much smaller. The reduction of Δν is mainly at the low K″ end where the values are highest.

In the regular doublets the intensity ratio I(a/b) shows little systematic variation from about 2.5, but here again there seems to be a suggestion of a slow fall from K″ = 1 to 3 followed by a sharper drop at K″ = 4 for v″ = 0 to 2, with the 3→3 band behaving differently. In the irregular doublets I(a/b) falls from about 5.0 to about 1.0 as K″ increases from 1 to 4 at each value of v″.

There is some difficulty in assessing the true trend of the real values by inspection of a compilation like table 9, as there is a very great variation in the reliability of the different determinations. This is shown by the fact that the weights, both of the Δν's and of the I(a/b), cover the whole possible range (0) to (6) assignable.

We have endeavoured to eliminate this by forming a weighted mean of each determination of the same quantity by weighting them in the proportion of the weight assigned in table 9 and giving the result a weight equal to the sum of the original weights. The final weights will give some sort of basis for comparing the weight of evidence behind all the data. The result of this is given in table 10. We will leave the numbers in this table to speak for themselves.
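The weighted averaging just described can be sketched as follows. The four Δν values and weights are the 0′→0″ regular K″ = 1 entries of table 9 as read here: 0.194 (6), 0.216 (4), ?0.185 (1) and 0.213 (3). The function name `weighted_mean` and its `shift` argument, which imitates the raising or lowering of every weight by 1 mentioned in the note to table 10, are illustrative assumptions and not a record of the authors' own computation.

```python
def weighted_mean(values, weights, shift=0):
    """Return (mean, total_weight) with every weight changed by `shift`.

    Observations whose shifted weight is not positive are dropped.
    """
    pairs = [(v, w + shift) for v, w in zip(values, weights) if w + shift > 0]
    total = sum(w for _, w in pairs)
    if total == 0:
        return None, 0
    mean = sum(v * w for v, w in pairs) / total
    return mean, total


# Doublet separations (cm^-1) and weights for K'' = 1, v'' = 0, regular group.
dnu = [0.194, 0.216, 0.185, 0.213]
wts = [6, 4, 1, 3]

for shift in (+1, 0, -1):
    mean, total = weighted_mean(dnu, wts, shift)
    print(f"shift {shift:+d}: mean = {mean:.3f} cm^-1, weight = {total}")
```

With the normal weights this gives 0.204 with total weight 14, and with every weight lowered by 1 it gives 0.204 with total weight 10, in agreement with the corresponding entries of table 10.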

5. Influence of the upper states on the doublet separation

If there were no fine structure in the lines, the energies of the initial and final states could be expressed respectively by $F'(v', K')$ and $F''(v'', K'')$, where $v'$, $K'$ and $v''$, $K''$ are the vibrational and rotational quantum numbers of the levels. As we have seen, the final states at least are double, so that each $F''(v'', K'')$ has two values which we may denote by $F''_f(v''K'')$ and $F''_e(v''K'')$, $F''_e < F''_f$. This will double the number of lines, each single line now being replaced by a doublet with separation $F''_f - F''_e$.

Confining discussion to the simplest case likely to be detectable with the resolution at present known, suppose the upper levels are also double with energies expressed by $F'_e(v'K')$ and $F'_f(v'K')$, $F'_f > F'_e$. Each of the previous doublets will now become another doublet, the whole forming a quadruplet whose components have frequencies, omitting the $v$'s and $K$'s for brevity, (1) $F'_f - F''_e$, (2) $F'_e - F''_e$, (3) $F'_f - F''_f$, (4) $F'_e - F''_f$ in the order of $\nu$ decreasing. With the present resolution, this quadruplet will appear as a doublet, and the mean frequencies of the two components will depend on the transition probabilities between the upper and lower fine-structure levels e and f.





Table 10. Mean splittings and intensity ratios
for transitions to 2p ³Π



0 

1 

2 

REGULAR

3 » 0 

1 

2 

3 

mean in* 
tensity 

1 

{IS) 

^ splittings 

0-200 (25) 0-181 (14) 

0-176 (16) 

2-4 (12) 

intensity ratios 

2-2 (15) 2-3 (6) 

2-5 (8) 

ratio 

2-3, 


0-204 (14) 

0-199 (20) 

0-182 (10) 

0-178 (12) 

2*4, (8) 

2-2 (10) 

2-5 (2) 

2-5 (4) 

2*3, 


0-204: (10) 

0-199 (15) 

0-186 (6) 

0-181 (8) 

2-5 (4) 

2-1 (6) 

2-5 {!) 

2-5 (1) 

2-3, 

2 

0-178 (11) 

0-174 (14) 

0-158 (12) 

0-163 (6) 

2-8^ (10) 

2-5^ (7) 

2-h (10) 

1-75 (4) 

2-60 


0-178 (8) 

0-170 (10) 

0-163 (8) 

0-163 (3) 

2*8a (7) 

2-5 (3) 

2-75 (6) 

1-0 (1) 

2-6, ■ 


0 -m (6) 

0-171 (7) 

0-165 (5) 

— 

2-95 (5) 

2-5 (1) 

Z-S (1) 

— 

2-3, 

3 

0-168 (27) 

0-171 (24) 

0-158 (25) 

0-149 (15) 

2-5 (15) 

2-5^ (15) 

2-3 (IS) 

2-0 (7) 

2-3, 


0-168 (21) 

0-169 (16) 

0-157 (19) 

0-149 (10) 

2-6 (9) 

2-5 (7) 

2-28 (9) 

2-0 (2) 

2-3, 


0-167 (15) 

0-167 (9) 

0-157 (13) 

0-147 (5) 

2-5 (3) 

2-5 (2) 

2-25 (d) 

— 

2'rf() 

4 

— 

— 

0-118 (2) 

0-121 (5) 

— 

— 

2-0 (1) 

3-0 (2) 

2-3, 


— 

— 

0-118 (1) 

0-122 (3) 

— 

— 

— 

— 

— 




— 

0-128 (1) 

— 

—* 

— 

— 

— 

5 

0-15 (1) 

0-08 (2) 

0-140 (2) 

0-114 (2) 

2-0 (2) 

1-0 (1) 

2-0 (2) 

3-0 (2) 

2-2, 


— 

0-08 (1) 

0-140 (1) 

0-114 (1) 

2*0 (1) 

— 

2-0 (1) 

3-0 (1) 

2-33 


IRREGULAR


1 

0-15^ 

(7) 

0-15 ( 1 ) 

0-134 

(7) 

0-163 

(4) 

4-5 (5) >5-0 

(4) 

5-3 

( 8 ) 

4-0 

(2) 

4-9 


0-15 

(4) 

— 

0-121 

(3) 

0-163 

(2) 

4-2, (2) >6-3 

(2) 

6-0 

(6) 



5-4 


0-129 

(1) 

— 

0-096 

(1) 

— 


— 

>6-0 

(1) 

6-0 

(4) 


— 

5-8 

2 

0-149 (17) 

0-141 (22) 

0-154 (12) 

0-148 

( 6 ) 

3-9i ( 12 ) 

4-6 (21) 

4-2 

(9) 

2-2 

(5) 

4-1 


0-149 (12) 

0-138 (15) 

0-153 

(6) 

0-145 

(3) 

4-0, (7) 

4-8 (14) 

4-2, 

(4) 

1*5 

(2) 



0-148 

(7) 

0-140 (9) 

0-145 

( 1 ) 

0-135 

( 1 ) 

4-0 (4) 

4-8 

(9) 

4-0 

( 1 ) 

1-5 

( 1 ) 

4-3 

3 

0-12 

(S) 

0-135 (9) 

0-143 

(3) 

0-141 

(2) 

2-5 ( 2 ) 

3-7 

( 8 ) 

3-0 

(2) 

4-0 

( 1 ) 

3-4 ^ 


0-11 

(2) 

0-137 (6) 

0-108 

(1) 

0-141 

(1) 

— 

3-7 

(6) 

— 


• - 


3-7 . 


— 


0-145 (3) 

— 


— 


— 


(2) 


- 

- 

- 

3-7, 

4 

0-07^ 

(J2) 

0-117 (10) 

0-119 

(7) 

0-167 

(2) 

7-0, ( 11 ) 

0 - 8 ^ 

( 8 ) 

0-9^ 

(S) 

1-0 

( 1 ) 

0-9, 


0-074 

(8) 

0-117 (7) 

0-119 

(6) 

0-167 

(1) 

0-9, (7) 

0-8 

(5) 

1-1 

(3) 

- 


0-94 


0'07^ 

(4) 

0-117 (4) 

0-119 

(3) 

— 


0-8, (3) 

0-8 

(4) 

0-9 

( 1 ) 

- 

- 

0 - 84 , 

For each K″ and v″ three values of splitting and three values of intensity ratio are given. The middle values are obtained when the normal weightings for these quantities, given in table 9, are used. The top values are those obtained when the normal weights are increased by 1 and the bottom values those obtained when the normal weights are decreased by 1. Thus, the top values are weighted means of all observations good and bad, whilst the bottom values include the best observations only, i.e. those with weight ≥ 2. These changes in the method of averaging have little effect on the final result.


Table 11 


regular levels irregular levels 



3d»JT4, 


3dM^, 

3d»d^, 

3dm„ 

3d®J„ 


V' V" 

i21,P3 

P1,P3 

P'2, P'4 

P3, P5 

R', P'3 

P'1, P'3 

P2, P4 

0 ' 0^ 

+ 0-039 

— 

— 

— 

— 

0*031 

+ 0-079 

1 / 

+ 0*059 

+0-018 

— 

? +0-072 

-0-016 

— 

?-0-002 

2' 2^ 

+ 0-035 

+ 0-021 

— 

— 

— 

— 

+ 0-032 

3' r 

0-008 

+ 0-031 

?+0*026 

— 

+ 0-009 

— 

— 





If the transitions are $e \to f$, $e \to e$ and $f \to e$, $f \to f$, the transitions of each pair having equal probability, the observed doublet will have components of frequencies

$$\tfrac{1}{2}(F'_f - F''_e + F'_e - F''_e) \quad\text{and}\quad \tfrac{1}{2}(F'_f - F''_f + F'_e - F''_f)$$

and the doublet width will still be $F''_f - F''_e$ (a). If, however, the transitions are $f \rightleftarrows e$, the component frequencies are $F'_f - F''_e$ and $F'_e - F''_f$ and the doublet width $F''_f - F''_e + F'_f - F'_e$ (b); if they are $e \to e$ and $f \to f$, the component frequencies are $F'_f - F''_f$ and $F'_e - F''_e$ and the doublet width $F''_f - F''_e - (F'_f - F'_e)$ (c). In general the doublet width observed can have any value between (b) the sum of the separations due to the energy differences of the initial and final levels and (c) the difference between these separations, according to the probability distribution of the transitions $e \rightleftarrows f$ and $e \to e$, $f \to f$.
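The three limiting cases (a), (b) and (c) can be reproduced with a minimal sketch. It assumes that the unresolved quadruplet is seen as two components, each being the intensity-weighted centroid of the two lines that end on the same lower fine-structure level; this grouping, the function name `doublet_width` and the numerical separations (0.20 and 0.02 cm⁻¹) are hypothetical choices for illustration only.

```python
def doublet_width(lower_sep, upper_sep, weights):
    """Width of the unresolved doublet formed by a quadruplet.

    lower_sep = F''_f - F''_e and upper_sep = F'_f - F'_e (both positive).
    weights maps (upper_label, lower_label) to a relative line strength.
    """
    upper = {"e": 0.0, "f": upper_sep}   # energies relative to F'_e
    lower = {"e": 0.0, "f": lower_sep}   # energies relative to F''_e

    def centroid(lower_label):
        # Intensity-weighted mean frequency of the lines ending on one lower level.
        lines = [(upper[u] - lower[l], w)
                 for (u, l), w in weights.items()
                 if l == lower_label and w > 0]
        total = sum(w for _, w in lines)
        return sum(nu * w for nu, w in lines) / total

    # Lines ending on the lower level e lie at the higher frequency.
    return centroid("e") - centroid("f")


# Case (a): all four transitions equally probable.
equal = {("e", "e"): 1, ("e", "f"): 1, ("f", "e"): 1, ("f", "f"): 1}
# Case (b): only the crossed transitions f -> e and e -> f.
crossed = {("f", "e"): 1, ("e", "f"): 1}
# Case (c): only the parallel transitions e -> e and f -> f.
parallel = {("e", "e"): 1, ("f", "f"): 1}

for name, w in (("a", equal), ("b", crossed), ("c", parallel)):
    print(name, round(doublet_width(0.20, 0.02, w), 3))
```

Intermediate strength ratios give widths between the sum and the difference of the two separations, which is the range stated in the text.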

The lesson of this for present purposes is that if one wishes to get beyond a first approximation in the treatment of the energies of the fine-structure levels of 2p ³Π, the effects of the fine structure of the upper states must be eliminated, as these may, possibly, be comparable with the changes in the 2p ³Π states which are under investigation.


6. The variation of Δν with K″

The magnitude of the doublet width of any of the lines under consideration can be put in the form $F''_f - F''_e + a(F'_f - F'_e)$, where $a$ is a function of the transition probabilities from the fine-structure levels of the upper to those of the lower state; $a$ may be either positive or negative. We assume that it is independent, to a sufficient approximation, of the difference of the quantum numbers of the final states $F''$ to which the expression is applied. Otherwise, the following argument requires reconsideration.

In this notation the doublet width of any $R(K)$ line, for example, may be written

$$\Delta\nu R(K) = F''_f(v'', K) - F''_e(v'', K) + a\,(F'_f(v', K+1) - F'_e(v', K+1)),$$

then

$$\Delta\nu P(K+2) = F''_f(v'', K+2) - F''_e(v'', K+2) + a\,(F'_f(v', K+1) - F'_e(v', K+1))$$

and

$$\Delta\nu R(K) - \Delta\nu P(K+2) = F''_f(v'', K) - F''_e(v'', K) - (F''_f(v'', K+2) - F''_e(v'', K+2))$$

is the change in the width of fine-structure levels of the final state at $v = v''$ as $K''$ changes to $K'' - 2$. There is not a great wealth of material to apply this to. What there is is set out in table 11. The regular R1 − P3 values, which are almost complete, show a steady falling off of the separation as v″ increases from 0 to 3, the mean values, for what should be the same quantities, being: at 0→0, +0.039; at 1→1, +0.0385; at 2→2, +0.028; and at 3→3, +0.0115. The values (one each) of R′2 − P′4 and R3 − P5 show that the R(K) − P(K+2) differences increase with increasing K″.



The figures for the irregular levels point to a much greater separation of the 
fine-structure levels at R′2P′4 than at R′1P′3, and suggest that the increase of separation 
with increasing K" extends to these levels, that is to say to also. 

The data are not very different from what would be obtained if the energy of the 
levels giving rise to the fine structure consisted of two parts, one proportional to 
the rotational energy of the molecule and the other independent of it. The rotational 
energy will be roughly proportional to K². If this were exact, the energy differences 
for R1, P3 : R′2, P′4 : R3, P5 would be as 2:3:4. For 3′, 3″, R1P3 = +0.0115, 
which would give a computed value for 3′, 3″, R′2P′4 of +0.0173 as against the 
observed ? +0.026, and from 1′, 1″, R1, P3 = +0.0385 we calculate 

1′, 1″, R3 − P5 = +0.077 

as against ? +0.072 observed. These agreements are as close as the experimental 
results would justify expectation, and the data would hardly warrant an attempt to 
refine the theory further. Frequency differences such as those of R1 − P3 eliminate 
the effects of differences in the energies of the fine-structure levels which do not 
depend on K and so disclose nothing about their magnitude. 
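
The arithmetic of this comparison can be checked with a short sketch. It assumes that the rotational energy varies as K², which is the scaling that reproduces the 2:3:4 ratio quoted above; the function names are illustrative and form no part of the original analysis.

```python
def k_squared_step(lower_k, upper_k):
    """Difference of K**2 between two lower-state rotational levels."""
    return upper_k**2 - lower_k**2

# Lower-state K values joined by R(K) and P(K+2): (1, 3), (2, 4), (3, 5).
steps = [k_squared_step(k, k + 2) for k in (1, 2, 3)]

def rescale(observed, from_k, to_k):
    """Scale an observed R(from_k) - P(from_k + 2) difference to another K,
    assuming the K-dependent part of the level separation varies as K**2."""
    return observed * k_squared_step(to_k, to_k + 2) / k_squared_step(from_k, from_k + 2)

predicted_3_3 = rescale(0.0115, 1, 2)  # 3', 3'' band: R'2 - P'4 from R1 - P3
predicted_1_1 = rescale(0.0385, 1, 3)  # 1', 1'' band: R3 - P5 from R1 - P3

print(steps, predicted_3_3, predicted_1_1)
```

The steps come out as 8, 12, 16, that is 2:3:4, and the rescaled values agree with the computed figures +0.0173 and +0.077 given in the text.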

Something may be said about this if one turns back to the doublet separations Δν 
and forgets about their differences for the time being. Actually these are greatest 
for the smallest K values (table 10) and diminish as K increases. If the part of the 
energy difference of the fine-structure levels of the upper level which is independent 
of K is denoted by a′ and the corresponding quantity for the lower level by a″, b′(K′) 
and b″(K″) similarly denoting the parts which depend on K, then 

$$\Delta\nu(R(K)) = a' + b'(K+1) + a'' + b''(K),$$

$$\Delta\nu(P(K)) = a' + b'(K-1) + a'' + b''(K),$$

$$\Delta\nu(R(K)) - \Delta\nu(P(K)) = b'(K+1) - b'(K-1),$$

$$\Delta\nu(R(K-1)) = a' + b'(K) + a'' + b''(K-1),$$

$$\Delta\nu(P(K+1)) = a' + b'(K) + a'' + b''(K+1),$$

$$\Delta\nu(R(K-1)) - \Delta\nu(P(K+1)) = b''(K-1) - b''(K+1).$$

An examination of the data shows that the b′(K + 1) − b′(K − 1) are mostly negative, 
i.e. b′(K) > b′(K + 2), and the b″(K − 1) − b″(K + 1) are mostly positive, i.e. 
b″(K) > b″(K + 2). This at any rate seems to be what happens at v′, v″ = 0 or 1 where 
the data are most plentiful. At higher v the diminution due to the increasing 
numerical value of the b term seems to be counteracted by something else, and in the 
irregular group there is no clear sign of any falling off of the doublet separation 
with increasing K in the 3→3 bands. 

It thus appears that the change with K for the upper levels is on the whole in the 
same sense as that for the ³Π levels. We hope to deal with this more fully in a 
later paper. 

If the irregular levels of the 3→3 band be excluded, then it is found that the total 
doublet separation is greatest at the lowest K″ level (K″ = 1) and then diminishes 
as K″ increases, slowly at first and then more rapidly. The part a = a′ + a″ of the 
doublet width which is independent of K is the extrapolated width at K″ = 0, and 
is thus a little greater in magnitude than the width at K″ = 1. At K″ = 0, a′ + a″ 
is the whole width and at K″ = 1 it is about 95 % of the whole. There is evidence 
that a′ + b′(1) does not exceed about 5 % of the whole, so that a″ is almost certainly 
within a few per cent of the average width at K″ = 1 in table 10. Since the numerical 
values of both b″(K) and b′(K) increase as K increases and the total width diminishes 
as K increases, it follows that b″(K) (and b′(K)) have the opposite sign to a″. It 
appears also that at jS^ = 5, b"{E) is about ^a". 

Comparison with the helium band spectrum 

The only band spectrum to exhibit a fine structure similar to that of H₂ is the 
spectrum of He₂. This He₂ fine structure was discovered by Monk & Mulliken in 1929. 
The resemblance between the two is very close indeed. The most obvious differences 
are due to the fact that in the He₂ spectrum the alternate dashed lines in the H₂ 
spectrum are missing owing to the zero value of the spin of the helium nucleus. 
Thus in He₂ the number of band lines is half the number in H₂ for the same K range, 
and there is nothing to disclose the behaviour of the dashed group of lines. The 
observations and measurements for He₂ were all made on 0→0 bands. 

As in H₂ the only lines in He₂ for which the doublets are observed as clearly 
resolved are those which have 2p ³Π as the lower state. These fall into two groups, 
denoted by A and B, corresponding to our regular and irregular groups. In all 
cases the component of higher frequency is the more intense. For the A (regular) 
group the average doublet distance at v″ = 0 falls from 0.36 at K = 1 through 0.27 
at K = 5 to 0.22 at K = 9. This corresponds to 0.204 at K″ = 1 through 0.168 at 
K″ = 3 to about 0.16 at K″ = 5 for H₂ at v″ = 0 (table 10). The value for He₂ starts by 
being nearly twice that for H₂ at K″ = 1, and they both fall off in about the same 
relative proportions as K increases. For these doublets, Mulliken & Monk (1929) say 
the intensity ratio was about the same, about 2:1 to 3:1 as estimated visually. 
Our mean for H₂ at v″ = 0 is 2.45 (table 10). For the B (irregular) levels in He₂ only 
the lowest (K″ = 2) levels were resolved, the higher K″ = 4, 6 and 8 levels appearing 
as symmetrically broadened lines. At the K″ = 2 level the mean Δν for He₂ was 
found to be 0.21 cm.⁻¹ Our mean values for H₂ at v″ = 0 fell from 0.15 at K″ = 1 to 
0.074 at K″ = 4 (table 10) so that the splitting in H₂ is probably closely parallel to 
that in He₂ on about half the scale. The rapid contraction of the doublets at v″ = 0 
is not observed in H₂ at higher v″. For He₂ the intensity ratio of the resolved K″ = 2 
doublets is given as from 3:1 to 6:1. Our measures of the intensity ratio of the 
corresponding doublet for H₂ at v″ = 0 give the weighted mean value 4.07. 

Mulliken & Monk give a valuable discussion of the theory of these fine structures 
which is particularly interesting where it concerns the intensities of the components 
(Monk & Mulliken 1929; Mulliken & Monk 1929). According to this, which is based 
on a theory of Kramers (1929), the energy differences depend entirely on the 
interaction energy of the spins s₁ and s₂ of the two outer electrons, which in the triplet 
states combine to form the spin quantum number of the molecule S = s₁ + s₂ = 1. 
For each K the associated S can take the values 0, ±1, giving three close triplet levels 
with J = K, K ± 1. Since the weights of the states are 2J + 1 the intensities of 
the components will be in the ratio 2K + 3 : 2K + 1 : 2K − 1. In states, according 
to Kramers’ theory, the states with J = K ± 1 have almost identical energy, and 
this is attributed by Mulliken & Monk to the 2 ³Π (A or regular) states. They are 
supposed to be so close together as to be completely unresolved and to be lower 
than the J = K level, thus giving the observed doublet. The ratio of the intensities of 
the combined lower levels to the upper level is (2K + 3 + 2K − 1)/(2K + 1) = 2 for 
all K. For the 2 ³Π (B or irregular) levels which are resolved into doublets the J = K, 
K + 1 sublevels are so close as to be unresolved and the J = K − 1 level lies above 
them. For He₂ the ratio of the intensities of the components of the resolved doublets, 
all of which have K″ = 2, should therefore be (7 + 5) ÷ 3 = 4. All these intensity 
predictions are in agreement with Monk & Mulliken’s observations on the He₂ lines 
within the limits of accuracy they seem to claim. 
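
The intensity predictions quoted above follow from the statistical weights 2J + 1 alone, and a minimal sketch makes the bookkeeping explicit. It assumes only the grouping of sublevels described in the text (J = K ± 1 unresolved for the A group; J = K, K + 1 unresolved for the B group); the function names are illustrative.

```python
from fractions import Fraction

def weight(j):
    """Statistical weight 2J + 1 of a sublevel."""
    return 2 * j + 1

def regular_ratio(k):
    """A (regular) group: J = K + 1 and J = K - 1 unresolved, compared with J = K."""
    return Fraction(weight(k + 1) + weight(k - 1), weight(k))

def irregular_ratio(k):
    """B (irregular) group: J = K + 1 and J = K unresolved, compared with J = K - 1."""
    return Fraction(weight(k + 1) + weight(k), weight(k - 1))

for k in (1, 2, 3, 4):
    print(k, regular_ratio(k), irregular_ratio(k))
```

On these assumptions the A ratio is 2 for every K, while the B ratio is 4 at K = 2 and 8 at K = 1.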

Remembering the very close similarity between the fine-structure results of the 
He₂ and H₂ spectra and in view of the striking successes of these intensity relations 
in He₂ one is tempted to carry the theory over bodily from He₂ to H₂. At first sight 
this seems to be crowned with success, for our H₂ intensities make I(a)/I(b) for the 
regular group almost independent of all the variables and closer to 2 than to 3, and 
the value of I(a)/I(b) for the irregular group at K″ = 2 is 4.2₂ when averaged over 
v″ for 0-3 and actually 4.07 at v″ = 0 against the predicted 4. However, a closer 
inspection does not place the matter in quite such a rosy light. 

In the first place, our value of I(a/b) for the A group, about 2.4, exceeds 2 by 
more than we should have expected. It is true that there are 
great difficulties in measuring these intensities and their ratios exactly. It may be 
that our methods are exaggerating the I(a/b) ratio in favour of the stronger 
component. 

A more serious difficulty arises at the B levels. If the ratio is 4 at the K = 2 level, 
on the same basis at the K = 1 level it should be (2K + 3 + 2K + 1)/(2K − 1) = 8. 
Our highest single measured photometric value for the irregular levels is 6.1. This 
is for the line 16802.37 (10(?)) 3s ³Σ → 2p ³Π, 1→1, P′2. This particular line is difficult 
to measure accurately on our plates owing to a very heavy background. Our final 
assessment of I(a/b) for this line, taking into account another (lower) photometric 
measure and some visual estimates, is 4 to 6 (4); the weight (4) means that a value 
between 4 and 6 is fairly certain, it does not mean that 6 is more likely than 4, but 
it does imply that the most probable value is close to 5. For the B levels at K″ = 1 
we have no photomicrometric measures. These are difficult owing to the weakness 
of the b components of the composite lines none of which have much intensity. Of 
the eleven eye estimates for B'l, Q'l, P'l fines of the 3s^I'-^2p^JT and the 
®/r, systems the three highest have /(a/6) ^ 5 (0), > 6 (0) and ?6 (6). None 
of the other eight exceed 5 and the weighted mean of all the eleven we put at 5.4. 
A case could be made out for its being closer to 5 even than 5.4. 




It seems very improbable that I(a/b) for the R′1, Q′1, P′1 lines can be as high as 8. 
It is true that there is not a photometric confirmation at this level, but this method 
has been checked against visual estimation for a number of undashed lines ending 
on each of the levels K″ = 1, 2 and 4 and on the dashed lines ending on K″ = 2. 
There is no drastic change in the conditions which would warrant any expectation 
of an error approaching a factor of 2 in changing from these to the dashed lines 
R′1, Q′1, P′1. 

This co-ordination of the results, both for He₂ and H₂, could be made consistent 
by increasing all the J values by ½. This would not affect the calculated value 2 of 
I(a/b) for the A levels, since the ratio 

$$\{2(K+1+\tfrac12)+1+2(K-1+\tfrac12)+1\}/\{2(K+\tfrac12)+1\}$$

is equal to 2 whatever quantity may be added to K or J. On the other hand, the 
calculated I(a/b) for the B levels, 

$$\{2(K+1+\tfrac12)+1+2(K+\tfrac12)+1\}/\{2(K-1+\tfrac12)+1\},$$

becomes, for each K, 

K                 1      2      3      4 

I(a/b) calc.      5      3.5    3      2.75 

I(a/b) exp. H₂    5.4    4.2₂   3.7    0.9₂ 

At all K except K = 4, the observed values for H₂ are all a little higher than the 
calculated ratios just as was the case for the A values. The sharp drop to about 1.0 at 
K = 4, which seems to be real enough, may be due to a relative reorientation of the 
sublevels similar to that postulated by Monk & Mulliken to account for the 
contraction of the Δν’s at higher K″ values in He₂ at v″ = 0. This also sets in at K″ > 4. 
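
The effect of adding ½ to every J can be verified with the same statistical weights. The sketch below is illustrative only: it assumes weights 2(J + ½) + 1 and the B-group pairing of sublevels described earlier, and the names are hypothetical.

```python
from fractions import Fraction

HALF = Fraction(1, 2)

def shifted_weight(j, shift=HALF):
    """Statistical weight 2(J + shift) + 1 after adding `shift` to every J."""
    return 2 * (j + shift) + 1

def regular_ratio_shifted(k, shift=HALF):
    """A group: (J = K + 1) + (J = K - 1) against J = K."""
    return (shifted_weight(k + 1, shift) + shifted_weight(k - 1, shift)) / shifted_weight(k, shift)

def irregular_ratio_shifted(k, shift=HALF):
    """B group: (J = K + 1) + (J = K) against J = K - 1."""
    return (shifted_weight(k + 1, shift) + shifted_weight(k, shift)) / shifted_weight(k - 1, shift)

calculated = {k: float(irregular_ratio_shifted(k)) for k in (1, 2, 3, 4)}
print(calculated)
```

With the half-integral shift the B ratio reduces to 2 + 3/K, and the A ratio stays at 2 for any shift, as the text states.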

No doubt the close similarity of both spectra means that the explanation of their 
fine structure is fundamentally the same for both He₂ and H₂. For each, a 
considerable body of facts has been assembled but not enough to cover all the possibilities. 
Further consideration of these seems to be premature and we shall postpone it for 
the present. An opportunity to return to it may perhaps occur in later papers in 
which we hope to consider the possibility of fine structure in levels of H₂ states other 
than 2p ³Π. 

We wish to express our thanks to the Department of Scientific and Industrial 
Research and to the Royal Society for the provision of financial aid which enabled 
this work to be carried out. 


References 

Gale, H. G., Monk, G. S. & Lee, K. O. 1928 Astrophys. J. 67, 89. 

Kramers, H. A. 1929 Z. Phys. 53, 422. 

Kapuscinsky, W. & Eymers, J. G. 1929 Proc. Roy. Soc. A, 132, 58. 

Monk, G. S. & Mulliken, R. S. 1929 Nature, 124, 91. 

Mulliken, R. S. & Monk, G. S. 1929 Phys. Rev. 34, 1630. 

Richardson, O. W. 1929 Trans. Faraday Soc. 25, 691. 

Richardson, O. W. 1930 Proc. Roy. Soc. A, 126, 601. 

Richardson, O. W. 1934 Molecular hydrogen and its spectrum. New Haven: Yale University 
Press; London: Oxford University Press. 

Richardson, O. W. & Davidson, P. M. 1931 Proc. Roy. Soc. A, 131, 658. 

Richardson, O. W. & Davidson, P. M. 1933 Proc. Roy. Soc. A, 140, 25. 

Richardson, O. W., Davidson, P. M., Marsden, J. & Evans, W. M. 1933 Proc. Roy. Soc. A, 
142, 40. 

Richardson, O. W. & Rymer, T. B. 1934a Proc. Roy. Soc. A, 147, 24. 

Richardson, O. W. & Rymer, T. B. 1934b Proc. Roy. Soc. A, 147, 251. 

Richardson, O. W. & Williams, W. E. 1931 Nature, 127, 481. 

Sandeman, A. 1932 Proc. Roy. Soc. A, 138, 395. 


Radiative equilibrium in the ionosphere 

By R. V. D. R. Woolley 

{Communicated by S. Chapman, F.R.S.—Received 20 April 1946) 

The equations of radiative equilibrium in the earth’s atmosphere are examined with special 
reference to high temperatures in the ionosphere arising out of a radiation balance between 
heavy absorption in the ultra-violet and relatively weak infra-red emission. It is 
suggested that while molecular oxygen is the principal ultra-violet absorption agent at 
heights below 250 km., at greater heights the absorption is mainly due to atomic oxygen. 
Similarly, it is suggested that while water vapour is the principal infra-red radiator at a 
height of 100 km., at much greater heights water vapour is absent and the temperature is 
effectively controlled by negative ions at a height of 250 km. and perhaps by dust at much 
greater heights. 

The gravitational equilibrium of a dissociating oxygen atmosphere is discussed, reference 
being made to the effects of collisions and of absorption of the dissociating radiation. 

1. The generally accepted theory that the upper layers of the earth’s atmosphere 
are in radiative equilibrium was put forward by Gold (1909). Gold’s theory was 
discussed by Milne (1922) in a paper to which detailed reference will be made later. 
In the same year Lindemann & Dobson (1922) produced observational evidence 
to show that the temperature increased considerably above the stratosphere. 
Gowan (1928) worked out a heat balance between absorption of ultra-violet sunlight 
by ozone and emission of temperature radiation by water vapour, and showed that 
temperatures of order 300° C were to be expected. In a later paper, Gowan (1930) 
gave temperatures between 250 and 600° C at a height of 80 km., depending on the 
relative amounts of these two constituents of the atmosphere. A few years later it 
was suggested by various authors that at greater heights the temperature attained 
much greater values. Thus Angenheister (1932) studied the heights of polar aurorae in 
summer and winter and concluded that if the winter temperature from 100 to 200 km. 
was 300° C, the summer temperature was 1200° C. Appleton & Naismith (1935) 
made a similar deduction from seasonal variations in the critical frequency of the 
region. Martyn & Pulley (1936) found that relative collision frequencies in the 
F₁ and F₂ regions could only be explained by a temperature of order 1000° C in the 
atmosphere between these regions, and put forward the hypothesis that this high 
temperature persisted all the year round. Shortly afterwards Godfrey & Price (1937) 
found theoretical temperatures of order 1000° C by working out a heat balance 
between infra-red water vapour emission and ultra-violet absorption by molecular 
oxygen (as well as ozone). These authors followed Gowan’s method, which considers 
a single layer, without working out the temperature as a function of height from 
the equations of transfer of radiation, as was done by Milne. 

The present paper develops equations for the temperature as a function of the 
height in a form suitable for distinguishing between the various molecular processes 
of re-emission of light following upon absorption. It is pointed out that dissociation 
of oxygen affects the heat balance profoundly, and expressions for the dissociation 
of molecular oxygen as a function of height are worked out. The paper is concerned 
with heights above the ozone layer and a discussion of the formation of ozone is 
explicitly avoided. 

2. In any atmosphere, let $I_\nu(\theta)\,d\nu\,d\omega/4\pi$ be the flow of radiation per sq.cm./sec. 
in frequencies between $\nu$ and $\nu + d\nu$ in directions falling within a cone of solid angle 
$d\omega$ inclined at an angle $\theta$ to the vertical, which is the axis of $x$ taken positive upwards. 
Consider the passage of this radiation through a cylinder of unit cross-section and 
of length $ds = dx\sec\theta$. The radiation will lose by absorption the amount 

$$k_\nu I_\nu(\theta)\,\rho\,ds\,d\nu\,d\omega/4\pi,$$

where $k_\nu$ is the coefficient of absorption per unit mass of the atmosphere. The energy 
which is removed in this way may be merely scattered, or it may be converted into 
kinetic energy of the particles of the gas, or it may be emitted in frequencies outside 
the range from $\nu$ to $\nu + d\nu$ by fluorescence. Let the fraction $\xi_\nu$ of the energy absorbed 
be scattered, the fraction $\eta_\nu$ be converted into kinetic energy, and let the fraction 
$\zeta_\nu$ be transferred to other frequencies, where $\xi_\nu + \eta_\nu + \zeta_\nu = 1$. 

Now examine the emission within the cone $d\omega$ in frequencies within the range $\nu$ 
to $\nu + d\nu$ by the cylinder. If the scattered radiation is isotropic, the amount scattered 
into $d\omega$ is 

$$\xi_\nu k_\nu\,\rho\,ds\,d\nu\,d\omega/4\pi \times J_\nu,$$

where $J_\nu$ is the average value of $I_\nu(\theta)$ over a sphere, or $4\pi J_\nu = \int I_\nu(\theta)\,d\omega$. The 
absorption $\eta_\nu k_\nu$ is balanced by thermal emission. The balance is exact in thermodynamical 
equilibrium, and the amount emitted depends only on the kinetic energy of the 
molecules of the gas. If the molecules have a Maxwellian velocity distribution 
appropriate to some temperature $T$, then the emission is 

$$\eta_\nu k_\nu\,\rho\,ds\,d\nu\,d\omega/4\pi \times B(\nu, T),$$

where $B(\nu, T)$ is the intensity of equilibrium (black-body) radiation, 

$$B(\nu, T) = 8\pi h\nu^3 c^{-2}\,(e^{h\nu/\mathbf{k}T} - 1)^{-1}.$$

The symbols $h$, $\mathbf{k}$ stand for Planck’s constant and Boltzmann’s constant. 



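Finally, there will be some emission into the frequency range $\nu$ to $\nu + d\nu$ by 
fluorescent transfer of energy from other frequencies. Let this be 

$$\zeta_\nu k_\nu\,\rho\,ds\,d\nu\,d\omega/4\pi \times E_\nu.$$

Summing up the gains and losses to the flow $I_\nu(\theta)\,d\nu\,d\omega/4\pi$ on account of its passage 
through the cylinder, then 

$$\cos\theta\,\frac{dI_\nu}{k_\nu\rho\,dx} = -I_\nu(\theta) + \xi_\nu J_\nu + \eta_\nu B(\nu, T) + \zeta_\nu E_\nu. \tag{2·1}$$

Now define two further quantities $H_\nu$ and $K_\nu$ by 

$$4\pi H_\nu = \int I_\nu(\theta)\cos\theta\,d\omega, \qquad 4\pi K_\nu = \int I_\nu(\theta)\cos^2\theta\,d\omega,$$

both integrals being taken over a sphere. $H_\nu$ is the flux of radiation in the direction 
$x$ and $K_\nu$ may be thought of as the potential of this flux. Then averaging equation 
(2·1) over a sphere, 

$$\frac{dH_\nu}{k_\nu\rho\,dx} = -J_\nu(1 - \xi_\nu) + \eta_\nu B(\nu, T) + \zeta_\nu E_\nu. \tag{2·2}$$

Multiplying equation (2·2) by $k_\nu\,d\nu$ and integrating, then 

$$\frac{dH}{\rho\,dx} = -\int (1 - \xi_\nu)k_\nu J_\nu\,d\nu + \int \eta_\nu k_\nu B(\nu, T)\,d\nu + \int \zeta_\nu k_\nu E_\nu\,d\nu,$$

writing $H$ for $\int H_\nu\,d\nu$. If there are no sources or sinks of energy in the atmosphere, the 
net flux $H$ is constant (although $H_\nu$ is not so), and hence 

$$\int (1 - \xi_\nu)k_\nu J_\nu\,d\nu = \int \eta_\nu k_\nu B(\nu, T)\,d\nu + \int \zeta_\nu k_\nu E_\nu\,d\nu.$$

Since fluorescence only transfers energy from one frequency to another 

$$\int \zeta_\nu k_\nu E_\nu\,d\nu = \int \zeta_\nu k_\nu J_\nu\,d\nu.$$

Hence, when there are no sources or sinks, 

$$\int \eta_\nu k_\nu J_\nu\,d\nu = \int \eta_\nu k_\nu B(\nu, T)\,d\nu. \tag{2·3}$$

This is the equation of radiative equilibrium which determines the temperature $T$ 
of the atmosphere as judged by the kinetic energy of the particles. In the remainder 
of this paper it will be assumed that the velocities of the particles are distributed 
according to Maxwell’s law, and the parameter $T$ in that law will be referred to as 
the temperature of the atmosphere. 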



Multiplying equation (2·1) by $\cos\theta$ and integrating over a sphere, then 

$$\frac{dK_\nu}{k_\nu\rho\,dx} = -H_\nu. \tag{2·4}$$


(It naay throw some hght on the nature and importance of the quantity Z = j 

r dK 

to remark that V^Z = —h^p div Z + I p div H dv, so that if there are no sources 

or sinks of energy and divZ = 0, then V®Z = 0. In most applications, however, 
there is either plane or spherical symmetry and the equation reduces to simpler 
forms.) 

Advance has been made in the treatment of radiative equilibrium in the solar 
atmosphere by assuming a relation between $K_\nu$ and $J_\nu$ based on some simple 
assumption about the distribution of $I_\nu(\theta)$ with $\theta$. For example, if $I_\nu(\theta) = I_1$ when 
$0 < \theta < \tfrac12\pi$, and if $I_\nu(\theta) = I_2$ when $\tfrac12\pi < \theta < \pi$, then $K_\nu = \tfrac13 J_\nu$. Again, if 
$I_\nu(\theta) = a + b\cos\theta$, once more $K_\nu = \tfrac13 J_\nu$. The solution of the simplest astrophysical 
problem is as follows. Consider an atmosphere in which there are no absorption lines, 
so that $\xi_\nu = 0$, and ignore fluorescence, so that $\zeta_\nu = 0$. Then $\eta_\nu = 1$. Further, suppose 
that $k_\nu$ is independent of $\nu$ and equal to $k$, and $d\tau = -k\rho\,dx$. Then using $K = \tfrac13 J$ 
it follows that 

$$J = 3H\tau + \text{constant},$$

the boundary condition being no inflow of radiation at the surface of the star, or 
$\int I_\nu(\theta)\,d\omega$ over a hemisphere equal to zero where $\tau = 0$. This gives $J = 2H$ at 
$\tau = 0$ according to one approximation ($I_\nu(\theta)$ constant over a hemisphere) and $J = \tfrac32 H$ 
according to the other ($I_\nu(\theta) = a + b\cos\theta$). Adopting the former, then 

$$J = H(2 + 3\tau).$$

This gives the temperature, since $J = B(T) = acT^4$ and 

$$T^4 = T_0^4\,(1 + \tfrac32\tau).$$

This equation is due to Milne. It receives support from the observed darkening of 
the solar limb. 
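
The grey-atmosphere result above lends itself to a small numerical illustration. The sketch assumes the relations J = H(2 + 3τ) and T⁴ = T₀⁴(1 + 3τ/2), with T₀ the temperature at τ = 0; the function names and the sample boundary temperature are illustrative only.

```python
def mean_intensity_over_flux(tau):
    """J / H for the grey atmosphere with the boundary condition J = 2H at tau = 0."""
    return 2.0 + 3.0 * tau

def milne_temperature(tau, t0):
    """Temperature at optical depth tau, given the boundary temperature t0."""
    return t0 * (1.0 + 1.5 * tau) ** 0.25

# Illustrative boundary temperature only; it is not a value taken from the paper.
for tau in (0.0, 2.0 / 3.0, 2.0):
    print(tau, mean_intensity_over_flux(tau), milne_temperature(tau, 200.0))
```

At τ = 2/3 the fourth power of the temperature is exactly twice its boundary value, which shows how slowly the temperature rises with optical depth in this approximation.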

The approximation $K_\nu = \tfrac13 J_\nu$ has had further successes in accounting for the main 
features of absorption lines, and further mathematical approximation has shown 
that in some cases at any rate the errors involved in assuming that $K_\nu = \tfrac13 J_\nu$ are 
small. 

3. It may be asked whether this approximation is of any value in dealing with 
the earth’s atmosphere. In this case it is convenient to assume that the flow of 
radiation has the intensity $I'_\nu(\theta)$ in all directions except in a cone of solid angle $4\pi\omega$ 
(the cone subtended by the sun’s disk) in which the intensity is $I'_\nu(\pi - z) + S_\nu$, where 
$z$ is the sun’s zenith distance. Write 

$$4\pi J'_\nu = \int I'_\nu(\theta)\,d\omega, \qquad 4\pi H'_\nu = \int I'_\nu(\theta)\cos\theta\,d\omega, \qquad 4\pi K'_\nu = \int I'_\nu(\theta)\cos^2\theta\,d\omega,$$

all the integrals being taken over a sphere. Then 

$$J_\nu = J'_\nu + \omega S_\nu, \qquad H_\nu = H'_\nu - \omega S_\nu\cos z, \qquad K_\nu = K'_\nu + \omega S_\nu\cos^2 z.$$

The radiation outside the special cone may be supposed evenly distributed according 
to some simple law such as $I'_\nu(\theta) = a + b\cos\theta$, so that it may be supposed that 
$K'_\nu = \tfrac13 J'_\nu$. It is at once clear that in the special case $\sec^2 z = 3$, $K_\nu = \tfrac13 J_\nu$ also. It is 
also clear that $K_\nu = \cos^2 z\,J_\nu$ will be a better approximation than $K_\nu = \tfrac13 J_\nu$ if direct 
sunlight preponderates over scattered sunlight. 

In what follows the intensity $I_\nu(\theta)$ will be used, defined by 

$I_\nu(\theta) = I'_\nu(\theta)$ outside the special cone, 

$I_\nu(\theta) = I'_\nu(\pi - z) + S_\nu$ inside the special cone $4\pi\omega$. 

A distinction of this kind was made by Milne (1922). 

By definition, $4\pi\omega$ is the solid angle subtended by the solar disk. According to 
Auwers the mean solar diameter is 31′ 59″, which gives $\omega = 5.410 \times 10^{-6}$. Let $p$ be 
the radius of the sun, and let $R$ be the radius of the earth’s orbit. Let $R_\nu$ be the 
emission per sec./sq.cm. of the sun’s surface. The total emission from the sun is 
then $4\pi p^2 R_\nu$. Since all this radiation passes through a sphere of radius $R$ the amount 
which passes through 1 sq.cm. normal to the sun at the earth’s mean distance per sec. 
is $(p^2/R^2)\,R_\nu$. But this is $\omega S^0_\nu$, where $S^0_\nu$ is the value of $S_\nu$ at the top of the earth’s 
atmosphere. Again $4\pi\omega = \pi p^2/R^2$, so that 

$$\tfrac14 S^0_\nu = R_\nu.$$
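
The numerical value of ω can be recovered from the angular diameter alone. The sketch below assumes a mean solar diameter of 31′ 59″, a flat-disk (small-angle) solid angle π × (angular radius)², and the definition that this solid angle equals 4πω; the function name is illustrative.

```python
import math

def dilution_factor(arcmin, arcsec):
    """Return omega, where 4*pi*omega is the solid angle of a disk with the
    given angular diameter (small-angle approximation)."""
    diameter_rad = math.radians((arcmin + arcsec / 60.0) / 60.0)
    radius_rad = diameter_rad / 2.0
    solid_angle = math.pi * radius_rad**2
    return solid_angle / (4.0 * math.pi)

omega = dilution_factor(31, 59)
print(f"omega = {omega:.4e}")
```

On these assumptions ω comes out close to 5.41 × 10⁻⁶, in agreement with the value adopted in the text.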

The value of $R_\nu$ is not known in the far ultra-violet regions of the spectrum in 
which present interest lies. In the visible spectrum and in the infra-red, however, 
it is known that the distribution in frequency of solar radiation is close to that of a 
black body at 6850° (Plaskett), while the effective temperature is known, more 
reliably, to be 5740°. If a colour temperature of 6826° be assumed ($= 5740° \times 2^{1/4}$), 
then 

$$R_\nu = 2\pi h\nu^3 c^{-2}\,(e^{h\nu/\mathbf{k}T'} - 1)^{-1},$$

where $T' = 6826°$. This gives $\int R_\nu\,d\nu = \tfrac14 acT'^4 = \tfrac12 acT_e^4$ with $T_e = 5740$. Now 

$$S^0_\nu = 8\pi h\nu^3 c^{-2}\,(e^{\beta\nu} - 1)^{-1},$$

and since for ultra-violet radiation $e^{\beta\nu}$ is much greater than 1, then 

$$S^0_\nu = 8\pi h\nu^3 c^{-2}\,e^{-\beta\nu}, \tag{3·1}$$

where $\beta = 6.991 \times 10^{-15}$ and $\log\beta = \overline{15}.8446$. This value of $\beta$ is applicable to the 
visible spectrum. As it is likely that the solar absorption coefficient increases with 
increasing frequency, it is probable that in the far ultra-violet the solar radiation 
is less than is given by (3·1) with $T' = 6826°$. In particular, it is very likely that 
there is a drop in the continuous spectrum at the head of the Lyman series at 912 A. 
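
The constants used here can be checked approximately. The sketch assumes that the colour temperature is 5740° × 2^(1/4) and that β stands for Planck’s constant divided by the product of Boltzmann’s constant and the colour temperature. It uses present-day values of the constants (an assumption; the paper’s 1946 values differ slightly), so agreement with 6.991 × 10⁻¹⁵ is expected only to within about one per cent.

```python
import math

PLANCK_ERG_S = 6.62607015e-27       # present-day value, erg sec
BOLTZMANN_ERG_PER_K = 1.380649e-16  # present-day value, erg per degree

def colour_temperature(t_eff):
    """Colour temperature taken as t_eff * 2**(1/4)."""
    return t_eff * 2.0 ** 0.25

def beta(temperature):
    """h / (k T): coefficient of the frequency in the Wien exponent."""
    return PLANCK_ERG_S / (BOLTZMANN_ERG_PER_K * temperature)

t_colour = colour_temperature(5740.0)
beta_modern = beta(t_colour)

# Characteristic and mantissa of the common logarithm of the quoted beta.
log_beta_quoted = math.log10(6.991e-15)
characteristic = math.floor(log_beta_quoted)
mantissa = log_beta_quoted - characteristic

print(t_colour, beta_modern, characteristic, round(mantissa, 4))
```

The logarithm of the quoted β has characteristic −15 and a mantissa near 0.8445, which is the barred form of the logarithm written in the text.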



4. In this section an atmosphere is considered in which there is neither scattering 
nor fluorescence, that is to say, $\xi_\nu = \zeta_\nu = 0$ and $\eta_\nu = 1$. The case in which $k_\nu$ is 
independent of $\nu$ was treated by Milne. It has no application to the earth’s upper atmosphere 
in which $k_\nu$ has large values in the infra-red and ultra-violet and is practically zero 
in the visible spectrum. Now pass on to a mathematical case which, though far too 
crude to give an accurate representation of the actual atmosphere, illustrates several 
points, and in particular the reversal of the temperature gradient. Let $k_\nu = k_1$ 
from $\nu_1$ to $\nu_1 + \Delta\nu_1$ (infra-red frequencies), and let $k_\nu = k_2$ from $\nu_2$ to $\nu_2 + \Delta\nu_2$ 
(ultra-violet frequencies), and let $k_\nu$ be zero for all other frequencies. Then from (2·1) 

$$\cos\theta\,\frac{dI_2}{\rho\,dx} = -k_2 I_2 + k_2 B(\nu_2, T).$$

In the earth’s atmosphere $T$ will be so small that $B(\nu_2, T)$ may be neglected in 
comparison with $J_2$ (in the ultra-violet frequencies). In this case 

$$\frac{dH_2}{\rho\,dx} = -k_2 J_2, \qquad \frac{dK_2}{\rho\,dx} = -k_2 H_2.$$

The ultra-violet light is almost entirely direct sunlight so that $K_2 = \cos^2 z\,J_2$, and 
the equations give 

$$\cos^2 z\,\frac{d^2 J_2}{(\rho\,dx)^2} = k_2^2 J_2,$$

or 

$$J_2 = J_2^0 \exp\Bigl\{-k_2\sec z\int_x^\infty \rho\,dx\Bigr\},$$

which is ordinary exponential absorption. Now in radiative equilibrium with no 
sources or sinks of energy it follows from (2·3) that 

$$k_1\Delta\nu_1\{J_1 - B(\nu_1, T)\} + k_2\Delta\nu_2 J_2 = 0.$$

Write $\Delta\nu_2 = \lambda\Delta\nu_1$ and $k_2 = \alpha k_1$ and let $d\tau = -k_1\rho\,dx$. Then 

$$\frac{dH_1}{d\tau} = J_1 - B(\nu_1, T) = -\alpha\lambda J_2 = -\alpha\lambda J_2^0\,e^{-\alpha\tau\sec z},$$

$$H_1 = \mathcal{A} + \lambda\cos z\,J_2^0\,e^{-\alpha\tau\sec z}.$$

Notice that $H = H_1\Delta\nu_1 + H_2\Delta\nu_2$ is constant, as it must be if there are no sources or 
sinks. 

In the infra-red, temperature radiation is more important than direct sunlight 
and $K_1 = \tfrac13 J_1$. Then 

$$dJ_1/d\tau = 3H_1 = 3\mathcal{A} + 3\lambda\cos z\,J_2^0\,e^{-\alpha\tau\sec z},$$

which gives 

$$J_1 = J_1^0 + 3\mathcal{A}\tau + 3\alpha^{-1}\lambda\cos^2 z\,J_2^0\,(1 - e^{-\alpha\tau\sec z}).$$

But $B(\nu_1, T) = J_1 + \alpha\lambda J_2$. Hence 

$$B(\nu_1, T) = J_1^0 + 3\mathcal{A}\tau + 3\alpha^{-1}\lambda\cos^2 z\,J_2^0 + J_2^0\,e^{-\alpha\tau\sec z}(\alpha\lambda - 3\alpha^{-1}\lambda\cos^2 z),$$

$$\frac{dB(\nu_1, T)}{d\tau} = 3\mathcal{A} + \lambda\sec z\,J_2^0\,e^{-\alpha\tau\sec z}(3\cos^2 z - \alpha^2).$$



Now at the lower boundary of the radiative region (the tropopause), which is taken 
as $\tau = \tau_1$, infra-red radiation is given out by the earth so that $H_1$ has a positive value, 
say $\mathcal{H}$. Then 

$$\mathcal{H} = \mathcal{A} + \lambda\cos z\,J_2^0\,e^{-\alpha\tau_1\sec z};$$

it may be supposed that no ultra-violet radiation reaches the lower boundary and 
$J_2^0\,e^{-\alpha\tau_1\sec z}$ may be neglected. This gives $\mathcal{A} = \mathcal{H}$, and accordingly 

$$\frac{dB(\nu_1, T)}{d\tau} = 3\mathcal{H} - \lambda\sec z\,J_2^0\,(\alpha^2 - 3\cos^2 z)\,e^{-\alpha\tau\sec z}.$$

If $\alpha$ is large $3\cos^2 z$ may be neglected compared with $\alpha^2$. Then the temperature 
gradient changes sign at $\tau = \tau'$, where 

$$3\mathcal{H} = \lambda\alpha^2 J_2^0\sec z\,e^{-\alpha\tau'\sec z}.$$

The boundary condition at the top of the atmosphere is no inflow of infra-red 
radiation, or $J_1 = 2H_1$ at $\tau = 0$. (The solar infra-red radiation has an intensity of order 
$\omega T'/T$ compared with the earth’s radiation.) Then $J_1^0 = 2(\mathcal{H} + \lambda\cos z\,J_2^0)$. But 
$B(\nu_1, T) = J_1 + \alpha\lambda J_2$, so that if $T = T_0$ at $\tau = 0$, 

$$B(\nu_1, T_0) = 2\mathcal{H} + \lambda J_2^0\,(\alpha + 2\cos z).$$

The temperature at the point where the gradient is reversed, $\tau = \tau'$, is $T_{\min}$, where 

$$B(\nu_1, T_{\min}) = 2\mathcal{H}\,(1 + \tfrac32\tau' + \tfrac32\alpha^{-1}\cos z) + \lambda J_2^0\,(2\cos z + 3\alpha^{-1}\cos^2 z).$$

For $\alpha$ large these expressions become 

$$B(\nu_1, T_0) = 2\mathcal{H} + \alpha\lambda J_2^0, \tag{4·1}$$

$$B(\nu_1, T_{\min}) = 2\mathcal{H}\,(1 + \tfrac32\tau') + 2\lambda\cos z\,J_2^0, \tag{4·2}$$

giving the temperature $T_0$ at the top of the atmosphere and the minimum temperature 
$T_{\min}$. $J_2^0$ is supposed known and $= \omega S_2^0$. Equation (4·1) is very similar 
to Milne’s equation (28″), from which it differs slightly because Milne’s approximation 
is not quite the same as $K_1 = \tfrac13 J_1$, and also because an internal boundary has been 
used in deriving (4·1). The use of this internal boundary shows that a minimum 
temperature can occur in an atmosphere in radiative equilibrium. It should be 
noti<^ that Milne’s statement that cos a > ^ (in our notation cos z>^Ba,),ia most 
applications, is correct if one is comparing heavy infra-red absorption with an almost 
transparent visible spectrum; but the introduction of still heavier absorption in the 
far ultra-violet leads to a large with, of course, very different consequences. Milne’s 
use of exponential absorption for direct sunlight and his use of (24) and (26) for 
earth radiation correspond to oiir approximation = coa^ zJ^ and 

5. One cannot proceed from (4·1) to an evaluation of the actual temperatures 
which will be encountered in the atmosphere because $k_\nu$ is not even roughly 
independent of $\nu$ in the far ultra-violet. Consider first the ultra-violet absorption due to 
molecular oxygen O₂. The absorption is in the Schumann-Runge continuum and is 
absorbed by the ground state ³Σ. It results in dissociation of the molecule into two 
atoms, one in the ground state ³P and one in a metastable state ¹D, thus 

$$\mathrm{O_2}(x\,^3\Sigma) + h\nu \to \mathrm{O}(^3P) + \mathrm{O}(^1D).$$

The dissociation potential of O₂ is 5.09 V and the excitation potential of O(¹D) is 
1.96 V. Hence the excess of energy over 7.05 V (corresponding to $\nu_0 = 5.7 \times 10^4$ cm.⁻¹) 
absorbed in the Schumann-Runge continuum is converted into kinetic energy. 
Turning to the potential energy, the most likely transitions for the O(¹D) atom are: 

I. Ordinary recombination 0(^D) + 0(®P) -^02+ hv, contributing to 

11. (a) Super-elastic collision 0(^D)+X->0{®P)+X + ^mj'®, contributing to 9/. 

(6) Super-elastic collision 0{^D) + X0(®P) -t- X', contributing to 1 /. 

X'-^X + hv 

III. Forbidden transition 0(^D) ->■ 0(®P) -1- hv, contributing to 

Of these the forbidden transition is by far the most likely when the number of 
particles per c.c. is 10^* or less, as the mean life of the metastable state is only 0’6 sec. 
Accordingly, scattering is negligible: most of the energy absorbed is transferred to 
other frequencies by fluorescence and hence 

$$\xi_\nu = 0, \qquad \eta_\nu = (\nu - \nu_0)/\nu, \qquad \zeta_\nu = \nu_0/\nu. \tag{5·1}$$

Now work out the temperature distribution in an atmosphere in which absorption 
is performed by molecular oxygen, leaving atomic oxygen out of account for the 
moment. Define a mean absorption coefficient $k$ for infra-red frequencies by 

$$kB(T) = \int k_\nu B(\nu, T)\,d\nu,$$

the integral being taken over the infra-red, and $T$ being that of some representative 
level in the atmosphere. Strictly speaking $\int k_\nu J_\nu\,d\nu$ over the infra-red frequencies 
is not necessarily equal to $kJ$ but it will be supposed that this is so: then the equation 
of radiative equilibrium (2·3) gives 

$$k\{J - B(T)\} + \int_{\nu_0}^{\infty} \eta_\nu k_\nu J_\nu\,d\nu = 0, \tag{5·2}$$

where $\nu_0$ is the low-frequency limit of the ultra-violet absorption. 

The absorption coefficient of O₂ in the Schumann-Runge continuum has been 
measured by Ladenburg & van Voorhis (1933), according to whom it increases 
almost linearly from $\nu = 5.9 \times 10^4$ cm.⁻¹ to $\nu = 6.9 \times 10^4$ cm.⁻¹, decreasing sharply 
thereafter. For present purposes take $k_\nu = a(\nu - \nu_0)$. A rather cumbrous calculation 
allowing for the decrease in $k_\nu$ beyond $6.9 \times 10^4$ cm.⁻¹ shows that little error is 
introduced by this simple assumption: the physical justification is that the solar energy 
curve weakens very rapidly with increasing frequency. 



226 


R. V. d. R. Woolley 


Now the energy from sunlight in the Sohumann-Runge continuum once absorbed 
is re-emitted in other wave-lengths so that no radiation field is built up comparable 
in intensity with direct sunlight. Hence = sec®zJJ, and thus 

$$ J_\nu = \omega S_\nu \exp\{-k_\nu \tau \sec z / k\}. $$

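Then

$$ \int_{\nu_0}^{\infty} \eta\,k_\nu J_\nu\,d\nu = \omega h c^{-2} \int_{\nu_0}^{\infty} a(\nu-\nu_0)^2\,\nu^2 \exp\{-\beta\nu - a \sec z\,(\nu-\nu_0)\tau/k\}\,d\nu, $$ (5·3)

or with sufficient accuracy

$$ \int_{\nu_0}^{\infty} \eta\,k_\nu J_\nu\,d\nu = a\omega S_{\nu_0}\nu_0^{-1} \int_0^{\infty} x^2 \exp\{-\beta(1+\mu\tau)x\}\,dx = kC/(1+\mu\tau)^3. $$ (5·31)

Here μ = a sec z/(kβ). Then from (5·2) and (5·31) it follows that

$$ B = J + C(1+\mu\tau)^{-3}, $$

where C = 2aωS_{ν₀}/(kν₀β³).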

If an analysis similar to that in § 4 is carried out, assuming again that no
ultra-violet light reaches the lower boundary τ = τ₁, then

$$ B(\tau) = \mathcal{H}(2+3\tau) + \frac{3C}{2\mu^2}\Big\{\tfrac{2}{3}\mu + 1 - (1+\mu\tau)^{-1} + \tfrac{2}{3}\mu^2(1+\mu\tau)^{-3}\Big\}, $$ (5·4)

$$ B(0) = 2\mathcal{H} + C(1+\mu^{-1}). $$

Or since μ is large

$$ B(0) = 2\mathcal{H} + C. $$ (5·41)

It may be remarked that, if the approximation from (5·3) to (5·31) is not made, a
more elaborate series (going up to inverse fifth powers) is obtained in place of (5·4)
and, with βν₀ = 12·4, in place of (5·41) one obtains

$$ B(0) = 2\mathcal{H} + 1{\cdot}66\,C. $$ (5·42)

It is convenient to express the constant C in terms of k′, the absorption coefficient
when β(ν − ν₀) = 1. Then βk′ = a and hence

$$ C = \frac{2\omega S_{\nu_0}}{\nu_0\beta^2}\,\frac{k'}{k}. $$ (5·5)


To evaluate ℋ, let r be the radius of the earth, ρ the radius of the sun and R the
mean radius of the earth's orbit. Let the total emission from the sun be 4πρ²Σ ergs/sec.
Of all the radiation which leaves the sun, the fraction πr²/4πR² is intercepted by the
earth. Of this, the fraction 0·37 is reflected as visible light (according to Abbott),
while the remainder is converted into longer wave-lengths and distributed over the
day and night hemispheres. Hence the average infra-red emission per sq.cm. is


$$ {} = 0{\cdot}63\,\Sigma \times \rho^2/4R^2 = 0{\cdot}63\,\omega\Sigma. $$

Now 27 = where a is the coefficient in Stefan’s law and is the effective 

temperature of the sun or 5740°. Hence 


= 0-63(y‘x^Cjr*. 


( 5 - 6 ) 



But B(T) = acT⁴, so that if B(T) = fℋ, where f is some factor,

$$ T = T' \times (0{\cdot}315\,\omega f)^{1/4} = 207^{\circ}{\cdot}5 \times f^{1/4}. $$ (5·7)
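
The arithmetic behind (5·7) can be checked directly. The following is a minimal sketch only: it assumes the dilution factor ω = ρ²/4R² has the value 5·41 × 10⁻⁶ quoted in § 9 and uses the effective solar temperature of 5740° given above; the function name is illustrative and not part of the original text.

```python
# Hedged numerical check of T = T' * (0.315 * omega * f) ** (1/4).
# OMEGA is the solar dilution factor rho^2 / (4 R^2); 5.41e-6 is the value
# quoted later in the paper and is treated here as an assumption.
T_SUN = 5740.0     # effective temperature of the sun, degrees absolute
OMEGA = 5.41e-6    # dilution factor (assumed value)


def boundary_temperature(f):
    """Temperature at which B(T) equals f times the infra-red flux term."""
    return T_SUN * (0.315 * OMEGA * f) ** 0.25


if __name__ == "__main__":
    for f in (1.0, 2.0, 16.0):
        print(f, round(boundary_temperature(f), 1))
```

With f = 1 the sketch returns a value within about a tenth of a degree of the 207°·5 in (5·7); because T varies as the fourth root of f, a sixteen-fold increase in f only doubles the temperature.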

Again, from (3*1), (5-5) and (5-6), 

(7 fc' 

^ ~ I ^ 0-316^ao^T'3’ 


since 2^'^ == 27^'^ has been assumed. Inserting numerical values and using (5*42), 


it is found that 


T = 207°-5 X {14- 7*9 x 


(5-8) 


If k'lTc is of order 10^, T is of order 2000°. 

The infra-red absorption due to water vapour has been measured by Hettner and
also by Weber and Randall. The results do not agree, and as Hettner's observations
are far more extensive they must be used. The coefficient varies irregularly with the
wave-length, and Hettner's results do not extend far enough into the infra-red to
give reliable results for temperatures as low as 300°. A rough estimate from Hettner's
results as reproduced in Brunt's Physical and dynamical meteorology gives k = 15w
at 2000° and k = 30w at 1000°, where w is the proportion of water vapour by weight
in the atmosphere. (A very rough estimate suggests k = 100w at 300°.) The value of
k* as measured by Ladenburg & van Voorhis ( 1933 ) ^ 10®^', where w' is the 

proportion of molecular oxygen by weight in the atmosphere. 

While the value of w′ presents no difficulty apart from dissociation, the value of w
is most uncertain. On the earth's surface the water-vapour pressure is about 10 mb.,
but at the minimum temperature in the stratosphere (about 220°) the saturation
pressure is only about 0·026 mb. This occurs at about 20 km., where the pressure of
oxygen is about 27 mb., so that at this level there is about one molecule of water
vapour to every thousand oxygen molecules and the ratio by mass is one to two
thousand. If this ratio persists upwards the boundary temperature given by (5·8)
is about 1000° C.
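
The two ratios quoted above follow from the pressures in the text. A short sketch, assuming only that water vapour and oxygen are at the same temperature (so that number ratio equals pressure ratio) and taking molecular weights of 18 and 32; the names are illustrative.

```python
# Ratio of water vapour to oxygen at the 20 km temperature minimum,
# using the pressures quoted in the text (both in millibars).
P_WATER_SAT_MB = 0.026   # saturation pressure of water vapour at about 220 degrees
P_OXYGEN_MB = 27.0       # partial pressure of oxygen at about 20 km
M_WATER = 18.0           # molecular weight of H2O
M_OXYGEN = 32.0          # molecular weight of O2


def number_ratio():
    """Water molecules per oxygen molecule (equal temperatures assumed)."""
    return P_WATER_SAT_MB / P_OXYGEN_MB


def mass_ratio():
    """Mass of water vapour per unit mass of molecular oxygen."""
    return number_ratio() * M_WATER / M_OXYGEN


if __name__ == "__main__":
    print("one water molecule per", round(1.0 / number_ratio()), "oxygen molecules")
    print("mass ratio one to", round(1.0 / mass_ratio()))
```

The number ratio comes out near one in a thousand and the mass ratio a little under one in two thousand, in line with the round figures used in the text.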

If, however, water molecules can pass the barrier at 20 km., possibly in the form 
of ice crystals, the water-vapour population may build up above this rather low 
limit. It does not appear impossible to ascertain the total water-vapour content 
above the stratosphere by actual experiment in the near future, by exposing a 
spectrogram in an aircraft flying at a great height.* If the above estimate of the 
water-vapour population is too low by a factor of 10, the boundary temperature 
drops from 1000° C to about 600° C. (As the temperature lowers, the value of k
becomes less certain.) 

By (5·4), near τ = 0, T⁴ ∝ (1 + μτ)⁻³. Roughly speaking the temperature at a
given level is inversely proportional to the mass of molecular oxygen above that
level if the ratio of number of water-vapour molecules to number of oxygen molecules
is constant. The temperature drops to one-half the boundary value below 2 × 10¹⁷
molecules, when the sun is in the zenith. The results of later sections may be
anticipated by saying that the results of this section are thought to apply roughly to
heights from 100 to 200 km.

* (Note added in 1946.) Or released from a high altitude rocket.

6. At very great heights the oxygen will be dissociated to atomic O which will
absorb ultra-violet radiation capable of ionizing the atom in its ground state, that
is to say of wave-length below 912 Å, thus

O + hν → O⁺ + e.

The excess energy over 13·64 V (the ionization potential) appears as kinetic energy
of the electrons. These will rapidly distribute their velocities according to a
Maxwellian law, and the recombination spectrum will be distributed in frequency
according to the molecular-velocity temperature of the (earth's) atmosphere.

It may be inquired what equilibrium would result if the atmosphere consisted
entirely of one element, moreover, one which exhibited only one continuous
spectrum. From each quantum hν absorbed, the excess over the ionization energy
h(ν − ν₀) is imparted to the electrons as kinetic energy. The electrons will by collisions
distribute their kinetic energy according to a Maxwellian distribution characterized
by some parameter T. The emission spectrum arising from the capture of electrons
with this velocity distribution will have the same frequency distribution as
temperature emission for T, so that the emission per gram of the neutral element will
be λk_νB(ν, T)dν, where λ is some constant.


Hence = 1c^I,{d)-^Xk^B{y,T), 

(6-11) 


(6-12) 

and integrating (6'12) with respect to v, then 


= -jhJAv+X^h^Biv, T)dv. 

(6-13) 

But H is constant so that

$$ \int_{\nu_0}^{\infty} k_\nu J_\nu\,d\nu = \lambda \int_{\nu_0}^{\infty} k_\nu B(\nu, T)\,d\nu, $$ (6·2)

the equation of radiative equilibrium in this case.

If the electron gas has no means of exchanging energy with the atmosphere except
through transitions in this one continuous spectrum, one can equate the fraction of
the energy absorbed which goes to electron velocities with the fraction of the
capture spectrum which diminishes electron velocity energies, or

$$ \int_{\nu_0}^{\infty} \frac{\nu-\nu_0}{\nu}\,k_\nu J_\nu\,d\nu = \lambda \int_{\nu_0}^{\infty} \frac{\nu-\nu_0}{\nu}\,k_\nu B(\nu, T)\,d\nu. $$ (6·3)



If now J_ν is ωS(ν, T′), where ω is some fraction and S(ν, T′) is a temperature
distribution corresponding to a (solar) temperature T′, (6·2) and (6·3) can only be
satisfied if λ = ω and T = T′. The re-emission is exactly the same as if the process
were true scattering in a bound-bound transition: but the electrons, if they have
no means of exchanging energy with the atmosphere except in this one band,
must take up a velocity distribution appropriate to the colour temperature of
the sun.
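
Why the two conditions force T = T′ can be illustrated numerically. The sketch below is not the author's calculation: it assumes an absorption coefficient independent of frequency above ν₀ and a Wien-type distribution proportional to ν³ exp(−ν/T), with frequencies measured in units in which h divided by the gas constant per molecule is unity. Dividing (6·3) by (6·2) removes both ω and λ and leaves a single ratio r(T) = r(T′); since r increases steadily with T, the only solution is T = T′, and (6·2) then gives λ = ω.

```python
# Sketch: with J_nu = omega * S(nu, T'), equations (6.2) and (6.3) read
#   omega * I0(T') = lam * I0(T)   and   omega * I1(T') = lam * I1(T),
# where I0 and I1 are the two frequency integrals. Their quotient r = I1 / I0
# is free of omega and lam.
# Assumptions (not from the paper): k_nu constant above nu0, Wien-type
# distribution nu^3 * exp(-nu / T), temperatures expressed as t = T / nu0.

def ratio(t):
    """Closed form of I1 / I0 for the assumed k_nu and Wien distribution."""
    num = 6 * t**3 + 4 * t**2 + t
    den = 6 * t**3 + 6 * t**2 + 3 * t + 1
    return num / den


def solve_temperature(t_sun, lo=1e-3, hi=10.0, steps=200):
    """Bisection for the t with ratio(t) == ratio(t_sun); ratio is increasing."""
    target = ratio(t_sun)
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        if ratio(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    t_sun = 0.04  # illustrative value of T' / nu0
    print(solve_temperature(t_sun))
```

The bisection simply recovers the solar value it was given, which is the point: under these assumptions no other electron temperature satisfies both integral conditions at once.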

There will, of course, in any real atom be more than one continuous spectrum. In
the case of O where the ground state is ³P₀,₁,₂ there are excited states ¹D and ¹S with
energies of 1·96 and 4·17 V. At sufficiently low dilution factors ω the forbidden
transitions between these states and the ground state will become important, and
the atom will transfer radiation from the ultra-violet band O(³P) ⇌ O⁺(⁴S) to the
bands O(¹S) ⇌ O⁺(⁴S) and O(¹D) ⇌ O⁺(⁴S). The velocity distribution of the electrons
will not be reduced unless high-velocity electrons show a greater tendency to fall
into an excited state (rather than the ground state) than slow electrons.

If the temperature of the upper boundary of the earth's atmosphere falls much
below 6000°, there must be some infra-red radiators, and if water vapour is
dissociated at very great heights (see § 13) search must be made elsewhere for emission of
low-temperature radiation. Two possibilities present themselves, which will be
examined in succeeding sections. These are negative ions (§ 7) and dust (§ 8).

7. Although negative ions with their low dissociation potentials are very greatly
dissociated into atoms and electrons in sunlight there will at all times be a small but
finite negative-ion population. Since the continuous spectrum of the negative ion 
lies in the visible or infra-red, a very small negative-ion population may have a 
great effect on the temperature if there are no other low-frequency radiations 
available. 

Let n_e, n⁻ and n be the numbers of electrons, negative ions and atoms per c.c.
Then in dilute solar radiation the populations will be given by the modified Saha
formula

$$ \frac{n_e n}{n^-} = \varpi\,\frac{2(2\pi mR)^{3/2}}{h^3}\,\omega\,e^{-\chi/RT'}F(T, T'), $$ (7·1)

where ϖ is the ratio of the statistical weights of the atom and negative ion*, χ is the
dissociation potential, T′ the solar temperature, T the temperature of the
atmosphere, and F(T, T′) is a factor which depends upon the form of the absorption
coefficient k_ν of the negative ion as a function of frequency. In particular, if k_ν is
proportional to ν⁻³, F(T, T′) = T′T^(1/2). The subject is discussed very fully by
Pannekoek (1930).

For given values of n_e and n we have n⁻ proportional to exp(χ/RT′). The infra-red
emission per c.c. of such ions is proportional to

$$ n^- \int_{\nu_1}^{\infty} k_\nu B(\nu, T)\,d\nu, $$

where hν₁ = χ, and the result of this integration will depend on the variation of
k_ν with ν, but the most important term will be proportional to exp(−χ/RT). Hence the
emission per c.c. varies with

$$ \exp\{\chi/RT' - \chi/RT\}, $$

and if T′ is considerably greater than T the emission is greater for small values of χ
than for large ones. Thus the ion N⁻ whose dissociation potential is estimated at
0·04 V is more important than O⁻ whose potential is between 2 and 3 V.
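
The size of this effect can be sketched numerically. The example assumes that the emission per c.c. varies as exp(χ/RT′ − χ/RT), as the argument above indicates, and takes illustrative temperatures T′ = 6000° and T = 1000°; the constant is the gas constant per molecule expressed in electron-volts per degree, and 2·5 V stands in for the "between 2 and 3 V" of O⁻.

```python
import math

# Gas constant per molecule (Boltzmann constant) in electron-volts per degree.
BOLTZMANN_EV_PER_K = 8.617e-5


def emission_factor(chi_volts, t_sun, t_atm):
    """exp(chi/RT' - chi/RT) for a dissociation potential chi given in volts."""
    x = chi_volts / BOLTZMANN_EV_PER_K
    return math.exp(x / t_sun - x / t_atm)


if __name__ == "__main__":
    for chi in (0.04, 2.5):   # N- estimate, and a representative value for O-
        print(chi, emission_factor(chi, 6000.0, 1000.0))
```

On these assumed temperatures the factor is of order unity for a potential of 0·04 V and smaller by some ten powers of ten for a potential of 2·5 V.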

As n_e decreases with increasing height, the ratio n⁻/n decreases with increasing
height, according to (7·1), so that in the limit where interstellar space is approached
the influence of the negative ions disappears. However, at heights of 200 to 300 km.
the negative ions will have a profound effect.

It is not possible to work out the radiation balance accurately because the
proportion of nitrogen to oxygen, and the state of the nitrogen (whether molecular or
atomic), is uncertain at great heights, and also because the absorption coefficients
of negative nitrogen ions N⁻ and N₂⁻ are unknown. If N₂⁻ has a very low dissociation
potential and an absorption coefficient nearly equal to that of N⁻, it will make no
difference to the calculation whether the nitrogen is in the molecular or atomic
state, given the ratio of nitrogen particles to oxygen atoms. It will be supposed
that negative ions showing a very low dissociation potential and exhibiting an
absorption coefficient independent of frequency and equal to 4·5 × 10⁻¹⁸ cm.² (the
absorption coefficient of O) have to be dealt with. (It is known that the absorption
coefficient of O⁻ is of order 10⁻¹⁸ cm.².)

Now if k_ν is independent of ν and if χ is small, the function F(T, T′) in (7·1)
becomes   Assuming a solar temperature of 6826° (for the visible spectrum)
and an atmospheric temperature of 1000° for nitrogen it is found that

$$ \frac{n_e n}{n^-} = 4{\cdot}7 \times 10^{15}. $$ (7·11)

Now assume that the atomic oxygen population is equal to the nitrogen population,
and assume values of n and n_e. Then temperatures may be calculated from

$$ n^- acT^4 = n\int_{\nu_0}^{\infty} \omega h\nu^2(\nu-\nu_0)c^{-2}e^{-h\nu/RT'}\,d\nu. $$ (7·2)

Some results are shown in table 1. In computing this table T′ has been taken as
6000° for the spectrum beyond 912 Å. This is very uncertain, and the values of T
are very sensitive to the adopted value of T′.


Table 1. Temperatures by equilibrium between
negative nitrogen ions and oxygen atoms

log n assumed          10       8        6
log n_e assumed        5·7      4·7      3·7
log n⁻ from (7·11)     0·0     −3·0     −6·0
T, from (7·2)          1000°    1570°    3100°
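
The third row of table 1 follows from the first two by (7·11). A small sketch, in which the constant is taken as 4·7 × 10¹⁵; the exponent is inferred from the tabulated values themselves and should be read as an assumption.

```python
import math

# n_e * n / n_minus from (7.11); the exponent 15 is inferred from table 1
# and is an assumption of this sketch.
SAHA_CONSTANT = 4.7e15


def log_negative_ions(log_n, log_ne):
    """log10 of the negative-ion population from log n and log n_e."""
    return log_n + log_ne - math.log10(SAHA_CONSTANT)


if __name__ == "__main__":
    for log_n, log_ne in ((10.0, 5.7), (8.0, 4.7), (6.0, 3.7)):
        print(log_n, log_ne, round(log_negative_ions(log_n, log_ne), 1))
```

Rounded to one decimal place the sketch gives 0·0, −3·0 and −6·0 for the three columns.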




Table 1 shows that the negative ions probably play a very important part in
controlling the temperature. They are able, on the figures assumed, to radiate
enough energy, compared with the intake of oxygen atoms, to pull the temperature
down to 1000° or so. But as each oxygen molecule absorbs 4 × 10⁵ times as much
energy from full radiation at 6000° as an oxygen atom does, the negative ions would
play no effective part in cooling a concentration of 10¹⁰ oxygen molecules per c.c.
A discussion of the change-over from atomic to molecular oxygen with decreasing
height in the atmosphere is given in § 14.

If it be supposed that the absorption coefficient of the negative ions is 4·5 × 10⁻¹⁸
and the molecular velocity temperature 1000°, and that there are 10¹⁰ relevant
neutral particles and 10^(5·7) electrons per c.c., each neutral particle will capture an
electron once in about 18 days. Accordingly, since the neutral particles carry nearly
all the kinetic energy, and can only lose it by capturing an electron and radiating a
quantum, the atmosphere will not sensibly change its temperature from day to
night. To obtain a day and night average, the temperatures shown in table 1 must
be multiplied by 2^(−1/4), so that the value for n = 10¹⁰ becomes 840°, etc.
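
The factor 2^(−1/4) arises because the emission in (7·2) varies as T⁴ while the intake of solar energy, averaged over day and night, is halved. A minimal sketch applying it to the temperatures of table 1 (the function name is illustrative):

```python
def day_night_average(t_day):
    """Scale a day-time equilibrium temperature by 2 ** (-1/4)."""
    return t_day * 2.0 ** -0.25


if __name__ == "__main__":
    for t_day in (1000.0, 1570.0, 3100.0):   # values from table 1
        print(t_day, round(day_night_average(t_day)))
```

For the first column this gives a value within a degree of the 840° quoted above.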

It should be noticed that the absorption of ultra-violet energy is attributed to
oxygen and not nitrogen. The nitrogen molecule does not appear to have a strong
absorption continuum, and the nitrogen atom has an ionization potential of 14·40 V
against 13·64 V for atomic oxygen, so that the nitrogen atom is the less efficient of
the two in absorbing solar radiation.

8. Dust particles, if present in sufficient numbers, can cool the gases in the upper
atmosphere. Suppose that there are N spherical dust particles per c.c. each of radius
r and let πr²N = σ. Then if there are n gas molecules per c.c. each of mass M the
total number of molecules which strike dust particles per c.c./sec. is

$$ 2\sigma n\left(\frac{2RT}{\pi M}\right)^{1/2}, $$

the average kinetic energy being 2RT (swifter particles striking more often than
slower ones). As usual T is the parameter which characterizes the Maxwellian velocity
distribution of the gas. If the dust particles are maintained by their radiation balance
at some temperature T′, the gas molecules will rebound from the dust with an average
kinetic energy 2RT″, where

T − T″ = a(T − T′),

and a is called the accommodation coefficient. Hence the net interchange of energy
per c.c./sec. from the gas to the dust is

$$ 4a\sigma nR\left(\frac{2RT}{\pi M}\right)^{1/2}(T - T'), $$

and if E is the energy absorbed per sec. by each molecule from the radiation
field, and if the dust is the principal cooler, then

$$ E = 4a\sigma R\left(\frac{2RT}{\pi M}\right)^{1/2}(T - T'). $$ (8·1)



The value of o' may be estimated from the brightness of the zodiacal light. According 
to Elvey & Roach ( 1937 ) the brightness of one square degree of the zodiacal light at 
a point on the ecliptic 90° from the sun is equal to 200 stars of the tenth magnitude. 
Let the albedo of the dust be e. Then 


e<r = 5 X 10“^^ x 


where ^ is a factor depending on the law of distribution of o' as a function of the 
distance from the sun, having the value of 0-67 if o' is inversely proportional to the 
distance of 1-25 if it is proportional to the inverse fourth power, so that one may 
assume ^ = 1 . The value of the albedo may be estimated from that of the moon 
(0-07) and of Ceres (0-06) father than from the albedos of planets which have 
atmospheres. Taking e = 0‘07 and ^ = 1 , then o' = 7 x 10 “^®. Now it is easily seen 
that in the upper atmosphere the intake of solar energy by oxygen atoms is unable 
to heat the dust materially so that T' = 233° (by day). If it be supposed that oxygen 
atoms constitute the fraction / of the total population which conveys energy from 
gas to dust and differences of mass in the gas particles be ignored, then 


I 


and E can be calculated from the absorption coefficient for oxygen. 

The resultant values of T ard highly dependent on the assumed temperature of 
the sun in the ultra'-vdolet, and on the values of /, a and cr. With / = i, a = i, 
(r = 7 X 10 ~“ and a solar temperature of 6000° we get T = 4000°. With these figures 
the dust is less efficient as a cooler than negative nitrogen ions at a level in the atmo¬ 
sphere when n = 10 * atoms per c.c., according to the preceding section. 

It should, however, be emphasized that both calculations may be substantially
revised when relevant coefficients and conditions become better known. In default 
of other infra-red radiators, however, one or other of these types of particle, or 
possibly both acting at different heights, must determine the molecular velocity 
temperature at great heights in the earth’s atmosphere. 


9. It is not possible to proceed further without examining the dissociation of the
molecules in the atmosphere into their constituent atoms as a function of height.
In full equilibrium at a temperature T the dissociation of a homonuclear diatomic
molecule is given by

$$ \frac{n_1^2}{n_2} = \frac{G}{2hr_0^2}\left(\frac{MRT}{\pi}\right)^{1/2}e^{-\chi/RT}, $$ (9·1)

where n₁, n₂ are the numbers of atoms and molecules per c.c., M the mass of each
atom and G, r₀ and χ constants, the latter being the dissociation potential.
This expression is derived, for example, in Rosseland's Theoretical Astrophysics,
where values of the constants for various molecules are given. For oxygen, G = 27,
r₀ = 1·20 × 10⁻⁸ cm., and χ = 5·09 V. Then from (9·1) with T = 1000°, n₁²/n₂ = 0·319
and for T = 300° the value is 3 × 10⁻⁶¹. Just as Saha's equation leads to Milne's
well-known relation (1924a) between the continuous absorption coefficient of an





atom and the probability that an ion will capture an electron, so equation (9·1)
(which is called the isochore) leads to a relation between the absorption coefficient
of the molecule and the probability of recombination between two atoms, which is 
mentioned here because the oxygen absorption coefficient already quoted in earlier 
sections leads to a mean life of the oxygen atoms of the order of 25 days with 

= 10 ^^, showing that dissociation may safely be averaged over day and night. 
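
The equilibrium values quoted for oxygen can be checked from the isochore. The sketch below assumes that (9·1) has the form n₁²/n₂ = (G/2hr₀²)(MRT/π)^(1/2) exp(−χ/RT), with R the gas constant per molecule, and uses present-day c.g.s. constants, so agreement with the quoted figures is expected only to within several per cent. All names are illustrative.

```python
import math

H_PLANCK = 6.626e-27           # erg sec
K_BOLTZMANN = 1.3807e-16       # erg per degree (the paper's R)
ATOMIC_MASS_UNIT = 1.6605e-24  # gram
ERG_PER_VOLT = 1.6022e-12      # energy of one electron-volt in erg


def dissociation_ratio(temp, weight=27.0, r0=1.20e-8, chi_volts=5.09,
                       atom_mass=16.0 * ATOMIC_MASS_UNIT):
    """n1^2 / n2 from the assumed form of (9.1), in c.g.s. units."""
    prefactor = weight / (2.0 * H_PLANCK * r0**2)
    thermal = math.sqrt(atom_mass * K_BOLTZMANN * temp / math.pi)
    boltzmann = math.exp(-chi_volts * ERG_PER_VOLT / (K_BOLTZMANN * temp))
    return prefactor * thermal * boltzmann


if __name__ == "__main__":
    for temp in (1000.0, 300.0):
        print(temp, dissociation_ratio(temp))
```

With these constants the sketch gives a ratio of about a third at 1000° and a few times 10⁻⁶¹ at 300°, the same order as the 0·319 and 3 × 10⁻⁶¹ in the text.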

The effect of departures from thermodynamical equilibrium on ionization (which
is very similar in this respect to dissociation) was first examined by Woltjer and
applied to stellar atmospheres by Pannekoek (1926, 1930). The same ideas have been
applied specifically to dissociation by Majumdar (1938). The form of the result will
depend on the variation of k_ν with ν. If one supposes, for example, that k_ν is
proportional to ν − ν₀ (where hν₀ = χ) for all ν > ν₀ and zero for ν < ν₀, then (9·1)
becomes


% ^^0 \^/ 


(9-2) 


where T′ is as before the solar temperature. With T′ = 6826° and ω = 5·41 × 10⁻⁶,
then for oxygen, with T = 1000°, n₁²/n₂ = 2·73 × 10¹⁶ and with T = 300°,
n₁²/n₂ = 1·66 × 10¹⁷, in great contrast to the values given by (9·1). Except at the
top of the atmosphere, the radiation is not simply dilute temperature radiation,
but is weakened by absorption by day and is absent at night. Since the life of the
dissociated atom is large compared with 24 hr., the dissociation may be calculated
by taking an average over day and night, and as regards absorption, the value of
n₁²/n₂ must be multiplied by an appropriate factor before this average is taken.

Ozone will be ignored and attention confined to heights above the ozone layer.
Besides radiative recombination and dissociation consideration will be given to the
effects of three-body collisions of type

O₂ + X ⇌ O + O + X.

If n_X is the number of X particles per c.c. the number of dissociations per c.c./sec.
due to collisions is proportional to n₂n_X and the number of recombinations is
proportional to n₁²n_X. Then if the day and night variation of dissociation is small secular
equilibrium follows, or

$$ n_2(q + \Sigma b_X n_X) = n_1^2(a + \Sigma\lambda_X n_X), $$

the summation being over all possible types of third body X. q is the number of
dissociating quanta absorbed per second by the molecule. In thermodynamic
equilibrium the collision processes balance, or

$$ (n_2\,\Sigma b_X n_X)_T = (n_1^2\,\Sigma\lambda_X n_X)_T, $$

so that if dilute radiation greatly increases the value of n₁²/n₂ over (n₁²/n₂)_T then
n₁²Σλ_X n_X is much greater than n₂Σb_X n_X. Under such circumstances the dissociation
equilibrium equation is

$$ n_2 q = n_1^2(a + \Sigma\lambda_X n_X). $$ (9·3)





This may be compared with a formula used by Wulf & Deming (1936) in calculating
the dissociation of oxygen in the atmosphere. Their equation (8) in the present
notation is

$$ n_2 q = n_1^2\,\Sigma\lambda_X n_X, $$ (9·4)

which is clearly applicable in regions where the recombination through collisions
is more important than radiative recombination. It is clear, however, that (9·4)
will give too small a molecular concentration when n_X becomes small, that is, at
great heights.

10. Now consider the gravitational equilibrium of an isothermal dissociating
atmosphere in which there is no mixing by vertical transport. Choosing the axis
of x vertical and positive upwards, the equation of hydrostatic equilibrium is

$$ dp = -g\rho\,dx, $$ (10·1)

where p is the total pressure and ρ the total density. Consider an atmosphere
consisting only of O and O₂ reacting with one another. Let M be the mass of the oxygen
atom and let there be n₁ oxygen atoms and n₂ oxygen molecules per c.c. Then

$$ p = (n_1 + n_2)RT, \qquad \rho = M(n_1 + 2n_2). $$

If there is no absorption of the dissociating radiation and if collisions may be
neglected, then

$$ q n_2 = a n_1^2, $$ (10·2)

where q and a are constant. These conditions apply to the top of the atmosphere.
Equation (10·1) becomes

$$ RT\,d(n_1 + n_2) = -Mg(n_1 + 2n_2)\,dx. $$

But from (10·2) q dn₂ = 2an₁ dn₁. Accordingly

$$ RT\,dn_1\,(q + 2an_1) = -Mg\,n_1\,dx\,(q + 2an_1), $$

and writing RT/Mg = H, then dn₁/n₁ = −dx/H, so that

$$ n_1 = Ae^{-x/H}, \qquad n_2 = Be^{-2x/H}. $$ (10·3)

The constants A and B are related by qB = aA². Each constituent is distributed
vertically with height exactly as if the reaction did not take place. This result
applies to any number of reacting constituents. It may be proved in the case of full
equilibrium from a general theorem due to Willard Gibbs (1875), and the result was
applied to gravitational equilibrium in a reaction of type (10·2) with q/a constant
by Milne (1924b). The result, of course, does not hold if n₁ and n₂ are connected by
the more general relation

$$ n_2(q + \Sigma b_X n_X) = n_1^2(a + \Sigma\lambda_X n_X), $$




unless (1) there is full equilibrium or (2) collisions are much less important than
radiation (equation (10·2) above) or (3) collisions are much more important than
radiation. In the earth's atmosphere case (2) applies to great heights and case (3)
to levels near the ground. The transition is examined in the next section.
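
The statement that each constituent settles as if no reaction took place can be verified numerically for case (2). The sketch below assumes the exponential distributions n₁ = A exp(−x/H), n₂ = (a/q)A² exp(−2x/H) and tests them against the hydrostatic condition in the form H d(n₁ + n₂)/dx + (n₁ + 2n₂) = 0, using a central difference. The numerical values of A, q, a and H are illustrative only.

```python
import math


def hydrostatic_residual(x, big_a=5.0e16, q=6.0e-6, a=1.26e-22,
                         scale_height=15.9, dx=1.0e-4):
    """Relative residual of H d(n1+n2)/dx + (n1 + 2 n2) = 0 at height x (km).

    n1 = A exp(-x/H) and n2 = (a/q) A^2 exp(-2x/H), so that q n2 = a n1^2
    holds at every level. All default values are illustrative assumptions.
    """
    def n1(z):
        return big_a * math.exp(-z / scale_height)

    def n2(z):
        return (a / q) * n1(z) ** 2

    def total(z):
        return n1(z) + n2(z)

    derivative = (total(x + dx) - total(x - dx)) / (2.0 * dx)
    weight = n1(x) + 2.0 * n2(x)
    return abs(scale_height * derivative + weight) / weight


if __name__ == "__main__":
    for height in (0.0, 30.0, 60.0):
        print(height, hydrostatic_residual(height))
```

The residual is at the level of the finite-difference error at every height tried, whatever the relative proportions of atoms and molecules, which is the content of the Willard Gibbs-Milne result for this case.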

11. The term q is weakened by absorption as one goes downwards in the
atmosphere, but to obtain a solution it will be supposed that q is constant until the
transition from atomic to molecular oxygen has been passed. With q constant a
simple solution of (10·1) and (10·2) is obtained if the third bodies X are the oxygen
molecules and atoms themselves, and if they are equally efficient in three-body
collisions, for then (10·2) becomes (9·3), in this case

$$ q n_2 = n_1^2\{a + \lambda(n_1 + n_2)\}. $$ (11·1)

The solution of (10·1) and (11·1) is

$$ n_1(1 - \lambda q^{-1}n_1^2)^{-1/2} = Ce^{-x/H}, \qquad q n_2 = n_1^2(a + \lambda n_1)(1 - \lambda q^{-1}n_1^2)^{-1}. $$ (11·2)

For n₁ small, then n₁ = Ce^(−x/H) and n₂ = aq⁻¹C²e^(−2x/H) as in (10·3). As x decreases,
n₁ increases until it approaches the value (q/λ)^(1/2) asymptotically. At low levels n₂ is
approximately equal to q⁻¹{a + (λq)^(1/2)}C²e^(−2x/H).

Now q/a is equal to the value of n₁²/n₂ in (9·2), and q is known separately from the
absorption coefficient. Further, q⁻¹{a + (λq)^(1/2)}C² is the number of oxygen molecules
per c.c. at ground level. Hence the distribution of oxygen molecules and atoms in
the atmosphere is known if λ is known. The number of oxygen atoms per c.c. at
ground level is not given correctly by these equations, as absorption will weaken q
and finally reduce it to the level where the collision term Σb_X n_X (omitted from (11·1))
is important. Thus the ground value of n₁ is given by (9·1) with the ground
temperature of the atmosphere. If the absorption of the ultra-violet radiation only
becomes important below the level where collisions have reduced the oxygen to the
molecular state, the values of n₂ will be correct throughout the atmosphere and
those of n₁ correct in and above the transition layer. It is, however, wrong to suppose
that atoms are as efficient as molecules in absorbing energy in three-body collisions.
The molecule with its vibrational and rotational degrees of freedom offers far greater
possibilities of absorbing energy through resonance. Two cases arise: (a) where the
molecules of the substance itself are the most important contributors to Σλ_X n_X, or
Σλ_X n_X = λn₂, and (b) where the molecules of another gas are the most important
contributors, or

$$ \Sigma\lambda_X n_X = \lambda n' e^{-2x/H}. $$

In case (a), which presumably applies to nitrogen, the equation (9·3) becomes

$$ q n_2 = n_1^2(a + \lambda n_2), $$



which, together with (10·1), has the solution

a _ 

I l + 2aTO|-;ttra| l W+i«)+a-/md W+z*)^ ijj 

l-pinl )\Aj{cc^+/^)-oc+/inJ 

Tig = TiKa + z^Tig), 

where ajq = a, /i = aA and JD is a constant of integration. 

Case (b) applies to oxygen with nitrogen molecules as the third bodies. Strictly
speaking the value of H is not the same for the two molecules, but recognition of this
produces analytical complications in expressions which are already cumbrous.
Ignoring this point (9·3) becomes

$$ q n_2 = n_1^2(a + be^{-2x/H}). $$ (11·4)

It is convenient to write 2an₁/q = y. Then the solution of (11·4) and (10·1) is

$$ ay^2 = 2\{E - \ln(1+y) + y - \tfrac{1}{2}y^2\}\,be^{-2x/H}, $$ (11·5)

n₂ being derived from y by (11·4), and E being the constant of integration. It is not
obvious that (11·4) and (11·5) are essentially similar to (11·2), but in (11·5) as one
goes downwards y approaches the value y = y₁, where

$$ \ln(1+y_1) - y_1 + \tfrac{1}{2}y_1^2 = E. $$

At the top of the atmosphere where y is small

$$ y = (2bE/a)^{1/2}e^{-x/H}, $$

and at the bottom of the atmosphere 

Although the actual atmosphere is not isothermal, calculations for an isothermal
atmosphere at T = 300° probably represent the first 160 km. of the atmosphere well.
From Wulf & Deming's estimate (1938) of the collision factor, b may be taken in
(11·4) at 10⁻¹³. If the solar temperature applicable to 2000 Å be estimated at 6300°,
then q = 6 × 10⁻⁶ quanta per sec. per molecule for oxygen. With T′ = 6300 and
T = 300 then q/a = n₁²/n₂ = 4·9 × 10¹⁶ from (9·2). Hence a = 1·26 × 10⁻²². Values
of log n₁ and log n₂ computed from (11·4) and (11·5) are shown in table 2. These
values are plotted in figure 1 and joined by full lines. The pecked lines show the
distribution of O and O₂ at great heights if the temperature is 1000° at those heights
and if the value of n₁ for O is 2 × 10¹⁰ (§ 14) and the dotted line shows values obtained
in § 12 (table 3). In § 5 it was seen that the Schumann-Runge absorption is reduced
to one-half below 2 × 10¹⁷ molecules. With the conditions assumed, this will occur
rather above 100 km., and the assumption that q can be taken constant requires
modification. This would result in greater molecular and smaller atomic populations
than those shown in table 2. This effect will be increased by temperatures higher
than 300°.




It is less easy to estimate the dissociation of nitrogen as the spectrum of N₂ is
not so well known as the spectrum of O₂. On account of its greater dissociation
potential (9 V) N₂ is much less dissociated than O₂. *


Table 2. Dissociation of oxygen in an isothermal atmosphere

H, km.    83     98     110    122    133    170    206    242    278
log n₁    13·23  13·16  12·99  12·69  12·39  11·39  10·39  9·39   8·39
log n₂    14·10  13·15  12·17  10·90  9·69   6·19   4·09   2·09   0·09
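
The molecular populations of table 2 can be recovered from the atomic ones through (11·4). The sketch below assumes (11·4) in the form qn₂ = n₁²(a + b exp(−2x/H)) with q/a = 4·9 × 10¹⁶, a = 1·26 × 10⁻²² and b = 10⁻¹³, and a scale height for atomic oxygen at 300° of 15·9 km (computed from RT/Mg with present-day constants). The powers of ten in particular are reconstructions and should be treated as assumptions.

```python
import math

Q_OVER_A = 4.9e16        # q/a from (9.2) with T' = 6300, T = 300 (assumed exponent)
A_RADIATIVE = 1.26e-22   # radiative recombination coefficient a (assumed exponent)
B_COLLISION = 1.0e-13    # collision factor b (assumed exponent)
SCALE_HEIGHT_KM = 15.9   # RT/Mg for atomic oxygen at 300 degrees (assumed)

HEIGHTS_KM = [83, 98, 110, 122, 133, 170, 206, 242, 278]
LOG_N1 = [13.23, 13.16, 12.99, 12.69, 12.39, 11.39, 10.39, 9.39, 8.39]
LOG_N2_TABLE = [14.10, 13.15, 12.17, 10.90, 9.69, 6.19, 4.09, 2.09, 0.09]


def log_n2(height_km, log_n1):
    """log10 n2 from q n2 = n1^2 (a + b exp(-2x/H)), heights from the ground."""
    q = Q_OVER_A * A_RADIATIVE
    third_body = B_COLLISION * math.exp(-2.0 * height_km / SCALE_HEIGHT_KM)
    return 2.0 * log_n1 + math.log10((A_RADIATIVE + third_body) / q)


if __name__ == "__main__":
    for h, l1, l2 in zip(HEIGHTS_KM, LOG_N1, LOG_N2_TABLE):
        print(h, round(log_n2(h, l1), 2), l2)
```

On these assumptions every computed value of log n₂ agrees with the tabulated one to within a few hundredths. The three-body term dominates below about 150 km and the radiative term above.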



Figure 1. Distribution of O and O₂ with height. Full lines: isothermal atmosphere at
300° K (table 2). Dotted line: temperature increasing from 300 to 1000° K (table 3).
Pecked lines: values based on collision frequencies (§ 14).

12. As has been seen in earlier sections, above 100 km. or so (the limit to which
the Schumann-Runge radiation reaches) the temperature very probably increases,
in a manner depending upon the distribution of the sources of ultra-violet absorption
and infra-red emission. To examine the effect of the temperature gradient on the
distribution of population with height, assume as a first approximation that the
atmosphere consists of only one constituent and that the temperature gradient is
proportional to the population at any level. Let there be n particles per c.c., each
of mass M. Then ρ = nM and p = nRT. Now it has been supposed that

$$ R\,dT = \sigma g\rho\,dx, $$ (12·1)

where σ is some constant. In addition

$$ dp = -g\rho\,dx. $$ (12·2)

These conditions are supposed to hold from x = x₁ to x = ∞. Let T = T₁ and n = n₁
at x = x₁, and let RT₁/Mg = H₁. Then the solution of (12·1) and (12·2) is

$$ (\sigma n + 1)T = (\sigma n_1 + 1)T_1, $$

$$ \frac{x - x_1}{H_1} + \left(\frac{T}{T_1} - 1\right) + (\sigma n_1 + 1)\ln\frac{nT}{n_1T_1} = 0. $$ (12·3)

Suppose, for example, that n₁ = 10¹³ and T₁ = 300° at x₁ = 100 km., and that
H₁ = 15 km. Let us choose the value of σ so as to make T tend to the limiting value
1000°. Then the temperature and population will be distributed with height as is
shown in table 3.

Table 3. Temperature increasing uniformly
with number of particles above 100 km.

log n     13·0   12·48  12·0   11·48  11·0   10·48
H, km.    100    112    140    187    222    282
T         300    589    810    935    977    993
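
The solution of (12·1) and (12·2) can be evaluated directly. The sketch below assumes the two relations (σn + 1)T = (σn₁ + 1)T₁ and (x − x₁)/H₁ + (T/T₁ − 1) + (σn₁ + 1) ln(nT/n₁T₁) = 0, with T₁ = 300°, x₁ = 100 km., H₁ = 15 km. and σ fixed by the limiting temperature of 1000°. It is compared here only with the first four columns of table 3.

```python
import math

T1 = 300.0         # temperature at the base level, degrees
T_LIMIT = 1000.0   # limiting temperature at great heights
X1_KM = 100.0      # base level
H1_KM = 15.0       # R T1 / M g
# sigma * n1 follows from (sigma n1 + 1) T1 = T_LIMIT, since sigma n -> 0 aloft.
SIGMA_N1 = T_LIMIT / T1 - 1.0


def temperature(n_ratio):
    """T at the level where n / n1 = n_ratio, from (sigma n + 1) T = const."""
    return T_LIMIT / (1.0 + SIGMA_N1 * n_ratio)


def height_km(n_ratio):
    """Height at which n / n1 = n_ratio, from the second relation of (12.3)."""
    t = temperature(n_ratio)
    log_term = math.log(n_ratio * t / T1)
    return X1_KM - H1_KM * ((t / T1 - 1.0) + (SIGMA_N1 + 1.0) * log_term)


if __name__ == "__main__":
    for n_ratio in (1.0, 0.3, 0.1, 0.03):
        print(n_ratio, round(height_km(n_ratio)), round(temperature(n_ratio)))
```

For n/n₁ = 1, 0·3, 0·1 and 0·03 (log n = 13·0, 12·48, 12·0 and 11·48) the sketch reproduces the tabulated heights and temperatures to within about one unit.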


13. On account of its importance as a source of infra-red radiation, water vapour
requires special discussion. The equilibrium between N water-vapour molecules per
c.c. on the one hand and n hydrogen atoms and n′ oxygen atoms per c.c. on the other
is given by a formula similar to (9·1), namely,

$$ N = n^2 n' K(T), $$

and by the Willard Gibbs-Milne argument all the constituents are distributed
vertically in an isothermal atmosphere in gravitational equilibrium just as if no
dissociation took place or

$$ N = N_0 e^{-x/H}, $$ (13·1)

the value of H for water vapour being nearly equal to that of atomic (not molecular)
oxygen. In full equilibrium, therefore, the water-vapour population would gain over
the molecular oxygen population with increasing height. This argument, however,
overlooks the escape of hydrogen from the earth's atmosphere. The view was put
forward by Chapman & Milne (1920) that there is no appreciable amount of free
hydrogen in the atmosphere. This is generally accepted, and, if true, must have as
a consequence the result that water vapour falls off with increasing height far more
rapidly than is given by equation (13·1). If the temperatures at great heights were
known one could calculate the rate at which hydrogen atoms escape from the
atmosphere from a given population distribution; and at the same time work out the
populations of hydrogen atoms and water-vapour molecules at all heights if a single
constant were known: the rate at which water-vapour molecules are transported
over unit area of the tropopause. This constant is related to the total number of
water-vapour molecules in a column of unit cross-section above the tropopause.
As has been suggested earlier, it appears to be within the reach of observation in
the near future. The importance of this constant to ionospheric theory can hardly
be overestimated.

14. The physical arguments in §§ 6 to 13 are, from the point of view of the earlier
sections, arguments about the absorption coefficient in the infra-red and
ultra-violet as a function of height. The problem is complicated by the fact that one
requires a knowledge of the temperature to determine the constituents of the
atmosphere, so that even with the fullest knowledge of the atomic coefficients
involved one would be compelled to proceed by successive approximation from an
assumed temperature distribution. Since many quantities cannot at present be
evaluated accurately, this will not be done. One may, however, inquire whether
there are two substantially distinct regions heated by molecular oxygen and atomic
oxygen and separated by a temperature minimum.

Possibly the soundest method of estimating the atomic population at great 
heights is an appeal to the observed collision frequency. According to Farmer & 
Ratcliffe (1935), an electron makes 1-3 x 10® collisions per second in the region. 
This experimental result is confirmed by Martyn & Pulley (1936). By comparison 
with the collision frequency in the E region the latter authors deduce that the 
temperature in the F region is of order 1000 °. We may for definiteness suppose that 
at a height of 250 km. the molecular population is 4 x 10“, and that one-half of this 
is atomic oxygen (the rest being either atomic or molecular nitrogen). Assuming 
that at this height one is well above the levels where Schumann-Runge radiation 
is substantially absorbed, then from ( 9 - 2 ) and making an allowance for day and 
night averaging, jig = 6 x 10 *. Now at a temperature of 1000 ° the value of H for 
oxygen is 62-2 km., so that if the atmosphere is isothermal above 260km. we have 
by (10-3) 1’06 X 10*’ oxygen atoms and 3-10** oxygen molecules in a column of unit 
cross-section above this level-. Since for 0, k = 4'6 x 10 ~*®, the atomic oxygen 
radiation is just becoming appreciably absorbed at this point, being reduced to 
g-o-46 Qj. o.g 4 of itg incident value.* With a solar-energy curve of 6000° each oxygen 
molecule absorbs 4 x 10® times as much energy as an oxygen atom, and the two intakes 
of energy are roughly equal at 260 km. with the figures which we have adopted. 

Going upwards the temperature must increase, as the electron density decreases 
(the layer is the maximum electron density), so that atomic population gains over 
negative-ion concentrations with increasing height. Going downwards the electron 
population again decreases, so that the oxygen molecular population will gain over 
the negative-ion population. Hence unless water vapour is present in sufficient 
quantity to compete with the infra-red radiation from negative ions at the 260 km. 
level, that level is, on the figures adopted, a minimum of the temperature. Going 
downwards from 250 km. the temperature increases until water vapour is encountered. 
The fact that the recombination coefficient for the F1 region (say 180 km.) is 
much greater than that for the F2 region (say 260 km.) strongly suggests that there 
are many more negative (oxygen) ions in the lower region, and therefore that the 
temperature in the F1 region is considerably lower than the temperature in the F2 
region.* This may well be due to an appreciable water-vapour population in the 
lower region. 

* If the region is formed by the ionization of O to O+ one would expect this radiation 
to be reduced to of its initial value at the level of maximum ionization. 

The author acknowledges with pleasure the benefit he has derived from discussing 
ionospheric problems with Dr D. F. Martyn and Dr C. W. Allen. 


References 

Angenheister 1932 Terr. Magn. Atmos. Elect. 37, 431. 
Appleton & Naismith 1935 Proc. Roy. Soc. A, 150, 685. 
Chapman & Milne 1920 Quart. J. R. Met. Soc. 46, 357. 
Elvey & Roach 1937 Astrophys. J. 85, 213. 
Farmer & Ratcliffe 1935 Proc. Roy. Soc. A, 151, 370. 
Godfrey & Price 1937 Proc. Roy. Soc. A, 163, 228. 
Gold 1909 Proc. Roy. Soc. A, 82, 43. 
Gowan 1928 Proc. Roy. Soc. A, 120, 655. 
Gowan 1930 Proc. Roy. Soc. A, 128, 531. 
Ladenburg & van Voorhis 1933 Phys. Rev. 43, 315. 
Lindemann & Dobson 1922 Proc. Roy. Soc. A, 102, 411. 
Majumdar 1938 Indian J. Phys. 21, 75. 
Martyn & Pulley 1936 Proc. Roy. Soc. A, 154, 455, 463. 
Milne 1922 Phil. Mag. 44, 872. 
Milne 1924a Phil. Mag. 47, 209. 
Milne 1924b Proc. Camb. Phil. Soc. 22, 493. 
Pannekoek 1926 Bull. Astr. Insts Netherlds, 3, 207. 
Pannekoek 1930 Handbuch der Astrophysik, 3, 288. 
Willard Gibbs 1875 Sci. Papers, 1, 144, 171. 
Wulf & Deming 1936 Terr. Magn. Atmos. Elect. 41, 302. 
Wulf & Deming 1938 Terr. Magn. Atmos. Elect. 43, 283. 

* I owe this remark to Dr Martyn. 



Atmospheric tides in the ionosphere 
I. Solar tides in the F2 region 
By D. F. Martyn, D.Sc., 

Australian Council for Scientific and Industrial Research 
(Communicated by Sir Edward Appleton, F.R.S. Received 23 April 1946) 

A theory, based on solar tides, is advanced to explain the anomalous seasonal, diurnal and 
geographical variations of F2 region ionization. It is shown that the horizontal winds due 
to these tides must cause electrons to move along the lines of the earth’s magnetic field. 
The resultant motion has a vertical component. Account is taken of polarization of the 
medium by the ‘dynamo’ electric forces. Owing to viscosity the vertical motion decreases 
upwards in the F2 region. Application of the equation of continuity shows that the F2 region 
becomes greatly distorted. A ‘longitude effect’ is found to arise by reason of the asymmetry 
of the earth’s magnetic field. The theory is used to explain the high F2 ionization densities 
found in low latitudes, and the high values of at noon near the equator. It is also used 
to explain the afternoon and night-time increases in ionization found at certain locations. It 
is suggested that the effective recombination coefficient in F2 is much lower than the generally 
accepted values. It is shown that Appleton & Weekes’s evidence of lunar tidal effects in the 
E region does not conflict with the ‘dynamo’ theory of magnetic variations or with 
Pekeris’s calculations. 

Observational evidence of the existence of solar tides in the F2 region is presented. 


1. Introduction 

Soon after the experimental discovery of the ionized regions of the upper atmosphere 
Chapman (1931) proposed a theory of their production. According to this theory, 
if the ionization ultimately disappears as if by a recombination law, then a given 
radiation acting on a given atmospheric constituent will produce a maximum density 
of ionization N(max.) at a certain level in the atmosphere. This level will depend 
on the absorption coefficient of the constituent for the radiation, on the temperature 
of the atmosphere, and on the solar zenith distance $\chi$. When N(max.) reaches a temporal 
maximum, which usually happens very soon after noon, then theoretically it 
should attain a value proportional to $\cos^{1/2}\chi$. 
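
As a rough numerical illustration of the law just stated: for a simple Chapman layer in which ionization is lost by recombination, the peak density varies as the square root of the cosine of the solar zenith distance. The sketch below assumes a flat-earth geometry (zenith distances below 90°), and the function name is hypothetical.

```python
import math


def chapman_peak_ratio(zenith_deg):
    """N(max.) at zenith distance chi, relative to its overhead-sun value.

    Uses the simple Chapman result N(max.) proportional to sqrt(cos chi);
    only meaningful for 0 <= chi < 90 degrees (flat-earth assumption).
    """
    if not 0.0 <= zenith_deg < 90.0:
        raise ValueError("zenith distance must lie in [0, 90) degrees")
    return math.sqrt(math.cos(math.radians(zenith_deg)))


if __name__ == "__main__":
    for chi in (0, 30, 60, 80):
        print(chi, round(chapman_peak_ratio(chi), 3))
```

On this law the noon N(max.) would fall smoothly from the subsolar point, which is the behaviour the observed F2 region is said not to follow.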

This theory has been applied by Appleton to a number of ionospheric problems, 
and with particular success to the E and F1 regions. All attempts to apply it to the 
uppermost and most densely ionized region, the F2 region, have met with serious 
difficulties. For example, the noon value of N(max.) in this region is usually much 
greater in winter than in summer, while the distribution of noon N(max.) over the 
globe is much more complicated than that indicated by the $\cos^{1/2}\chi$ law. (In a recent 
unpublished note on the ‘Morphology of the F2 region’, Appleton has produced 
evidence that noon N(max.) depends to some extent on magnetic latitude.) 

A suggested explanation of the low values of summer N(max.) put forward 
independently by Hulbert (1934) and Appleton (1935) is that the F2 region is much 
hotter (some ten times) in summer than in winter. Martyn & Pulley (1936), however, 
have advanced evidence that the F2 region attains a temperature of the order 
1000° C all the year round, a conclusion which receives strong theoretical support 
from considerations of radiative equilibrium in that region (Godfrey & Price 1937; 
Woolley 1946). 

It appears desirable, therefore, to explore other possible explanations of the 
peculiarities of the F2 region. Up till now, with the exception of the early work of 
Hulbert (1929), theoretical treatments of the ionosphere have assumed that the 
upper atmosphere was essentially static. We explore here certain aspects of a 
dynamical theory of the ionosphere. In this series of papers the view is propounded 
that the F2 region is subject normally to marked distortion by the vertical transport 
of ionization with velocities varying both with height and time. In this way the 
observable quantities N(max.), h^max, the height at which N(max.) occurs, and h', 
the minimum equivalent height, can be considerably affected, even though the total 
ionization in the region remains unaltered. (In general, however, the total ionization 
in the region will be affected if the effective recombination coefficient α' is a function 
of height.) 

Evidence will be adduced that such vertical movements of ionization are caused 
by the air movements associated with the solar atmospheric tide. Although this 
tidal motion is predominantly horizontal, ions in the upper atmosphere are constrained 
to move along the lines of the earth’s magnetic force, which has a vertical 
component except at the magnetic equator. Looked at in another way, which it 
will be shown later leads to an identical result, the horizontal movement of ionized 
air across the vertical component of the earth’s magnetic field gives rise to horizontal 
electric forces, as in the ‘dynamo’ theory of magnetic variations originally 
due to Balfour Stewart (1882) and developed by Schuster (1908) and Chapman (1919). 
Now these horizontal electric forces, acting across the horizontal component of the 
earth’s field, will give rise to a vertical transport of ions, all kinds of which will move 
in the same direction. It is a further feature of the ‘dynamo’ theory that a certain 
distribution of electrostatic charge must occur over the earth’s atmosphere such as 
to produce a horizontal electric field which in low latitudes opposes the weaker 
dynamo field, so closing the current circuit in each hemisphere. It follows that even 
near the magnetic equator there exist electrodynamical forces capable of causing 
vertical transport of ionization, though in these regions such movement will have 
a sense opposite to that happening simultaneously at high latitudes. 

In the present paper data from a number of ionospheric observatories are 
analysed, and a number of phenomena presented which show the existence of a 
large semi-diurnal harmonic in N(max.), h^max and h'. An electrodynamical theory 
of the vertical transport of ions will then be developed, starting from the general 
surface harmonic term in the velocity potential of the solar tide. Although modern 
tidal theory casts doubt on the existence of a true velocity potential, the main features 
of the actual tide could be represented by a small number of such terms. It 
will be found that the asymmetry of the earth’s magnetic field with respect to the 
geographic poles introduces longitude terms whose effects are specially important 



243 


Atmospheric tides in the ionosphere 

between latitudes 25 and 65 °. An examination of the effect on and of the 
vertical gradient of velocity will then be made. In conclusion, the application of 
the theory to the interpretation of the main anomalies in region behaviour will 
be discussed. 

In subsequent papers in this series it is intended to apply the theory in detail to 
the elucidation of lunar tidal effects, and to the mechanism of formation of the 
so-called abnormal or sporadic H ionization. 


2. Evidence suggesting the existence of tidal 
circulations in the F2 region 

2-1. Before proceeding to develop this theory quantitatively in detail, it is 
desirable to set out certain data which appear to suggest the importance of tidal 
circulations in the ionosphere. 

In a recent communication Martyn (1945) has presented evidence that noon h' 
at Huancayo, near the magnetic equator, shows a seasonal variation which is 
like that recorded at northern hemisphere observatories for the months when the 
sun is north of Huancayo, and like that at southern observatories when the sun is to 
the south. The somewhat more complicated seasonal variation of noon N(max.) 
shows a similar phenomenon. This behaviour, which is similar to that of the 
diurnal variation of magnetic declination at Huancayo (Fleming 1939), suggests 
that the variations of h', like the magnetic variations, may be due to hemisphere-wide 
tidal circulations. 

2·2. Some further evidence suggestive of tidal action in the F2 region is derived 
from a study of the times at which h' reaches maximum values in the course of its 
diurnal variation. It is a striking feature of the results from many ionospheric 
observatories that h' and h^max attain maximum values not only near noon but also 
near midnight. This is clearly demonstrated in figure 1, which shows the diurnal 
variations of h' and h^max for Mount Stromlo (Canberra), averaged over the 4 years, 
1941-4. While various hypotheses, such as that of heating, might be invoked to 
account for a noon maximum, it is difficult to see why another maximum should 
ensue near midnight. On the other hand, if this tidal hypothesis is correct, a 12 hr. 
periodicity in such phenomena should be expected. 

To examine this point more closely the monthly mean hourly values of h'F2 for 
Mount Stromlo have been selected, which have the advantage of being available 
without interruption for 100 months, from April 1937 to date. The mean diurnal 
curve for each month was plotted and examined for maxima. In general, one 
maximum only was found during the night hours, and the time of its occurrence 
was noted to the nearest half-hour. During the day-time there was normally only 
one maximum present, and its time also was noted. Occasionally, particularly in the 
summer months, the curve took a small dip near the normal maximum, giving two 
maxima close together. In this case the time recorded was the mean time of the two 
close maxima. This method of treating the data, though obviously crude, suffices to 
demonstrate the point. In figure 2 are plotted the times of the day and night maxima 
respectively. It will be seen, first, that over the whole period the average day-time 
maximum occurs very nearly at noon, while the average night maximum occurs 
very close to midnight. There is thus an average periodicity of almost exactly 12 hr. 
in the occurrence of the maximum over 100 months. Secondly, it will be observed 
that the time of occurrence of each maximum shows a seasonal swing, being about 
an hour before midday (or midnight) in summer, and an hour after midday (or midnight) 
in winter. The fact that the curves swing in phase from summer to winter, 
constantly preserving a time difference of 12 hr., gives further support to the 
hypothesis that one is concerned with a semi-diurnal solar tide. 



Figure 1. Average diurnal variations of h' and h^max, Canberra, 1941-4 inclusive. 

Figure 2. Monthly average time of occurrence of maximum h'F2 at Canberra, 1937-45. 
Curve I, day-time maximum; curve II, night-time maximum. 



A more accurate study of this point is provided by harmonic analysis of the 
diurnal variations. For this purpose h^max, as determined by the method of Booker & 
Seaton (1939), has been selected. This quantity is apparently the most definite index 
of height variations in the F2 region available without excessive labour. The data 
subjected to analysis are again for Mount Stromlo, consisting of the monthly mean 
hourly values of h^max averaged over the years 1941-4 inclusive. In table 1 the results 
of this analysis are presented for the first four harmonics, whose amplitudes are 
represented by P1, etc. km., and times of maximum by t1, etc. hr. after local 
midnight. 
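
A harmonic analysis of this kind can be sketched as follows. This is an illustrative reconstruction and not the author's computing scheme: it assumes 24 equally spaced hourly means starting at local midnight, and the function name and return convention are hypothetical. Each harmonic k is reported as an amplitude and the time of its first maximum after midnight, which lies between 0 and 24/k hr.

```python
import math


def harmonic_analysis(hourly_values, n_harmonics=4):
    """Return [(amplitude, time_of_first_max_hr), ...] for harmonics 1..n.

    hourly_values: equally spaced samples over one day, the first taken
    at local midnight (an assumption of this sketch).
    """
    n = len(hourly_values)
    period = 24.0
    results = []
    for k in range(1, n_harmonics + 1):
        a = 2.0 / n * sum(v * math.cos(2 * math.pi * k * i / n)
                          for i, v in enumerate(hourly_values))
        b = 2.0 / n * sum(v * math.sin(2 * math.pi * k * i / n)
                          for i, v in enumerate(hourly_values))
        amplitude = math.hypot(a, b)
        # The k-th term is amplitude * cos(2*pi*k*t/24 - phase).
        phase = math.atan2(b, a)
        t_max = (phase / (2 * math.pi * k) * period) % (period / k)
        results.append((amplitude, t_max))
    return results


if __name__ == "__main__":
    # Synthetic day: a 15 km semi-diurnal term peaking at 00.5 and 12.5 hr.
    day = [300 + 15 * math.cos(2 * math.pi * 2 * (t - 12.5) / 24) for t in range(24)]
    for k, (amp, t) in enumerate(harmonic_analysis(day), start=1):
        print(k, round(amp, 2), round(t, 2))
```

A semi-diurnal term has two maxima a day, so a time of maximum near noon and one near midnight describe the same phase; the sketch reports the earlier of the two.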

It will be observed from the table that P2 is by far the most consistent harmonic, 
in both amplitude and phase. P1, although larger than P2 in winter, is subject to 
such large changes of amplitude and phase as would make one hesitate to suggest 
the presence of a diurnal tidal influence. In this connexion it will be realized that if 
the semi-diurnal tide affects h^max differently by day than by night (a conclusion 
which will be reached subsequently by theoretical reasoning), then it would give 
rise to a P1 term in h^max. It will also be noticed that the phase of P2 swings from winter to 
summer in a manner similar to that exhibited by h' in cruder if more graphic form 
in figure 2. 

Table 1. Harmonic analysis of average diurnal variation of h^max 
at Mount Stromlo for years 1941-4 inclusive 

              amplitude in km.          time of max. in hr. after midnight 
month     P1      P2      P3     P4       t1      t2      t3     t4 
Jan.      2·07    20·4    6·8    6·7      7·9     11·6    6·2    3·3 
Feb.      6·40    19·8    7·6    6·2      22·6    11·7    3·9    4·1 
Mar.      23·2    16·6    5·1    6·4      0·2     12·2    4·8    4·2 
Apr.      30·4    14·4    6·6    2·9      0·5     12·4    4·2    1·5 
May       31·5    14·4    4·8    3·8      0·2     12·8    3·3    1·6 
June      31·1    13·6    3·9    1·7      0·3     12·7    4·0    0·6 
July      26·3    13·4    7·1    3·4      0·2     12·5    3·7    0·6 
Aug.      22·7    14·0    6·9    3·1      1·0     11·8    3·9    5·9 
Sept.     15·8    14·7    7·7    2·6      1·0     10·8    4·1    4·6 
Oct.      10·6    16·1    4·3    2·8      3·6     10·9    5·8    3·8 
Nov.      5·75    18·4    M      6·6      3·3     10·9    6·4    3·2 
Dec.      4·13    14·9    3·2    9·2      22·1    11·7    5·6    3·2 

2·3. The evidence so far advanced relates only to h', though mention has been 
made in § 2·1 of the fact that N(max.) at Huancayo shows a behaviour similar to 
that for h'. It is common knowledge that there is a close relation between h' and 
N(max.) for the F2 region, large values of h' being usually accompanied by low values 
of N(max.). It must be expected, however, that any semi-diurnal tidal term in 
N(max.) will be overshadowed by the large diurnal ionizing term due to direct solar 
radiation. In seeking the former term it is clearly preferable to compare the diurnal 
variation of N(max.) at two or more latitudes so chosen that while the ionizing term 
(proportional to $\cos^{1/2}\chi$) varies little, the tidal term (whose variation with latitude is 

developed in § 4·2 below) varies markedly. Three suitable sites for this purpose are 
Cape York, Queensland (lat. 11·0° S), Brisbane, Queensland (lat. 27·6° S) and 
Canberra, A.C.T. (lat. 36·3° S). The monthly mean hourly values of f_F2 for the equinoctial 
month March 1945 are selected. This month is chosen for the reason that, the 
sun being over the equator, asymmetrical terms are absent in the tidal potential. 
In figure 3 are plotted two curves: (1) the difference between corresponding hourly 
values of f_F2 at Cape York and Canberra, and (2) the difference between the same 
quantities for Brisbane and Canberra. 

It is at once apparent that the main variable term in the difference between the 
diurnal variations at each of these two pairs of stations is semi-diurnal, in accordance 
with the previous findings for h' and h^max. There is also a large constant-difference 
term between the average diurnal values of f_F2 at the three latitudes, as evidenced 
by the displacement of the graphs above the zero of f_F2. The average values at Cape 
York, and to a lesser extent Brisbane, are clearly considerably higher than those 
at Canberra. A simple explanation of this phenomenon will emerge from the theory 
outlined below. 



Figure 3. Difference between average diurnal variations of f_F2 at Cape York, Brisbane and 
Canberra for March 1945. Curve I, Cape York-Canberra; curve II, Brisbane-Canberra. 

3. Motions of ions due to horizontal winds 

3·1. In figure 4, $x$, $y$, $z$ form a system of right-handed axes. The earth’s magnetic 
field $H$ is directed along the $z$ axis, which is inclined downwards, thus simulating 
conditions in the northern hemisphere. CA lies in the magnetic meridian, so that $I$ 
is the magnetic dip. AB is the (horizontal) vector representing the air velocity $c_0$, 
so that $\pi+\phi$ is the direction of this wind with respect to magnetic north. 

Figure 4. Diagram of right-handed axes system used in 
deduction of components of ionic motion. 

In the absence of a magnetic field ions would be swept along with the wind with 
the velocity $c_0$. Such motion may be produced by the action of an average force $F_0$, 
due to the air. The application of the simplest considerations of kinetic theory to a 
lightly ionized gas gives 

$$F_0 = m\nu c_0,$$

where $m$ is the mass of an ion, $\nu$ is the mean collision frequency of an ion with the air 
molecules or atoms, neglecting constants of order unity which would arise from 
considerations of velocity distribution. $F_\parallel$ and $F_\perp$ are written for the components of 
$F_0$ along the $z$ and $x$ axes respectively. The presence of a magnetic field will not affect 
$F_0$ but will have a profound influence on the resulting ionic motion. If $x'$, $y'$, $z'$ be 
the mean ionic velocities in the presence of $H$, then it may readily be shown (Chapman 
& Bartels 1940) that 

$$x' = \frac{F_\perp}{m\nu}\,\frac{\nu^2}{\nu^2+\omega^2} = c_\perp\frac{\nu^2}{\nu^2+\omega^2}, \tag{1}$$

$$y' = \frac{F_\perp}{m\nu}\,\frac{\nu\omega}{\nu^2+\omega^2} = c_\perp\frac{\nu\omega}{\nu^2+\omega^2}, \tag{2}$$

$$z' = c_\parallel, \tag{3}$$

where $\omega = He/m$, 
$e$ is the ionic charge, and $c_\parallel$ and $c_\perp$ are the components of $c_0$ along the $z$ and $x$ axes 
respectively. Equations (1) and (3) express the well-known conclusion that diffusion 
of ions across a magnetic field is impeded, while that along the field is unaffected. 
Equation (2) expresses the ‘dynamo’ electric force, motion of ions of opposite signs 
being in this case in opposite directions owing to the presence of $\omega$ in the first power 
in the equation. In general, then, (2) will give rise to an electric current and also to 
a polarization of the medium. 

3·2. Consider first the case where polarization is negligible. If $v_1$ be the vertical 
component of ionic mass movement or ‘drift’, measured positively downwards, 
then by equations (3) and (1), and remembering that $c_\parallel = -c_0\cos\phi\cos I$, 

$$v_1 = -\tfrac{1}{2}c_0\cos\phi\,\sin 2I\,\frac{\omega^2}{\nu^2+\omega^2}. \tag{4}$$

In the F2 region $\omega \gg \nu$ for both electrons and ions, so that the vertical drift velocity 
is very nearly equal to the vertical component of the air velocity component $c_\parallel$. 
There is thus a vertical migration of ions of all kinds upwards or downwards according 
as $\phi \lessgtr \tfrac{1}{2}\pi$, with a velocity of the same order of magnitude as the horizontal wind velocity, 
and given exactly by equation (4). 
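
The closed form for the vertical drift can be checked by resolving the wind directly into components along and across the field. The sketch below is an independent check under assumed conventions, not the author's derivation: axes are taken as (east, north, down), the field dips below magnetic north at angle I, the wind direction is measured so that its north component is −c0 cos φ, and the motion y' of equation (2) is left out because it is opposite for ions of opposite sign and so is a current, not a mass drift. All function names are hypothetical.

```python
import math


def vertical_drift_by_components(c0, phi, dip, nu, omega):
    """Downward ion drift from the parallel and impeded transverse motion."""
    # Horizontal wind in (east, north, down) axes; east sign is an assumption.
    wind = (-c0 * math.sin(phi), -c0 * math.cos(phi), 0.0)
    # Unit vector along the field: northward and downward (northern hemisphere).
    h_unit = (0.0, math.cos(dip), math.sin(dip))
    c_par = sum(w * h for w, h in zip(wind, h_unit))
    c_perp_vec = tuple(w - c_par * h for w, h in zip(wind, h_unit))
    impeded = nu ** 2 / (nu ** 2 + omega ** 2)   # factor in equation (1)
    ion = tuple(c_par * h + impeded * p for h, p in zip(h_unit, c_perp_vec))
    return ion[2]


def vertical_drift_closed_form(c0, phi, dip, nu, omega):
    """Equation (4): -(1/2) c0 cos(phi) sin(2I) omega^2 / (nu^2 + omega^2)."""
    return (-0.5 * c0 * math.cos(phi) * math.sin(2 * dip)
            * omega ** 2 / (nu ** 2 + omega ** 2))


if __name__ == "__main__":
    args = (50.0, math.radians(30), math.radians(60), 1.0, 200.0)
    print(vertical_drift_by_components(*args), vertical_drift_closed_form(*args))
```

With these conventions the two routes agree for any wind direction, and a purely east-west wind gives no drift of this kind.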

3·3. Let it now be supposed that the polarization of the medium is such that the 
current due to $y'$ is inhibited. Let the electric field due to this polarization be $E_1$, 
and let $y'_1$ be the $y$ velocity to which it gives rise; then for no current it follows that 

$$y'_1 = -y', \quad\text{or}\quad E_1 e = m\omega c_\perp.$$

Due to $E_1$ there is now an additional $x$ velocity $x'_0$ such that 

$$x'_0 = \frac{E_1 e}{m\nu}\,\frac{\nu\omega}{\nu^2+\omega^2} = c_\perp\frac{\omega^2}{\nu^2+\omega^2}.$$

The total $x$ velocity is now 

$$x' + x'_0 = c_\perp.$$

The ionic flow along the $x$ axis is now unimpeded by the magnetic field, and on 
combining the flow along the $x$ and $z$ axes $c_0$ is obtained. Thus when polarization 
prevents the flow of current due to the ‘dynamo’ electric force the ions flow in the air 
stream completely unaffected by the magnetic field, and no vertical drift occurs. 

3·4. In general both current flow and polarization will occur in the earth’s 
atmosphere. In fact some polarization will usually be necessary in order to permit 
the current to flow in a closed circuit, as in the dynamo theory of magnetic variations. 
In such events the total vertical ionic drift may be deduced by combining 
$v_1$, that due directly to air motion, with $v_2$, that due to the electric polarization 
field $E$, which may be capable of specification when the boundary conditions are 
laid down. Ionic drift can only be produced by the component $E_\perp$ of $E$ perpendicular 
to $H$. If $E_\perp$ is further resolved into two components $E'_\perp$ and $E''_\perp$ lying respectively 
in the horizontal plane and in the vertical plane containing $H$, then only $E'_\perp$ 
can produce vertical drift. If $E'_\perp e$ is written for $F_\perp$, and the axes rotated about $z$ 
until that of $x$ is horizontal and westerly, then application of equation (2) gives 

$$y' = \frac{E'_\perp e}{m\nu}\,\frac{\nu\omega}{\nu^2+\omega^2} = \frac{E'_\perp}{H}\,\frac{\omega^2}{\nu^2+\omega^2},$$

and so 

$$v_2 = \frac{E'_\perp}{H}\,\frac{\omega^2}{\nu^2+\omega^2}\cos I, \tag{5}$$

the disappearance of $e$ from the equations indicating that $y'$ (and $v_2$) are now drift 
velocities independent of ionic sign. 
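
In the limit of negligible collisions the drift produced by a horizontal electric field reduces to the familiar drift at right angles to both the electric and magnetic fields, of speed E/H in consistent electromagnetic units. The sketch below checks that a westerly field then gives a downward drift (E/H) cos I, in agreement with equation (5) when ν is set to zero. The axes (east, north, up), the units and the function names are assumptions of this illustration.

```python
import math


def cross(a, b):
    """Vector product of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def downward_drift_collisionless(e_west, h_total, dip):
    """Downward component of (E x H)/H^2 for a westward horizontal field E."""
    e_field = (-e_west, 0.0, 0.0)                       # (east, north, up)
    h_field = (0.0, h_total * math.cos(dip), -h_total * math.sin(dip))
    drift = tuple(c / h_total ** 2 for c in cross(e_field, h_field))
    return -drift[2]


def downward_drift_eq5(e_west, h_total, dip, nu, omega):
    """Equation (5): (E/H) * omega^2/(nu^2 + omega^2) * cos I."""
    return e_west / h_total * omega ** 2 / (nu ** 2 + omega ** 2) * math.cos(dip)


if __name__ == "__main__":
    print(downward_drift_collisionless(2.0, 0.5, 1.0),
          downward_drift_eq5(2.0, 0.5, 1.0, 0.0, 3.0))
```

Collisions reduce the drift by the factor ω²/(ν²+ω²) without changing its direction, and the result does not depend on the sign of the charge.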

It is now possible to deduce the total vertical ionic drift, provided the horizontal 
air velocity and electrostatic field be known. 


4. Vertical motions of ions due to atmospheric tides 

4·1. It will be assumed that the horizontal winds associated with atmospheric 
tides can be derived from a series of velocity potentials, a typical term of which, 
$\psi_n^\sigma$, is a surface harmonic of the general form 

$$\psi_n^\sigma = A_n^\sigma P_n^\sigma(\cos\theta)\,\sin\{\sigma(\lambda+t)-\alpha\},$$

where $A_n^\sigma$ is a constant factor expressing the amplitude of the harmonic of order $\sigma$ 
and degree $n$, $P_n^\sigma$ is the corresponding associated Legendre function, $\lambda$ is the 
longitude at the selected point, $t$ is time referred to a fixed meridian and $\alpha$ is a phase 
constant. $v_1$ is now derived in terms of $\psi$. Equation (4) shows that $v_1$ is proportional 
to the north component $-c_0\cos\phi$ of wind velocity, which may be 
expressed as $\partial\psi/a\,\partial\theta$, where $\theta$ is the co-latitude, and $a$ the radius of the earth. $I$ may 
also be expressed in terms of $\theta$ using the well-known relationship $\tan I = 2\cot\theta$. 
Equation (4) then becomes 

$$v_1 = \frac{\sin 2\theta}{4-3\sin^2\theta}\,\frac{\partial\psi}{a\,\partial\theta}\,\frac{\omega^2}{\nu^2+\omega^2}.$$
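
The step from the dip to the co-latitude rests on the dipole relation tan I = 2 cot θ, from which sin 2I = 2 sin 2θ/(4 − 3 sin²θ). The short check below confirms the identity numerically; the function names are hypothetical, and a centred dipole field is assumed, as in the text.

```python
import math


def dip_from_colatitude(theta):
    """Magnetic dip I for a centred dipole, from tan I = 2 cot(theta)."""
    return math.atan2(2.0 * math.cos(theta), math.sin(theta))


def sin_twice_dip(theta):
    """Closed form sin 2I = 2 sin(2 theta) / (4 - 3 sin^2(theta))."""
    return 2.0 * math.sin(2.0 * theta) / (4.0 - 3.0 * math.sin(theta) ** 2)


if __name__ == "__main__":
    for colat_deg in (20, 45, 70, 110):
        theta = math.radians(colat_deg)
        print(colat_deg,
              round(math.sin(2 * dip_from_colatitude(theta)), 6),
              round(sin_twice_dip(theta), 6))
```

The factor sin 2I vanishes at the poles and at the equator, so the drift due directly to a northward wind is greatest in middle latitudes.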


In deriving V2 it is assumed, in the first instance, that the conductivity, at the selected 
level, is uniform over the earth. In this case Schuster (1908) has shown that the 
horizontal electrostatic field is derivable from a potential S given by 


It follows in this case that 


„ Hsmldih-1 

.S= + - n.n + 1 . 

00s a dA 


and so, by equation ( 5 ), 


dS 


2HmiId^\lr 


I 1 :—=-I- . ■,<« in .n+\, 

asmOdX ^ asm 26 d\^! 


jn.i 


sin 2/ 


0)2 (^2^ 


-1-1) a sin 20 v®+<i>®dA® 
2 o'hl/' oF 


%(»-!-l)a( 4—3 sin^d) 


( 5 a) 



Hence 

$$v = v_1 + v_2$$


_ [ . ^a^rjr 2 ( 7 ® ,1 

“ o(v®+o®) ( 4-3 sin® 6 ) ^'dd n(n + 1 ) ’ 

Applying now the reduction formula 

dP 

= <7COt0P^<,-P„,^+l, 

and expressing \jr in its most general form, then 

___ 

^ ~ a(v®+<t)®) ( 4—3 sin® 6 ) 

X i^^s[2cr^cos® ^ - sin sin (<7(A + t)- <). 

In the important case of the sectorial harmonics (n=<r}, the second term within the 
brackets vanishes, since P^^^+i = The dependence of v on co-latitude 6 is illustrated 
in figure 5 a for the sectorial harmonics ijrl and ^|, and for the tesseral harmonics 
^ and i/rl. In the figure is plotted the fonction F( 6 ) given by 

_ Q*\ 1 ^ 4 ' 

^{ n+<r) \ ] roughly to normalize the coefficients of the 

various harmonics. It is seen from the figure that, in general, for each harmonic 
the sense of the vertical velocity changes in passing from high to low latitudes. 

Figure 5a. Dependence of vertical electron velocity on co-latitude for various tidal harmonics. 
Curve I, F(d) for curve 21, J'(d) for curve ni, F(d) for fl; curve IV, F{6) for 




4·2. When the non-uniformity of conductivity over the earth is taken into 
account the problem presents considerable analytical difficulties. Use may be made 
of certain results of Schuster (1908) in order to assess the importance of this effect. 
The latter considered a variation of the conductivity $\rho$ given by 

p = Po{l-l-j77-sia(J cosd-t-jTTCOsdsmd cos (A 4 -i)}, 

where S is the solar declination. 

It may then be shown, following Schuster, that 

H sin I 

^ S3P«,.rCos(r(A-f-0-a;). 

The tidal velocity potential whose amplitude is A%^ now gives rise to an infinite 
series of harmonics in the polarization field. The amplitudes of these harmonics 
depend on the coefficients AJt, some of which have been evaluated by Schuster for 
the two cases Al and A\. The more important of these are as follows: 

Case 1. Al: kl = = -r/ 72 . 

Case 2 . A^: ^ = -^ 7 , = 

where 7 = j7rcosd and y^sa^Trsind. 

To compare the amplitudes of the vertical velocities due to the various harmonics 
dSjdX must be compared for each harmonic. Thus for the case of Al the ratio 
(S!'j)ll{E'j_)l is found to be 67r(cos® 6 —i )/98 sin* 6 , which is much less than unity save 
in the immediate vicinity of the poles. A similar result is obtained on comparing 
the other principal harmonics arising from non-uniform conductivity with the 
main term. 

It appears therefore that such inferences as may be made regarding vertical 
motions of ions from the simple treatment given above in § 4·1 are not likely to be 
seriously affected by the assumption of uniform conductivity. This conclusion is 
not, of course, applicable to the dynamo theory of magnetic variations, which is 
ultimately concerned with horizontal electric currents, and not, as in this paper, 
with horizontal electric forces. 

4·3. So far consideration has been given to the consequences of horizontal motions 
of ions across only the vertical component of H. East-west motion across the 
horizontal component of H must give rise to vertical electric forces whose effects 
will now be examined. Around a given parallel of latitude these forces will be directed 
upwards at some points and downwards at others, so that polarization may be 
expected to result in the completion of current circuits in the vertical east-west 
plane. In his classical paper Schuster drew attention to this possibility (1908, 
para. 8), but pointed out that the magnetic consequences at ground-level would be 
negligible, since the sum of the current flow at all levels above a given point due to 
this cause would be zero. Our concern, however, is with the consequences, at any 
selected level, of horizontal electric forces at that level. Clearly there is here a cause 
of vertical ionic motion whose magnitude must be examined. 

Pekeris (1937) has shown that the amplitude of the tidal velocity changes 
markedly with height when account is taken of the vertical temperature gradient. 
There is also a change of phase of $\pi$ at a certain level. Some unpublished work by 
G. Odgers seems to show that further changes of phase and amplitude occur when 
Pekeris’s work is extended to the higher regions of the ionosphere above 100 km. For 
the purposes of the present analysis it will be assumed that $\psi$ contains a factor 
$\sin(by-\beta)$, where $y$ is the height above an arbitrarily selected datum level, $\beta$ is 
unknown, and $b$ is of order of magnitude $(100\ \text{km.})^{-1}$. The $x$ axis, horizontal and pointing 
positively to magnetic east, is chosen. The surface conductivity $\rho_s$ is assumed 
uniform in the vertical current sheet to be considered. $E_x$, $E_y$ are written for the 
dynamo electric forces in the $x$, $y$ directions respectively, and $E'_x$, $E'_y$ for the corresponding 
forces due to the polarization potential $S$. The current function in the 
sheet is represented by $R$. Then 

= 0, Ey = —^^jff/siad, cos I 


= Hjava.6 .008,mi{by—^) cos(crA —a) 

= jB/sin 0. sin {by - fi) cos (crA—a), say, 


V'-Jl W'- 

dx’ 


dy^ 


= ~Ps^> ^ = -PsaT -pg£/sinB.cos(ora-iiB-a)sin(i)3/-^). 


_dR__ M ^ 

dy ~ ^^dx ’ dx ~ ^’‘dy 

Eliminating $S$ from the latter two equations then 

d^B 

= -PgJ 5 /sin^.(ra-i sm((ra-%-a) sin{6y-/?), 

the relevant solution of which is obviously 

B — apg J 5 /sin d. cr{a^b^ sin {cra~^x — a) sin {by—^). 

It follows that E'j_ (as formerly defined) is given by 




El = -p; 


dy 


- H /sin 6 . cos Ia%{a%^ -f ^ cos {by —/B), 


and so, by equation (6) 


sin 6 b (0^ 

“ ~4:-Z8m^ea^^+(7^v^+0^ 


To assess the magnitude of this velocity due to vertical polarization, compare it 
with that in equation (5a), which gives the corresponding velocity due to horizontal 
polarization. Neglecting terms of order unity it is seen that the ratio of the two 
velocities is $ab/(a^2b^2+\sigma^2)$. Now $a$ is 6400 km. and $b$ is of order $(100\ \text{km.})^{-1}$, so that for 
all the lower order harmonics likely to be of importance, $v$ due to vertical polarization 
is small compared with that due to horizontal polarization, and will not be 
discussed further. 
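
The size of this ratio is easily evaluated. The sketch below assumes that b is the reciprocal of a height of about 100 km, which is an interpretation of the order of magnitude quoted and not a value fixed by the text; the function name is hypothetical.

```python
def polarization_velocity_ratio(a_km, b_per_km, sigma):
    """Ratio ab / (a^2 b^2 + sigma^2) of the vertical-polarization drift
    to the horizontal-polarization drift, terms of order unity neglected."""
    ab = a_km * b_per_km
    return ab / (ab ** 2 + sigma ** 2)


if __name__ == "__main__":
    earth_radius_km = 6400.0
    b_assumed = 1.0 / 100.0   # per km; assumed reading of the order of magnitude
    for sigma in (1, 2, 3):
        print(sigma, round(polarization_velocity_ratio(earth_radius_km, b_assumed, sigma), 4))
```

With these values ab is 64, so for the low-order harmonics the ratio is close to 1/64 and the effect of vertical polarization is a correction of a few per cent at most.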

4·4. So far it has been assumed that the magnetic and geographic poles coincide. 
The removal of this assumption leads to the appearance of terms in $v$ whose magnitudes 
depend on $\lambda_0$, the longitude measured from the meridian (68° W of Greenwich) 
which contains the earth’s magnetic pole. The general problem has been attacked 
by Schuster (1908) and Chapman (1919) in developing the dynamo theory of 
magnetic variations. These authors’ results will be applied to this problem for the 
special case of $\psi_2^2$, which tidal theory indicates as likely to be the most important 
term in the velocity potential. If the earth is regarded as a uniformly magnetized 
sphere, whose magnetic axis is inclined at an angle $\Phi$ to the geographical axis, then 
it follows from these authors’ results (e.g. Schuster 1908, p. 171), that the additional 
term in is 

^ i^-^i^*a»^^{(27-Pi,i-2P8.i)sin(2«'-Ao-a)-hP8.3sin(2r-l-Ao-a)} 

= a~^Gtan< 5 j 4 |oo 8 d {^[(4 — 3 cos 2 d)®cos®A 8 -)-sin®Ao]sm( 2 i' —a-l-e), (6) 

where $G$ is a constant of the earth’s field, and $\tan\epsilon = (3\cos 2\theta - 4)^{-1}\tan\lambda_0$. The
expression under the square-root sign has its maximum value when $\lambda_0 = 0$ or $\pi$, and its
minimum when $\lambda_0 = \tfrac{1}{2}\pi$ or $\tfrac{3}{2}\pi$. When $\lambda_0 = 0$ the additional term in $R$ becomes

—tan<PJ.| cos 0 ( 4—3 cos 20) sin (2i'—a), 

and so the additional term in $v$ is

/>)2 

— a~^ H~W cos I tan 0 A\ ^( 4—3 cos 20 ) sin 

Now when $\lambda_0 = 0$ the well-known formula for the distribution of E round a
magnetized sphere becomes

Gcos/_ 2 sin (0—0) 

H 4 - 3 sm 2 ( 0 - 0 )’ 

so that the additional term in $v$ is

0-1.41 tan0^;5^2 cos 0 ( 4 - 3 cos 20 ) sin {%t' - a). 

(This expression contains the additional terms in both parts of $v$, that due to the dynamo
field and that due to the polarization field, since equation (6) has been derived from
Schuster’s current function, which includes both.) The function $F_1(\theta)$ given by





D. F. Martyn 

is plotted in figure 5b. The factor 0·068 is the product of the Schmidt normalizing
factor for $\psi_2^2$ and of $\tan\phi$, which has the value 0·202. It will be observed that $F_1(\theta)$
is unsymmetrical about both the geographic and the magnetic equators. It attains
its greatest value south of both equators for the meridian $\lambda_0 = 0$.



Figure 5b. Additional vertical velocity term due to magnetic asymmetry.


The term in $v$ due to magnetic asymmetry may now be compared with the main
term, for the case of $\psi_2^2$, by comparing the curve in figure 5b with curve II in figure 5a.
Study of these curves shows that the asymmetric term is comparable with the main
term over a range of latitudes from about 26 to 55°. It is concluded that over this
range of latitude there will appear a ‘longitude effect’ in $v$, whose magnitude and
phase will depend on longitude measured from 68° W of Greenwich.

5. The influence of velocity-height gradient

5·1. It remains to consider the gradient of $v$ with height, and the effect of this on
the form of an ionized region. The work of Pekeris (1937) and of Odgers (unpublished)
shows that in the lower regions of the ionosphere the variation with height of tidal
velocities (on which $v$ ultimately depends) is not easily formulable. In these regions
damping is negligible and tidal velocities are a very complicated function of the
temperature-height gradient. There is little doubt, however, that at the height of
the $F_2$ region damping due to viscosity and heat conduction are important terms in
the equations. (Pekeris, indeed, concludes that at 200 km. damping is very large,
and dominates the other terms, but it seems certain that this is an over-statement
based on the assumption, no doubt for reasons of mathematical simplicity, that the
average temperature of the ionosphere is only 210° K.) Both the kinematic viscosity
and the thermometric conductivity, on which damping depends, are inversely
proportional to the gas pressure, which decreases exponentially with height. It
seems reasonable to assume then that, in the $F_2$ region, tidal velocities, and therefore
$v$, decrease exponentially with height. Therefore


$$ v = V e^{-\gamma z}, $$


where $z$ is the ‘reduced’ height (Chapman & Bartels 1940, vol. 1, p. 606), measured in
units of $H_0$, the ‘height of the homogeneous atmosphere’, and $\gamma$ is a positive constant
of order unity.


5·2. When account is taken of the vertical motion, the equation giving the
electron density is

$$ \frac{\partial N}{\partial t} = I(z,t) - \alpha' N^2 + \frac{\partial (Nv)}{\partial z}, \qquad (7) $$


where $I(z,t)$ is the rate of ion production per c.c., and $\alpha'$ is the effective recombination
coefficient after account is taken of attachment and detachment. The values of $v$
necessary to make the last term important are examined first. Since $N$ and $v$ both
may be expected to have, in general, an exponential variation with height, the
order of magnitude of this term will be that of $Nv$, if $v$ be measured in units of $H_0$.
The ratio of the orders of magnitude of the two last terms in equation (7) is therefore
$v/\alpha' N$. $N$ is about $10^6$/c.c., and the usually accepted values of $\alpha'$ are about $10^{-10}$.
(It must be remembered, however, that the method used for determining $\alpha'$ (Appleton
1937) ignores the influence of $v$ and may give much too high values if the views
here presented be accepted.) Vertical movement becomes important therefore if
$V$ is of the order $10^{-4}H_0$, or 400 cm./sec. The velocity of the wind associated with the
solar tide is about 30 cm./sec. at ground-level, and calculations according to Pekeris’s
methods lead to amplifications of several hundred times at ionospheric levels. There
is therefore little doubt that movement is an important and probably a dominant
factor in determining the observed values of $N$.
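
The estimate above can be followed as a short calculation. The sketch below is a hedged illustration, not the author's computation: the value of the recombination coefficient and the 40 km height of the homogeneous atmosphere are assumptions, chosen because together they reproduce the 400 cm./sec. quoted in the text, and the amplification of 300 is a hypothetical stand-in for "several hundred".

```python
N_PER_CC = 1.0e6           # electron density, about 10^6 per c.c. (from the text)
ALPHA_EFF = 1.0e-10        # assumed effective recombination coefficient, cm^3/sec
SCALE_HEIGHT_CM = 4.0e6    # assumed H0 = 40 km, so that 1e-4 * H0 = 400 cm/sec

def threshold_velocity_cm_per_sec(n=N_PER_CC, alpha=ALPHA_EFF, h0=SCALE_HEIGHT_CM):
    """Velocity at which the movement term N*v/H0 equals the loss term alpha*N^2."""
    rate_per_sec = alpha * n       # alpha' N, in 1/sec
    return rate_per_sec * h0       # v = alpha' N H0, in cm/sec

ground_wind = 30.0                 # cm/sec, solar tidal wind at ground level (text)
amplification = 300.0              # hypothetical value for "several hundred times"

v_threshold = threshold_velocity_cm_per_sec()
v_ionosphere = ground_wind * amplification
print(f"threshold {v_threshold:.0f} cm/sec, amplified tidal wind {v_ionosphere:.0f} cm/sec")
```

With these assumed figures the amplified tidal wind exceeds the threshold by more than an order of magnitude, which is the sense of the conclusion drawn in the text.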

5·3. In order to bring out the important consequences of movement in as simple
a manner as possible, night-time conditions ($I = 0$) are considered, and recombination
neglected.*

An initial distribution of ionization corresponding to a Chapman region is
assumed, i.e.

$$ N_0 = n_0 \exp \tfrac{1}{2}\{1 - z - \sec\chi\, e^{-z}\}. $$

Under these limitations the solution of equation (7) becomes


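$$ N = n_0\, \eta^{-(2\gamma+1)/2\gamma} \exp \tfrac{1}{2}\{1 - z - \sec\chi\, e^{-z}\eta^{-1/\gamma}\}, $$

where $\eta = (1 + \gamma V t e^{-\gamma z})$. A maximum occurs in $N$ at the height $z_m$ given by

$$ \sec\chi\, e^{-z_m}\eta_m^{-(\gamma+1)/\gamma} - (2\gamma+1)\eta_m^{-1} + 2\gamma = 0. $$

* Dr J. C. Jaeger is making a special study of this equation with the ionizing term included.
The author is much indebted to him for making available his preliminary results, and for
valuable discussion.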
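
A minimal numerical sketch of this night-time case may make the behaviour of the maximum easier to follow. It assumes the characteristic solution of $\partial N/\partial t = \partial(Nv)/\partial z$ with $v = Ve^{-\gamma z}$: an electron now at height $z$ set out from $z + \gamma^{-1}\ln\eta$, and the density is reduced by the factor $1/\eta$. The function names, the grid search and the choice $\sec\chi = 1$ are illustrative assumptions, and the sketch holds only while $\eta > 0$.

```python
import math

def eta(z, beta, gamma):
    """eta = 1 + beta * exp(-gamma z), with beta = gamma * V * t."""
    return 1.0 + beta * math.exp(-gamma * z)

def chapman(z, sec_chi=1.0, n0=1.0):
    """Initial Chapman region N0(z) = n0 exp{(1 - z - sec(chi) e^{-z}) / 2}."""
    return n0 * math.exp(0.5 * (1.0 - z - sec_chi * math.exp(-z)))

def density(z, beta, gamma, sec_chi=1.0, n0=1.0):
    """N(z, t): the initial region carried along the characteristics of v."""
    e = eta(z, beta, gamma)
    z_start = z + math.log(e) / gamma   # height from which the electrons set out
    return chapman(z_start, sec_chi, n0) / e

def peak(beta, gamma, sec_chi=1.0, lo=-3.0, hi=6.0, steps=90000):
    """Crude grid search for the height and value of the maximum of N."""
    best_z, best_n = lo, density(lo, beta, gamma, sec_chi)
    for i in range(1, steps + 1):
        z = lo + (hi - lo) * i / steps
        n = density(z, beta, gamma, sec_chi)
        if n > best_n:
            best_z, best_n = z, n
    return best_z, best_n

def peak_condition(z, beta, gamma, sec_chi=1.0):
    """Left-hand side of the condition for the maximum; it vanishes at z = z_m."""
    e = eta(z, beta, gamma)
    return (sec_chi * math.exp(-z) * e ** (-(gamma + 1.0) / gamma)
            - (2.0 * gamma + 1.0) / e + 2.0 * gamma)

# Downward motion (beta > 0) with gamma = 1, sec(chi) = 1 chosen for illustration.
z_m, n_max = peak(beta=0.5, gamma=1.0)
print(f"z_m = {z_m:.3f}, N(max)/n0 = {n_max:.3f}")
```

For this choice of parameters the maximum lies above the undisturbed level $z = 0$ and is weaker than the undisturbed maximum $n_0$, in keeping with the behaviour described for the larger values of $\gamma$.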

In figure 6 $N$ is shown as a function of $z$, for $\chi = 90°$, for two values of $\gamma$ (0·25 and 1·0),
and for several positive values of $\beta$ ($=\gamma Vt$), i.e. for $v$ downwards. It will be observed
that in all cases the value of $N$(max.) is less than $N_0$(max.) and becomes progressively
smaller with lapse of time ($\beta$ increasing). It is particularly to be remarked that while
for small values of $\gamma$ (0·25) $z_m$ decreases steadily with time, for larger values ($\gamma$ =
0·5, 1·0) the maximum rises. (The curves for $\gamma$ = 0·5 are not reproduced, to avoid
confusing the figure.) Figure 7 shows the same type of curves for upward velocities
($\beta$ negative). It will be observed in these cases that $N$(max.) increases with time and
$z_m$ rises in general. There is an obvious tendency for the region to develop a sharp
gradient on the under-surface, a tendency which eventually must be balanced by
diffusion.



Figure 6. Effect of downward electron velocities on Chapman region. Curve I, $\gamma$ = 0·25,
$\beta$ = 0·25; curve II, $\gamma$ = 0·25, $\beta$ = 0·5; curve III, $\gamma$ = 1·0, $\beta$ = 0·5; curve IV, $\gamma$ = 1·0, $\beta$ = 1·0.

For small values of $\beta$ (small velocities or small lapse of time) it is easy to show that
equation (7) reduces to

$$ z_m = \gamma V t\,(2\gamma - 1/\gamma). $$

At this height $N$(max.) is then given by

$$ N(\text{max.}) = n_0(1 - \gamma V t). $$
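
These two small-$\beta$ results can be tabulated directly. The sketch below is only an illustration of the formulae as written, with $\beta = \gamma Vt$ taken as positive for downward motion; the function name and the sample values of $\gamma$ and $\beta$ are assumptions. The sign of $z_m$ changes where $2\gamma = 1/\gamma$, that is at $\gamma = 1/\sqrt{2}$, or about 0·707.

```python
import math

def small_beta_peak(gamma, beta, n0=1.0):
    """First-order (small beta) height and density of the displaced maximum."""
    z_m = beta * (2.0 * gamma - 1.0 / gamma)   # beta = gamma * V * t
    n_max = n0 * (1.0 - beta)
    return z_m, n_max

critical_gamma = 1.0 / math.sqrt(2.0)   # z_m changes sign where 2*gamma = 1/gamma

for gamma in (0.25, 0.5, 1.0):
    z_m, n_max = small_beta_peak(gamma, beta=0.05)
    print(f"gamma = {gamma}: z_m = {z_m:+.4f}, N(max)/n0 = {n_max:.3f}")
```

For $\gamma$ above the critical value the maximum moves upwards while the electrons move downwards, and the reverse holds below it.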





Thus in this special case, which it will appear in a later paper is of special importance
for lunar tides, the movement of the maximum is in the opposite direction to that
of $V$ (for $\gamma$ > 0·706) and $N(\text{max.}) \lessgtr N_0(\text{max.})$ according as $V \gtrless 0$. This effect can be
seen in figure 7, for the curve represented by the parameters $\gamma$ = 1, $\beta$ = −0·25. It
will be noticed that if in this case one is able experimentally to determine the values
of $z_m$ and $N$(max.) simultaneously, then one can evaluate both $\gamma$ and $V$. It has been
found possible to do this for the lunar tide which has been found in the $F_2$ region,
but the results of this investigation must be deferred to a later paper.



Figure 7. Effect of upward electron velocities on Chapman region. Curve I, $\gamma$ = 0·25,
$\beta$ = −0·25; curve II, $\gamma$ = 0·5, $\beta$ = −0·5; curve III, $\gamma$ = 0·5, $\beta$ = −1·0; curve IV, $\gamma$ = 1·0, $\beta$ = −0·25;
curve V, $\gamma$ = 1·0, $\beta$ = −1·0.


6. Conclusion

6·1. It is proposed in this section to review briefly some of the main features of
$F_2$ region behaviour and the application of the above theory to their elucidation,
leaving the detailed application of the theory to later papers.

First the value of $\alpha'$, the effective recombination coefficient in this region, is
discussed. It has been a matter of considerable theoretical difficulty for some time
that the measured value (c. $10^{-10}$) is so much greater than the theoretical value for
radiative recombination (c. $10^{-12}$). On the other hand, there seems little evidence
of a definite eclipse effect in this region, particularly when $F_2$ is well separated from
$F_1$ at the time of eclipse. If the view advanced in this paper is accepted, the usual
methods of measuring $\alpha'$ from observations of the diurnal change of $N$(max.) must
be invalidated, and very little evidence indeed is left regarding the value of $\alpha'$.
In two papers, Woolley (1946) and Allen (1946) put forward evidence, the first from
theoretical and the latter from observational reasoning, that the time of decay of
$F_2$ ionization is of the order of a day and $\alpha' <$

Whatever the precise value of $\alpha'$ in $F_2$ may eventually prove to be, there is little
doubt that it is very much less than that for $F_1$, which disappears as a separate region
near sunset, and which shows a well-marked eclipse effect. The conclusion may safely
be drawn that ionization in the $F$ region which is moving upwards when formed by
solar radiation will have a considerably longer life than similarly formed ionization
moving downwards.

6·2. It is now well known that the ionization densities found in low latitudes in
the $F_2$ region are much higher than can be accounted for by the simple Chapman
theory. This fact is a natural consequence of the theory outlined in this paper. If
Pekeris’s conclusion that the phase of the solar tide in the ionosphere is opposite to
that at the ground is accepted, then at latitudes above about 36° (the precise value
depending on the ‘longitude term’ of §4·4) $v$ is directed downwards between the
hours of sunrise and noon (see figure 5a). Figure 6 shows that for such conditions
$N$(max.) is reduced considerably below the Chapman value, and the height of the
maximum rises, as is observed. A further cause of reduced $N$, which will be effective
throughout the ensuing 24 hr., is the flow of freshly liberated electrons downwards into
the ‘sink’ at lower levels where $\alpha'$ is high. In the afternoon electrons will flow upwards,
leading to the afternoon concentration (figure 7) which is a well-marked feature of stations
at these latitudes. The fact that at high latitudes the region is relatively free from
this effect in winter, but shows it strongly in summer, suggests the presence of the
seasonal harmonic (figure 5a) in intensity comparable with $\psi_2^2$. After sunset there
should be a rapid drop of ionization until midnight ($v$ downwards), followed by a
slower drop (or even a rise in $N$(max.) if enough electrons persist at the lower levels)
lasting till about sunrise. These effects also are well-known observational phenomena.

At low latitudes from 0600 to 1200 hr. $v$ is directed upwards and figure 7 shows
that $N$(max.) will increase to values well above the Chapman value, and will rise
steadily in height (after a small initial drop). Electrons are now being carried upwards
on liberation and will therefore contribute to a high general level of $N$(max.)
throughout the ensuing 24 hr. From 12 to 18 hr. electrons will move downwards
again and the height of the maximum and $N$(max.) will fall, but from 18 to 24 hr.
there may occur a further concentration leading to abnormally great $N$(max.). After
midnight the values of $N$(max.) should commence to drop markedly until dawn. These
phenomena too are well marked in the data from low-latitude stations. For example,
the data for Cape York (lat. 11·0° S) used in figure 3 show a well-marked maximum in
$N$(max.) at midnight.

Observatories very near to the magnetic equator show a slight change in the
above-mentioned diurnal sequence. Here towards noon $h_{\max}$ attains very high values
(c. 400 km.) and $N$(max.) shows a well-marked minimum. Also, the diurnal variation
of $h_{\max}$ is retarded in phase a few hours behind $h'$. It appears likely that this
distinctive ‘equatorial’ behaviour is associated with the very great increase in $v$
(figure 5a) which occurs in these latitudes. This must raise the ionization to abnormally
great heights, as is observed. At these heights damping must be very great and, as
is well known from the theory of damped oscillations, there will be a phase lag which
may amount in the limit to $\tfrac{1}{2}\pi$. This would readily account for the phase lag in $h_{\max}$,
and may also account for the noon ‘bite-out’ in $N$(max.). This point will be more
fully discussed in a later paper.

6·3. Appleton & Weekes (1939) have found a semi-diurnal lunar tidal effect in
the $E$ region such that this region is 0·8 km. higher than average near the time of
lunar transit. It has not so far been possible to interpret this result as due to vertical
air motion associated with the lunar tide (Chapman & Bartels 1940). Any such
interpretation encounters the grave difficulty that the phase of the observed motion
is almost in exact opposition to that demanded by the dynamo theory and by
Pekeris’s calculations. The theory of vertical movements of ionization developed for
the $F_2$ region in this paper cannot be directly applied to this problem, since there is
no information regarding the height gradient of tidal velocities in the $E$ region.
It may be said, first, that the $E$ region should have either a minimum or a maximum
height at lunar transit, and so no theoretical impasse arises in the present state of
our knowledge; secondly, it is a necessary consequence of our theory that for the
small values of $V$ associated with the lunar tide, there should be a phase difference
of $\pi$ between the vertical movements of ionized regions in high and low latitudes
respectively. Appleton & Weekes’s result refers to high latitudes (51° N). It has
been found that the $E$ region lunar tidal effect at Brisbane (lat. 27·5° S) shows the
minimum height occurring slightly before lunar transit. This is almost exactly in
phase opposition to that found in England and would appear to give strong support
to our theory of the mechanism involved. This point will be fully discussed in the
second paper of this series.

6·4. Finally, it is desirable to investigate briefly the reason why solar tides appear
to exert such a strong influence in the $F_2$ and not in the $E$ and $F_1$ regions. The most
likely explanation of this fact appears to be that both the rate of ion production,
and $\alpha'$ the effective recombination coefficient, are much higher in the latter two
regions than in $F_2$. These two terms may be expected to dominate the tidal term in
equation (7) so that these regions behave approximately like a simple Chapman
region. (It should be noted, however, that there are small but definite anomalies
in the global distribution of $E$ and $F_1$ ionization which may eventually prove to be of
tidal origin.) Again it should be remembered that the vertical gradient of $V$ is of decisive
importance in determining the amount of region distortion. $\gamma$ is large in the $F_2$ region
because of damping. If the $E$ and $F_1$ regions are situated in regions where this
gradient is not great they will be little affected, particularly as regards $N$(max.).

This work has been carried out while the author was on an extended visit to the
Commonwealth Observatory, Canberra, A.C.T. His thanks are due to the Commonwealth
Astronomer, Dr R. v. d. R. Woolley, for providing facilities and much helpful
discussion and criticism; also to Dr C. W. Allen, G. Odgers, and Dr S. C. B. Gascoigne
of the Observatory staff. The valuable contributions of Dr J. C. Jaeger of the
Mathematics Department, University of Tasmania, have been mentioned above.
Mr J. W. Reed assisted considerably with computations. The data used in figure 3
have been made available by the Australian Radio Propagation Committee. It is
a pleasure to acknowledge the interest of the Chief Executive Officer of the Council
for Scientific and Industrial Research, Sir David Rivett, F.R.S., who arranged the
conditions necessary for the prosecution of this research.


References 

Allen 1946 Terr. Mag. Atmos. Elect. 51, 1.
Appleton 1935 Phys. Rev. 47, 89.
Appleton 1937 Proc. Roy. Soc. A, 162, 451.
Appleton & Weekes 1939 Proc. Roy. Soc. A, 171, 171.
Balfour Stewart 1882 Encyclopaedia Britannica, 9th ed.
Booker & Seaton 1939 Trans. Wash. Meet. Ass. Terr. Mag. and Elect. p. 503.
Chapman 1919 Phil. Trans. A, 218, 1.
Chapman 1931 Proc. Phys. Soc. Lond. 43, 26.
Chapman & Bartels 1940 Geomagnetism. Oxford: Clarendon Press.
Fleming 1939 Terrestrial Magnetism and Electricity, p. 38. New York: McGraw Hill.
Godfrey & Price 1937 Proc. Roy. Soc. A, 163, 228.
Hulbert 1929 Phys. Rev. 34, 1167.
Hulbert 1934 Phys. Rev. 46, 822.
Martyn 1945 Nature, 155, 363.
Martyn & Pulley 1936 Proc. Roy. Soc. A, 154, 456.
Pekeris 1937 Proc. Roy. Soc. A, 158, 650.
Schuster 1908 Phil. Trans. A, 208, 163.
Woolley 1946 Proc. Roy. Soc. A, 187, 102.



Temperature changes in the crystal structure of 
barium titanium oxide 

By Helen D. Megaw, Girton College, Cambridge

(Communicated by J. D. Bernal, F.R.S. Received 15 March 1946)

[Plate 5] 

Barium titanium oxide, which is tetragonal at room temperature, changes at about 120° C to a
cubic structure. This change has been followed in detail by means of X-ray powder
photographs taken in a 19 cm. powder camera at intervals of a few degrees over a range covering
the transition point.

The unit cell, which contains the formula number of atoms, retains its identity throughout
the transition, and the atomic parameters are unaltered. The change is simply in the axial
lengths, and these vary continuously with the temperature, though not linearly, the
variation becoming more rapid near the transition point. While the linear expansion coefficients
along and perpendicular to the tetrad axis are large and of opposite sign, the volume
expansion coefficient is small and positive. There is no discontinuous change either of linear spacing
or of volume detectable at the transition point, but there is a sharp discontinuity in the
linear expansion coefficients, and a marked increase in the volume expansion coefficient
which is probably, though not certainly, discontinuous.

The transition suggests a typical λ-point change. The specific heat has not been
determined, but the thermal expansion curve has the characteristic λ shape. Co-existence of cubic
and tetragonal structures, in proportions depending on the temperature, occurs over a range
of some degrees near the transition point, and is attributed to the effect of local stresses in
facilitating or hindering a change between two structures whose energy difference is very
small in this temperature range.

Below room temperature, observations made down to −183° C suggest that the structure
may have a second transition point somewhere below this and become cubic again, the
change being of the same nature as that at 120° C.

It is argued that the room-temperature structure can only be explained by the existence of
directed bonds, and that the breaking of these bonds with increasing temperature is
responsible for the 120° C transition. The low-temperature transition is explained by postulating a
more complete set of bonds, probably an octahedral complex, which partially breaks down
at this temperature to give the square formation observed in the room-temperature structure.

The possible nature of the directed bonds is discussed qualitatively. The condition which
makes possible the formation of such bonds is likely to be the abnormal volume available to
the Ti atom, which is due to the effect of the large Ba ion in forcing apart the oxygen
lattice. The directed bond system will only contribute a small part to the attractive energy
of the lattice, which is mainly ionic in character. The hypothesis that directed bonds exist,
whatever their origin, is used for a tentative explanation of anomalous variations of intensity
of the X-ray lines observed at temperatures near the transition point.

Introduction

The crystal structure of barium titanium oxide (barium titanate) has recently
attracted much attention. Early work by Barth (1925) showed that it belongs to
the perovskite type. This has a simple cubic lattice, with the formula number of
atoms per cell, the atomic parameters being as follows:

Ti   0 0 0,

Ba   ½ ½ ½,

O    ½ 0 0,   0 ½ 0,   0 0 ½.


Figure 1 shows this structure. It consists of a network of TiO$_6$ octahedra linked
through each corner with their neighbours, the Ba ions being accommodated in
the large central holes of the network, where they are surrounded by twelve
oxygens.



Figure 1. Structure of BaTiO$_3$. The unit cell is shown by the dotted line.


Recently it has been shown (Megaw 1945; Rooksby 1945) that BaTiO$_3$ is not
strictly cubic at room temperature, but tetragonal; the atomic parameters remain
unchanged,* but the whole unit cell is uniformly stretched about 1 % in the
direction of one of the cube edges, which becomes the $c$ axis.

Measurements made in the Material Research Laboratory of Philips Lamps, Ltd.,
of the variation of dielectric constant with temperature (Rushman & Strivens,
1946) showed a curve of the type recently described by Wul & Goldman (1945 a;
Wul 1945), with a conspicuous peak at about 120° C, and this immediately suggested
that a change of structure might be involved. Investigations were accordingly
made using a high-temperature camera in which the specimen could be examined
over this temperature range.

It was found that the tetragonal form did in fact change its structure at about 
this temperature, inverting to the ideal cubic form described above; further, that 
the transition involved no discontinuous change either of spacing or of axial ratio. 
The detailed course of the temperature variation of spacing and of axial ratio was 
followed, and will be described below. In addition, there were certain peculiarities 
observed in the intensities of the X-ray lines, which will be discussed. 

* The atomic parameters are fixed by the symmetry requirements if the lattice has a centre 
of symmetry. If there is no such centre, displacement along the tetrad axis becomes possible, 
and the space-group^ is lowered from to <7i„-P4 mm or This possibility 

is considered later in the text.



The curve of the variation of dielectric constant with temperature shows also
a subsidiary peak at about 0° C. This suggested that it would be of interest to
study the structure below 0° C. No accurately controllable low-temperature camera
was available, and the experiments done were of the nature of a preliminary
exploration only. While no change of structure was found at the lowest temperature
reached (about −183° C), there were changes of axial ratio which suggested
that at a still lower temperature the structure might revert to cubic; and there
were also intensity anomalies resembling those associated with the high-temperature
transition.

The small amount of silica present in the material, as described below, whether
it forms a solid solution or occurs as a separate phase, does not seem to have any
important effect on the temperature changes, but further confirmation of this point
would be desirable.


1. High-temperature transition
Experimental 

The high-temperature camera used was that designed by A. J. C. Wilson (1941). 
It is a 19 cm. powder camera, with an oven surrounding the specimen, which can 
be kept at an accurately controlled temperature while the photograph is being 
taken. 

The specimen was made from a sample of BaTiO$_3$ containing 1·5 % silica, fired
at 1300° C to form a sintered disk. From this a piece was cut, ground and etched
to give a rod about 2 cm. long and 1 mm. in diameter. This was mounted on the
axis of the camera and rotated during exposure.

Copper radiation from a demountable tube was used, with a nickel filter.
Exposures were about 1½ hr.

Photographs were taken at 20° intervals from 100° to 200° C, and at 2° intervals
in the neighbourhood of the transition point, as well as at room temperature. The
temperature of the specimen could be controlled to within ½° C, and explorations
with a thermocouple showed that temperature differences in the air space
surrounding the specimen did not exceed this.

The photographs were measured with a comparator reading to 0·01 mm., and a
modified Bradley-Jay extrapolation (Megaw 1946b) was carried out to determine
the cell-dimensions and axial ratio.

Interpretation of photographs 

Photographs taken at different temperatures are shown in figure 2, plate 5.
The high-angle lines are most important for the evaluation of cell-constants, and
the others may be ignored. The cubic structure gives a set of doublets formed by
reflexion from the same plane of the two wave-lengths $K\alpha_1$ and $K\alpha_2$. In the tetragonal
structure each doublet is split into a set of partly overlapping doublets. The
components of the highest-angle group, that with $h^2+k^2+l^2 = 26$, are tabulated
below, with their Bragg $\theta$-values. (The theoretical intensity values refer only to
relative intensities of lines of the same group on the same photograph.)


Table 1

                                                        theoretical intensity
                                                        ratio (allowing
structure             indices and radiation             for overlaps)           θ (°)

cubic at 200° C       (431 + 501) α₁                    2                       78·26
                      (431 + 501) α₂                    1                       78·96
tetragonal at 20° C   105 α₁                            1                       77·03
                      314 α₁ + 105 α₂                   2·5                     77·84
                      413 α₁ + 314 α₂                   3                       78·57
                      *(431 + 501 + 510) α₁ + 413 α₂    5                       79·43
                      *(431 + 501 + 510) α₂             2                       80·24

* Note. 431 and 501 have geometrically identical spacings; 510 differs very slightly, but
is not resolved as a separate line.
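
The way the Bragg angles of this group follow from the cell dimensions can be sketched as below. This is a hedged illustration rather than the extrapolation procedure used in the paper: the copper wave-lengths in kX units are assumed values that are not given in the text, the function name is hypothetical, and the cell edge is the 200° C entry of table 2, so only approximate agreement with the tabulated angles should be expected.

```python
import math

# Assumed Cu K-alpha wave-lengths in kX units (not stated in the text).
CU_K_ALPHA_1 = 1.537395
CU_K_ALPHA_2 = 1.541232

def bragg_angle_deg(h, k, l, a, c, wavelength):
    """Bragg angle for a tetragonal cell; set c = a for the cubic structure."""
    inv_d_squared = (h * h + k * k) / (a * a) + (l * l) / (c * c)
    sin_theta = 0.5 * wavelength * math.sqrt(inv_d_squared)
    return math.degrees(math.asin(sin_theta))

a_cubic = 4.0040   # cell edge in kX at 200 deg C, taken from table 2
theta_a1 = bragg_angle_deg(4, 3, 1, a_cubic, a_cubic, CU_K_ALPHA_1)
theta_a2 = bragg_angle_deg(4, 3, 1, a_cubic, a_cubic, CU_K_ALPHA_2)
theta_501 = bragg_angle_deg(5, 0, 1, a_cubic, a_cubic, CU_K_ALPHA_1)
print(f"431 alpha1: {theta_a1:.2f} deg, 431 alpha2: {theta_a2:.2f} deg")
```

In the cubic case 431 and 501 give the same angle because both have $h^2+k^2+l^2 = 26$; passing unequal `a` and `c` to the same function shows how the group splits in the tetragonal structure.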


Results

It can be seen from figure 2, plate 5, how the lines of the tetragonal structure
close together with increasing temperature, and finally coalesce in the cubic 
structure. This gives a measure of the decreasing axial ratio, which finally becomes 
unity. At the same time, there are changes in the relative intensities of the lines. 
These will be discussed later; but here it may be noted that they make accurate 
determinations of the axial ratio near the transition point very difficult. Lines 
characteristic of the tetragonal structure have been observed up to 118 ° C, and 
may possibly be present at 120° C. Above this, only cubic lines can be detected. 

The cell dimensions at the different temperatures are given in table 2. The last 
column gives the cell edge of those structures which are actually cubic, and the cor¬ 
responding mean spacing, or cube-root of the volume, of the tetragonal structures. 

Table 2

temperature    a         c         c/a                  ∛vol.
(° C)          (kX)      (kX)                           (kX)

 20            3·9866    4·0264    1·0098 ± 0·0001      3·9996 ± 0·0004
 90            3·9920    4·0192    1·0068 ± 0·0002      4·0011 ± 0·0005
100            3·9926    4·0162    1·0059 ± 0·0002      4·0005 ± 0·0008
116            3·9962    4·0122    1·0040 ± 0·0005      4·0016 ± 0·0008
117            3·9962    4·0102    1·0036 ± 0·0010      4·0010 ± 0·0010
117·6          3·9967    4·0087    1·0030 ± 0·0010      4·0007 ± 0·0010
118            3·9976    4·0083    1·0027 ± 0·0010      4·0011 ± 0·0006
120            -         -         1·0000 (+0·0020)     4·0011 ± 0·0005
122            -         -         1                    4·0011 ± 0·0006
124            -         -         1                    4·0012 ± 0·0006
126            -         -         1                    4·0014 ± 0·0005
128            -         -         1                    4·0010 ± 0·0007
130            -         -         1                    4·0011 ± 0·0006
140            -         -         1                    4·0016 ± 0·0006
160            -         -         1                    4·0028 ± 0·0005
180            -         -         1                    4·0037 ± 0·0005
200            -         -         1                    4·0040 ± 0·0005
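
The last two columns of table 2 follow from the first two, as the short sketch below illustrates for the 100° C row. The function names are illustrative only; the axial ratio is $c/a$ and the mean spacing is the cube root of the tetragonal cell volume $a^2c$.

```python
def axial_ratio(a, c):
    """Axial ratio c/a of the tetragonal cell."""
    return c / a

def mean_spacing(a, c):
    """Cube root of the tetragonal cell volume a * a * c."""
    return (a * a * c) ** (1.0 / 3.0)

# Cell dimensions in kX at 100 deg C, as listed in table 2.
a_100, c_100 = 3.9926, 4.0162
ratio_100 = axial_ratio(a_100, c_100)
spacing_100 = mean_spacing(a_100, c_100)
print(f"c/a = {ratio_100:.4f}, cube root of volume = {spacing_100:.4f} kX")
```

Both figures agree with the tabulated entries for that row to within the stated limits of error.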




These spacings have been plotted against the temperature in figure 3. The
non-linear variation even below 116° C is apparent. Above this, between 116°
and 130° C, the exact course of the curve is more uncertain. The full line represents
its most probable course, which is not very different from its lower possible limit;
its upper limit is given by the dotted line. Above 130° C the change of spacing is
linear with temperature, within experimental error.

Figure 3. Temperature variation of cell dimensions of BaTiO$_3$ (spacings in kX). (The vertical lines
through the points represent the limits of experimental error; those for observations near the
transition point have, however, been omitted for the sake of clarity.)

The mean spacing of the tetragonal structure has been plotted against temperature
in the same figure. It is noteworthy that the slope of the line is appreciably
different from that of the cubic structure above 130° C, though of the same
order of magnitude. In other words, the volume expansion coefficient, a constant
for each structure over the temperature range studied, changes with the change of
structure. Whether this occurs discontinuously at the transition point or gradually
through intermediate values cannot be distinguished with the accuracy available.
The values of the mean linear expansion coefficients, deduced from the slopes, are
given in table 3.

Table 3

                 temperature range     mean linear expansion
structure        (° C)                 coefficient × 10⁶

tetragonal       20-120                3·6 ± 1·5
cubic            130-200               10 ± 3
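
As a rough check on table 3, a mean coefficient can be estimated from the end-points of each range. The sketch below is an assumption-laden simplification: the paper deduces the coefficients from the slopes of the fitted curves, whereas here only the first and last mean spacings listed in table 2 for each range are used, and the function name is hypothetical.

```python
def mean_linear_expansion(d_low, d_high, t_low, t_high):
    """Mean coefficient per deg C between two temperatures: (1/d) * (delta d / delta T)."""
    return (d_high - d_low) / (d_low * (t_high - t_low))

# Mean spacings in kX taken from the last column of table 2.
tetragonal = mean_linear_expansion(3.9996, 4.0011, 20.0, 120.0)
cubic = mean_linear_expansion(4.0011, 4.0040, 130.0, 200.0)
print(f"tetragonal: {tetragonal * 1e6:.1f} x 10^-6, cubic: {cubic * 1e6:.1f} x 10^-6 per deg C")
```

Both end-point estimates fall inside the limits quoted in table 3, the cubic value being roughly three times the tetragonal one.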

The linear thermal expansion coefficients in the $a$ and $c$ directions have been
calculated from the slopes of the corresponding curves in figure 3, and plotted
against the temperature in figure 4. These are necessarily very sensitive to the
particular way in which the spacing/temperature curve is drawn through the
experimental points, and can only be relied on as far as their general trend is
concerned. They do, however, show at a glance how the coefficients are nearly
constant at room temperature and increase rapidly in numerical value towards
the transition temperature. Whether the region about 120° C should be occupied
by a discontinuity or by a sharp peak cannot be determined on the available
evidence; it depends on the exact shape selected for the spacing/temperature curve
between its possible limits. The fact that the $c$ expansion, which is negative in sign,
shows the same sharp increase in numerical value as the $a$ expansion, which is
positive, rather suggests that there is a discontinuity in both at the transition
temperature.

The axial ratio is plotted against temperature in figure 5. Here again the most
probable curve has been drawn with a full line, the upper limit with a dotted line.

Intensity anomalies

In a perfect lattice, the relative intensities are determined by the geometry of
the lattice, the atomic parameters, and the atomic form-factors or f-values of the
atoms. For BaTiO₃ with its atoms in the special positions quoted above there are
no atomic parameters which can vary with temperature. The f-values, which include
the effect of thermal vibrations, are of course temperature-dependent; but changes
in these, even relative changes as between the different atoms, will affect all the
lines of a single group in the same way and to much the same extent, except in so
far as the changes are anisotropic, an important exception which must be
considered more fully later. The correction factor for polarization which has to be
applied to the observed intensities may be neglected, since this varies continuously
with θ, and the differences of θ within a group are small. Thus, unless
anisotropic thermal vibrations are involved, the relative intensities within a group are





Figure 4. Linear expansion coefficients of BaTiO₃. (Curves for the a direction,
the c direction and the cubic structure, plotted against temperature.)

Figure 5. Temperature variation of axial ratio of BaTiO₃. (The vertical lines
through the points represent the limits of experimental error.)





Figure 6. Photometer curves of line 26 at different temperatures,
(a) 20° C, (b) 100° C, (c) 116° C, (d) 117° C, (e) 118° C, (f) 120° C.
(Peaks labelled (431+501+510)α₁ and 413α₁.)



practically independent of the temperature and of the axial ratio. In table 1 they 
are given for the high-angle lines which are of interest. They will be referred to as 
the ‘normal intensities’. 

Intensities were observed visually in the first instance. The observed intensities 
at room temperature agreed with the calculated, but at higher temperatures 
abnormalities were noticed, which fell into groups as follows:

(a) Lines of high l index became progressively weaker relative to other lines
with the same h² + k² + l² when the transition temperature was approached. This
appeared first, and was most marked, with line 105, but could also be seen with 314
and with 224.

(b) At temperatures within a few degrees of the transition, line 413 began to
assume prominence, and increased in strength relative to the lines on either side
of it till the transition temperature was reached.

(c) Near the transition point there was also a very marked increase in the
background in the immediate neighbourhood of the lines.

Photometric measurements of the photographs were made, and confirmed these
observations. The curves are shown in figure 6a-f, after subtraction of background;
in drawing these, the ordinates were chosen to give all the curves the same area,
for convenient comparison, as the experimental intensity scale is arbitrary and not
comparable between different photographs.

The decrease of distance between the peaks with increase of temperature is
obvious and accounts for some of the diminution of importance of individual
peaks. There is, however, a very striking increase of intensity of the 413 peak
in the 116° C photograph, where it actually exceeds the 431+501+510 peak,
hitherto strongest. This process is carried progressively further at 117° and
118° C.

To show up the same points more emphatically, a set of theoretical intensity
curves was constructed for a normal tetragonal lattice with different axial ratios,
and each matched against the experimental curve with the same axial ratio. For
constructing such a theoretical curve, it was assumed to consist of a set of
overlapping α₁α₂ doublets, each of the shape of the doublet at 200° C, where the
structure is undoubtedly cubic. (Theoretically this doublet should be rather less
sharp than at lower temperatures, but the difference is likely to be negligible for
present purposes.) The ordinates of these doublets, each placed at its appropriate
angle, were added to give the complete tetragonal curve. Figure 7 shows the
results at 20°, 100° and 116° C. At 20° C the fit is good; the slightly greater sharpness
of the experimental curve on the right-hand side probably comes within the limits
of error. At 100° C the fit is reasonable, but the experimental 105 peak is lower than
the theoretical, the experimental 413 peak higher. At 116° C the experimental
413 peak is very much too high, and the 105 and 314 peaks are smoothed out
against a high general intensity covering their expected positions.
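The construction just described, a sum of identical doublets placed at the line
positions and then scaled to a common area, may be sketched as follows. This is
only an illustration under stated assumptions: the doublet shape is taken here as
two Gaussians (the paper uses the measured doublet at 200° C instead), and the
width, splitting, α₂/α₁ ratio, line positions and heights are all hypothetical
placeholder values, not data from the photographs.

```python
import math


def gaussian(u, width):
    """Unit-height Gaussian; an assumed stand-in for the measured line shape."""
    return math.exp(-0.5 * (u / width) ** 2)


def doublet(x, centre, height, width=0.05, split=0.12, ratio=0.5):
    """One alpha1-alpha2 doublet: the alpha2 component is displaced by `split`
    and scaled by `ratio`. All shape parameters are assumed values."""
    return height * (gaussian(x - centre, width)
                     + ratio * gaussian(x - centre - split, width))


def synthesize(grid, lines):
    """Add the ordinates of the doublets of all lines, given as (centre, height)."""
    return [sum(doublet(x, c, h) for c, h in lines) for x in grid]


def area(grid, curve):
    """Area under a sampled curve by the trapezoidal rule."""
    return sum(0.5 * (curve[i] + curve[i + 1]) * (grid[i + 1] - grid[i])
               for i in range(len(grid) - 1))


def normalize_area(grid, curve):
    """Rescale a curve to unit area so curves on arbitrary scales can be compared."""
    total = area(grid, curve)
    return [y / total for y in curve]


# Hypothetical line positions (arbitrary angular units) and relative heights.
grid = [i * 0.01 for i in range(301)]
lines = [(0.8, 1.0), (1.2, 2.0), (1.6, 2.0), (2.0, 3.0)]
profile = normalize_area(grid, synthesize(grid, lines))
```

Normalizing to equal area mirrors the choice made for figure 6, where the
experimental intensity scale differs from one photograph to the next.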

Such a change in the relative intensities is geometrically impossible in a perfect 
lattice, and it requires for its explanation either lattice imperfections of some kind 




or the co-existence of two structures in varying proportions. The two effects,
enhancement of the 413 line and weakening of lines of high l, must be discussed
separately.



Figure 7. Comparison of calculated (-) and observed (--) intensity curves, (a) 20° C,
(b) 100° C, (c) 116° C, calculated for all the material tetragonal, (d) 116° C,
calculated for 25 % of the material cubic, 75 % tetragonal.

Intensity of the 413 peak

While imperfections (using the term here to include thermal vibrations) may
alter the relative intensities of different planes, the magnitude of the effect for any
given plane will in general lie between the extreme values for the 001 and 100 planes,
and in no case can be very much greater than for one of these. But the
experimental fact here shows that 413 increases very greatly relative to both 105 (lying
nearly parallel to 001) and 431 (nearly perpendicular to 001). Hence one must fall
back on the explanation that a second structure has appeared, giving a strong line
in the same position as the tetragonal 413. This condition would be satisfied by a
cubic structure of the same cell-volume as the tetragonal, since their 413 spacings
differ by less than 0.01 %. Other lines of this cubic structure would coincide with
the corresponding tetragonal lines, which at the temperatures concerned are in
any case unresolved single lines. Hence the observed intensity changes indicate
the co-existence of a cubic and a tetragonal structure of the same cell-volume (or
very nearly the same) over a temperature range of a few degrees near the
'transition point', which thus loses its exact meaning. There is in effect a transition range.
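The near-coincidence of the cubic and tetragonal 413 spacings can be checked
with the standard tetragonal spacing formula, 1/d² = (h² + k²)/a² + l²/c². The
sketch below is illustrative only: the cell edge a = 4.0 (arbitrary length units)
and the axial ratio 1.005 are assumed values chosen to represent a crystal close
to the transition, not measurements from this paper.

```python
def spacing(h, k, l, a, c):
    """Interplanar spacing d for a tetragonal cell (cubic when a == c)."""
    inv_d2 = (h * h + k * k) / a ** 2 + (l * l) / c ** 2
    return inv_d2 ** -0.5


def relative_shift(hkl, a, c):
    """Fractional change in spacing on going from the tetragonal cell (a, c)
    to a cubic cell of the same volume."""
    a0 = (a * a * c) ** (1.0 / 3.0)   # cubic edge with equal cell volume
    d_tet = spacing(*hkl, a, c)
    d_cub = spacing(*hkl, a0, a0)
    return (d_cub - d_tet) / d_tet


a = 4.0          # assumed cell edge, arbitrary units
c = a * 1.005    # assumed axial ratio near the transition

for hkl in [(1, 0, 5), (3, 1, 4), (4, 1, 3), (4, 3, 1)]:
    print(hkl, f"{100 * relative_shift(hkl, a, c):+.4f} %")
```

With these assumed values the 413 shift comes out below one part in 10⁴, while
105 and 431 move by amounts of the order of one part in 10³, in opposite senses.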

On this assumption, a theoretical intensity curve was constructed for 116° C,
taking 25 % of the material as cubic and 75 % as tetragonal. This curve, shown in
figure 7 (d), is a much better approximation to the experimental curve than is the
pure tetragonal. At 118° C, a reasonable fit is obtained with a curve constructed
for 67 % of the material cubic.

As quantitative estimates of the proportions of cubic and tetragonal material 
these are of course very rough, but they do illustrate the point that the proportion 
of cubic increases with the temperature. 
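A two-structure curve of this kind is a weighted sum of the cubic and tetragonal
curves, each first brought to the same area. The sketch below shows the mixing
step and, as a substitute for matching by eye, a least-squares estimate of the
cubic fraction; the least-squares step is an assumption introduced here for
illustration and is not the procedure used in the paper. The three-point curves
are hypothetical numbers chosen only to make the arithmetic easy to follow.

```python
def mix(cubic, tetragonal, cubic_fraction):
    """Weighted sum of two equally sampled, equal-area intensity curves."""
    if len(cubic) != len(tetragonal):
        raise ValueError("curves must be sampled on the same grid")
    if not 0.0 <= cubic_fraction <= 1.0:
        raise ValueError("cubic_fraction must lie between 0 and 1")
    x = cubic_fraction
    return [x * ic + (1.0 - x) * it for ic, it in zip(cubic, tetragonal)]


def estimate_cubic_fraction(observed, cubic, tetragonal):
    """Least-squares value of x in observed = x*cubic + (1 - x)*tetragonal."""
    diff = [ic - it for ic, it in zip(cubic, tetragonal)]
    num = sum((o - it) * d for o, it, d in zip(observed, tetragonal, diff))
    den = sum(d * d for d in diff)
    return num / den


# Hypothetical curves: the cubic phase puts all its intensity at one position.
cubic_curve = [0.0, 4.0, 0.0]
tetragonal_curve = [2.0, 0.0, 2.0]
mixed = mix(cubic_curve, tetragonal_curve, 0.25)
print(mixed, estimate_cubic_fraction(mixed, cubic_curve, tetragonal_curve))
```

As the text notes, fractions obtained in this way are only rough; the sketch
does nothing to change that, since it ignores background and line-shape errors.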

That the effect is reversible, and not merely due to slow changes produced when
the specimen is held at the high temperature, is shown by the fact that photographs
taken at 117° and 117.5° C, at a time subsequent to all the others, fit into
intermediate positions between the 116° and 118° C photographs.

Intensity of the 105 peak 

In the above section, the working conclusion was reached that a cubic structure
appears at some temperature below 120° C and gives a line overlapping with the
tetragonal 413 line. This leaves for consideration the relative changes of the three
remaining peaks, 105, 314 and 431+501+510.

Any imperfections affecting the distribution of scattering matter in the c
direction will have a maximum effect on reflexion from the 001 plane and zero effect
on planes perpendicular to this. This type of imperfection will therefore affect
105, 314 and 431 in decreasing order of magnitude. This is in accordance with the
experimental results, where the weakening of 105 is most marked while 314 is also
noticeably weakened. In the next group of lines, weakening of 224 relative to 422
is also observable.
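The ordering of the planes can be made explicit by computing how nearly each
plane normal lies along the c axis. The sketch below uses cos²φ = l²/(h² + k² + l²),
which holds for a cubic cell and is taken here as an adequate approximation for an
axial ratio close to unity. Using cos²φ as the measure of sensitivity is an
assumption made for illustration; the text itself states only that the effect is
greatest for 001 and zero for planes perpendicular to it.

```python
from fractions import Fraction


def cos2_to_c_axis(h, k, l):
    """cos^2 of the angle between the normal to (hkl) and the c axis,
    in the cubic approximation (axial ratio taken as unity)."""
    return Fraction(l * l, h * h + k * k + l * l)


# The reflexions of the group with h^2 + k^2 + l^2 = 26.
planes = [(1, 0, 5), (3, 1, 4), (4, 1, 3), (4, 3, 1), (5, 0, 1), (5, 1, 0)]
ranked = sorted(planes, key=lambda p: cos2_to_c_axis(*p), reverse=True)

for p in ranked:
    print(p, cos2_to_c_axis(*p))
```

On this measure 105 is almost parallel to 001, 314 is intermediate, and 431, 501
and 510 are nearly or exactly perpendicular to it.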

There is thus evidence of changes affecting the c but not the a direction as the 
crystal approaches the transition range. They occur to an appreciable extent at 
temperatures below those where any noticeable amount of cubic material is formed. 
Possible mechanisms for them will be discussed later. 


2. Low temperature changes
Experimental 

There was no camera readily available for low-temperature work, and
circumstances limited what could be done to a quick general exploration.

A simple back-reflexion camera was used for most of the work. It was adapted
so that a cylindrical film-holder, of about 6 cm. diameter, could be mounted round
the specimen; this enabled a complete powder pattern to be obtained. No
precautions were taken for centring the specimen on the axis of the camera, and thus
the photographs were not suitable for measurement; but this was irrelevant to the
main point at issue, as visual examination would suffice to tell whether the
low-temperature pattern was essentially the same as that at room temperature.

For more detailed examination of the high-angle lines, a flat film was used in
the back-reflexion holder, about 5 cm. from the specimen. A longer distance would
have been preferable for increased accuracy, but would have prolonged the
exposure times inconveniently.

The film was wrapped in a black paper envelope. In front of it was placed a
film which had been uniformly exposed to light and developed. This formed a thin
silver screen which absorbed most of the incoherent secondary radiation scattered
by the titanium atoms, while only increasing the exposure time by a factor of
about 2.

Two different types of specimen were made, both using finely powdered material
from a fired disk. In one, the powder was packed into a thin celluloid tube; in the
other, it was mixed with gum tragacanth, moistened, and rolled into a rod. Both
kinds were roughly 6 mm. long and from 0.6 to 1 mm. in diameter. They were
stuck, either with gum tragacanth paste or by moistening the celluloid with
acetone, to a copper wire, which was coiled in a spiral about a knob in the bottom of
a small Dewar cup, and secured in place by the same adhesive. The cup was
mounted on the rotating axis of the camera so that the X-ray beam just cleared
its rim. The specimen could be centred on the axis of rotation by manipulating the
copper wire. The arrangement is shown in figure 8.



Figure 8. Experimental arrangement for low temperature photographs
(showing cylindrical film holder).

The low temperatures used were the fixed points given by solid carbon dioxide
(-78° C) and liquid oxygen (-183° C). Several different methods of achieving
these were tried.

For -78° C, small lumps of solid CO₂ were fitted into the cup, which was filled
with either methyl alcohol or acetone; the former was used with the celluloid
tube, the latter with the gum-tragacanth rod. In both cases the specimen softened
and bent slightly, but was not seriously displaced. The formation of ice crystals
(hoar-frost) on the rim of the cup and on the specimen was prevented by washing
these surfaces at intervals with cold liquid from the cup, applied with a small
paint-brush.

The part of the specimen projecting above the cup will, of course, be at a
temperature rather above -78° C. However, the ready formation of ice on it, if it is
left unwashed, indicates that it is well below 0° C; that it is reasonably constant
throughout the exposure in the part of the specimen irradiated may be deduced
from the internal evidence of the photographs, namely the sharpness of the
individual spots. In all these photographs a stationary specimen was used.

For liquid air temperatures, one photograph was taken with a stationary
specimen, the cup being kept replenished with liquid air. In another photograph,
the specimen was rotated, and a continuous stream of liquid air allowed to trickle
down it from a funnel with a narrowed stem. Only back-reflexion photographs
were taken with liquid air.


Results

Photographs taken on the cylindrical film at room temperature and with solid
CO₂ showed essentially the same structure, giving the normal pattern for BaTiO₃.
Close inspection, however, revealed small differences in the relative separations
and intensities of high-angle lines, notably the group of lines with h² + k² + l² = 26.

This group was examined in more detail with the back-reflexion camera. 

In these photographs it was not possible to identify the separate lines of the
group with absolute certainty. The difficulty is not wholly due to the rather
primitive technique, but is partly inherent in the material. Low-temperature
photographs with stationary specimens show individual spots about as sharp as those at
room temperature (showing that variation of temperature during the exposure
may be neglected) but the background of smaller spots is drawn out into a woolly
band indicating a large amount of strain inhomogeneity. The same effect produces
a blurring together of the lines in the photograph with the rotating specimen, in
marked contrast to their good definition at room temperature. It resembles the
blurring noticed just below 120° C, though it is still more pronounced, and it is
attributable to the same cause. Typical photographs are shown in figure 9, plate 5.

In spite of this lack of clarity, two points stand out with certainty: 

(i) In all the low-temperature photographs the axial ratio is markedly less 
than at room temperature. 

(ii) At CO₂ temperatures the intensity relations of the lines are abnormal.

Measurement of the photographs gives the following estimate of the axial ratios:

With solid CO₂     c/a = 1.0086 ± 0.0010.

With liquid air    c/a = 1.0030 ± 0.0020.

The intensity relationships with solid CO₂ may most probably be accounted for
by assuming that two effects are occurring simultaneously, as in the region of the
high-temperature transition: (a) the co-existence of a cubic and a tetragonal
structure, (b) a weakening of certain planes of the tetragonal structure relative
to the rest, in this case the planes which lie most nearly parallel to the tetrad axis.
A similar interpretation seems compatible with the liquid air photograph, though
the blurring renders it much less certain.

The decrease of axial ratio suggests the existence of a low-temperature transition
'point' below which the structure reverts to cubic. The anomalous intensities
suggest that this transition is of the same nature as the 120° change, though
extended over a greater temperature range. Figure 10 shows the variation of
axial ratio with temperature from -200° to +200° C, a dotted line being used
where its course is only roughly known.



Figure 10. Variation of axial ratio with temperature. (The vertical lines
through the points represent the limits of experimental error.)

Discussion

The transition as a λ-point change

The changes occurring at the transition point of BaTiO₃ are obviously
co-operative phenomena; the shape of the curves in figures 3 and 5 is that
characteristic of a thermodynamic change 'of the second order'. This phrase was originally
introduced (Ehrenfest 1933) to describe transitions where the changes of energy
and volume were zero but the specific heat and thermal expansion showed
discontinuities or anomalous increases. Whether this definition can be strictly upheld
is now in question, and Ubbelohde & Woodward (1945a) would prefer to use the
expression 'continuous changes'. They are often called 'λ-point phenomena' after
the characteristic λ-shaped curves for the specific heat and the thermal expansion.
They include the order-disorder changes in alloys, ferromagnetism, the transitions
in ammonium chloride, methane, Rochelle salt, potassium di-hydrogen phosphate,
and the α-β transition in quartz. Originally they were believed to occur without
change of structure, but recent work (Ubbelohde & Woodward 1945a) has shown
the change of structure accompanying the appearance of abnormal physical
properties in Rochelle salt and KH₂PO₄. A comparison of BaTiO₃ with these two
compounds is likely to be particularly illuminating, since it, like them, must be
classed as a ferroelectric. In some of the compounds mentioned, for example,
methane, NH₄Cl, and Rochelle salt, the transitions have been attributed to the
setting in of rotation of certain molecules or ions, but this is by no means
conclusively proved for any of them, and can certainly not be true for KH₂PO₄,
quartz, or BaTiO₃.

The present work brings forward no experimental evidence on the variation of
the specific heat of BaTiO₃ with temperature, but it is said by Wul & Goldman
(1945b) to show 'an abrupt variation of the specific heat at a temperature of about
126° C'.* (On the other hand there is evidence from the work of Bantle (1942) on
KD₂PO₄, another ferroelectric, that the specific heat anomaly may be
structure-sensitive, and depend on the existence of sufficiently large Weiss domains.) The
thermal expansion curves in figure 4 have the characteristic λ shape. There is no
detectable volume discontinuity, a fact which brings it within the original
definition of a second-order change (which would exclude, for example, that in NH₄Cl).
In this it closely resembles the α-β transition in quartz, where X-ray examination
of the spacings (Jay 1933) showed curves very similar to those of figure 3, except
that for quartz both expansions are positive and the high-symmetry form is not
cubic but uniaxial. In quartz, as in BaTiO₃, the linkage of the oxygen polyhedra
remains unaltered throughout the transition, but in quartz they are able to tilt
slightly from their high-symmetry positions, a possibility which in BaTiO₃ is
incompatible with the retention of the small unit cell actually observed.

The double transition suggested for BaTiO₃ is paralleled by that observed in
Rochelle salt. In it the high-symmetry form (here orthorhombic) exists above
24.6° and below -20° C, and the monoclinic structure in the intervening range is
derived from the orthorhombic by a very slight distortion (Ubbelohde &
Woodward 1946). As in BaTiO₃ (Megaw 1946a), the low-symmetry 'crystal' is an
intimate twin of small regions oriented in certain specified directions relative to
the original high-symmetry single crystal; and the change from single crystal to
twin is reversible. The same effect has been studied in KH₂PO₄ (de Quervain 1944;
Ubbelohde & Woodward 1945a, b), where the high-symmetry form is tetragonal.
In both these the abnormal electrical properties are again associated with the
low-symmetry range.

The co-existence of two forms over a finite temperature range has been observed
in NH₄Cl, and the mechanism of the process has been very elegantly explained by
Dinichert & Weigle (1944; Dinichert 1944). They suppose that, as the temperature
rises, small regions of the high-temperature form which appear locally do not break
away from the parent lattice, but produce local strains in their surroundings, which
raise the transition temperature for the strained material. The different thermal
expansions of the two phases then bring about the hysteresis which is
characteristic of the change. Such hysteresis has not been observed in BaTiO₃, but the
experiments were not designed to detect it.

* (Note added in proof.) A recent paper by Wul (1946b) quotes measurements by Sokolov
showing a maximum for the thermal capacity in the same region as for the dielectric constant.
The same paper gives a very interesting thermodynamic treatment of the dielectric properties,
though not directly linking it with the nature of the structural change.

In BaTiO₃, a change involving expansion in the a direction and contraction in
the c direction will be facilitated by anisotropic local stresses where compressive
forces in the c direction predominate. Crystals subject to such forces will invert
to cubic at a lower temperature than those where the stresses are isotropic. Since,
in a polycrystalline mass, there will be varying local stresses on the different
crystallites, the structure change in the material as a whole will take place over a
range of temperature, and will be reversible (though possibly with hysteresis) as
long as the texture remains unchanged by recrystallization or crystal growth.
Evidence of this reversibility has been obtained for (Ba, Sr)TiO₃ (Megaw 1946a),
and less definitely for BaTiO₃ (unpublished observations). Calculations of the
differential stresses required to produce a given change in the transition point of a
crystallite have not been carried out, but, since the difference of volume between
the two structures in the neighbourhood of the transition point is so small, the
equilibrium temperature is likely to be very sensitive to such stresses. It is
interesting in this connexion to notice that a dependence of dielectric constant on
pressure has been observed by Wul & Vereshchagin (1945; Wul 1946b).

It is only possible to guess at a mechanism which would explain the λ-point.

I am indebted to Dr Eisenschitz for the suggestion that one possibility is the
assumption of two quantum states for the titanium atom, which do not combine,
and which are degenerate for cubic symmetry but split up for tetragonal; to allow
for the attainment of equilibrium between the two phases, it must be postulated
that combination between the two states can occur if the crystal symmetry is
lower, as it may be momentarily and locally owing to thermal vibrations. The
atomic transitions will in this case be quite independent of one another, and the
co-operative character of the change as a whole is only achieved by the
accompanying alteration in atomic positions; in this it differs from ferromagnetism,
where the atomic transitions are themselves co-operative.

Another possible mechanism which might give a λ-point is the existence of two
equilibrium positions (one of them perhaps metastable) for one of the atoms. It
is necessary before discussing this to consider a little more fully the evidence for
locating the Ti atoms at the 000 positions. Figure 11 shows a section of the
structure in the (100) plane at height x = 0. It has always been assumed that the
structure has a centre of symmetry, as drawn in the figure; but if the centre were
absent, displacement of atoms along the c axis would be permissible. It has not
been possible so far to obtain conclusive evidence from piezo- or pyro-electric
tests. The agreement of calculated and (visually) observed intensities in the
room-temperature structure shows that the displacements, at least those of the cations,
are not large, but does not rule them out completely. The abnormal steric
conditions in BaTiO₃, which will be discussed more fully in the next section,
allow small displacements of the atoms lying on the tetrad axis to be made
without bringing them closer together than the sum of the Goldschmidt radii.
Displacements with a statistical distribution of plus and minus signs might
not give rise to a noticeable piezo-electric effect, and yet would not require a
larger unit cell.



Figure 11. Section of structure of BaTiO₃ in plane (100) at height x = 0. The small circles
represent Ti ions, the large circles O ions, drawn to scale using the Goldschmidt radii; the
dotted lines outline the unit cell.

The Ti-O distances in the c direction are 2.01 kX at room temperature, as
compared with 1.96 kX for the sum of the Goldschmidt radii. Thus either the Ti or the
O atom may be displaced by as much as 0.05 kX towards one of its neighbours; but
at the same time the distance from its opposite neighbour is increased, so that the
move may result in a gain in energy of the lattice as a whole. However, there
may be some displaced position which is metastable with respect to the
symmetrical position, and the existence of two discrete energy levels could be used to
account for the λ-curve. It is not necessary to assume that any large number of
the metastable positions are occupied at room temperature. This suggestion of a
mechanism is only very tentative, but it seems compatible with the general
properties of the structure.


The tetragonal structure

So far, no explanation of why the structure is tetragonal has been advanced.
It is generally accepted that barium titanium oxide is an isodesmic compound
(Evans 1938), since the forces throughout the lattice are essentially of the same
kind; the electrostatic bond strengths are all less than unity, having the values
2/12 and 4/6 respectively for the Ba-O and Ti-O coordination polyhedra. It is
most naturally considered as an ionic structure, since it satisfies all the usual
criteria for ionic structures, with one exception: the difficulty arises in explaining
its tetragonal symmetry, since that symmetry is intrinsic in the environment of
each atom and is not merely caused by the geometrical building together of the
co-ordination polyhedra.
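The quoted bond strengths follow from the usual definition of electrostatic bond
strength as cation charge divided by coordination number. The short sketch below
reproduces them and also checks the valence sum at oxygen; the oxygen
environment used (two Ti and four Ba neighbours) is that of the ideal perovskite
arrangement and is stated here as an assumption, since the text does not spell it out.

```python
from fractions import Fraction


def bond_strength(cation_charge, coordination_number):
    """Electrostatic bond strength: cation charge shared among its bonds."""
    return Fraction(cation_charge, coordination_number)


ba_o = bond_strength(2, 12)   # Ba2+ in 12-fold coordination
ti_o = bond_strength(4, 6)    # Ti4+ in 6-fold (octahedral) coordination

# Assumed ideal-perovskite environment: each O has 2 Ti and 4 Ba neighbours.
oxygen_sum = 2 * ti_o + 4 * ba_o

print(ba_o, ti_o, oxygen_sum)
```

Both strengths are below unity, as the text requires, and the sum at oxygen
equals the magnitude of the oxygen charge.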

The departure from cubic symmetry cannot be explained by purely steric
considerations. In all cubic compounds of the type A²⁺B⁴⁺O₃ the cell-dimensions
are determined by the network of linked B⁴⁺O₆ octahedra, the large cation
accommodating itself in the space between the octahedra. If this cation is rather too
small for the hole, as in CaTiO₃, it causes a collapse of the oxygen framework of
such a kind that the octahedra retain their dimensions and linkage but not
symmetrical orientation (Naray-Szabo 1943). BaTiO₃ is the unique example of a
compound with a cation just too large for the available hole (Megaw 1946b). The
oxygens round the Ti⁴⁺ ion are forced apart, leaving abnormally large Ti-O
distances, 1.99 and 2.01 kX in the a and c directions respectively, as compared with
1.96 kX for the sum of the Goldschmidt radii. The Ti ion can 'rattle' in its oxygen
octahedron. On purely steric grounds one might expect either the retention of a
cubic lattice with all the distances equally too large, or the formation of a
tetragonal lattice in which some of the distances, say those in the (001) plane, remained
normal; or possibly the occurrence of some quite different modification where by
means of lowered symmetry or a larger cell the normal distances were achieved;
but not a structure in which the distances are all abnormally large but different
in the different directions.
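The figures quoted can be tied together by simple arithmetic. The sketch below
computes the excess of each Ti-O distance over the sum of the Goldschmidt radii,
and the axial ratio implied by the two distances. It assumes that Ti lies at 000
with the oxygens at the mid-points of the cell edges, so that each Ti-O distance is
half the corresponding cell edge; that geometry is taken from the idealized
structure and is an assumption of this illustration.

```python
from decimal import Decimal

# Distances in kX, as quoted in the text.
ti_o_a = Decimal("1.99")        # Ti-O distance along a
ti_o_c = Decimal("2.01")        # Ti-O distance along c
radius_sum = Decimal("1.96")    # sum of the Goldschmidt radii

# Room left over in each direction beyond the normal ionic contact distance.
slack_a = ti_o_a - radius_sum
slack_c = ti_o_c - radius_sum

# Assumed geometry: O at mid-edge, so cell edge = 2 x (Ti-O distance).
cell_a = 2 * ti_o_a
cell_c = 2 * ti_o_c
axial_ratio = cell_c / cell_a

print(slack_a, slack_c, cell_a, cell_c, axial_ratio)
```

Both distances exceed the normal value, but by different amounts, which is the
feature that a purely steric argument fails to explain.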

Since it is thus impossible to account for the existing structure by means of
purely central electrostatic forces, one is driven to postulate that there are directed
bonds determining the uniaxial symmetry. Accepting Pauling's view that bonds
may be partly covalent and partly ionic, it is only necessary to assume a small
amount of this covalent character superimposed on a mainly ionic set of bonds in
order to explain the phenomena observed.

Since the cell-edge a in the room-temperature structure is shorter than c, the
bonds must lie in the (001) plane. It is natural to assume that they have a square
formation in this plane.

Evidence for directed bonds

, directed bonds can be attributed with any plausi- 

If the compound is strictly stoichiometric, we are concerned
with Ti(IV). There is evidence that this forms octahedral bonds in the compounds
TiS₂, TiSe₂, TiTe₂. Pauling & Huggins originally (1934) considered these bonds to
be 3d²4s4p³, but later (1937, 1941) changed their opinion and concluded that they
are 4s4p³4d², i.e. that they make use of the unstable d orbitals of the valence shell
itself. Huggins states that these are more polar than the 3d²4s4p³ bonds, and
should predominate for bonds connecting a metal atom to very electro-negative
atoms. These 4s4p³4d² bonds have properties intermediate between truly ionic
and truly covalent bonds; their relationship to the ionic makes more plausible
their appearance in such a compound as BaTiO₃.

The Ti-O distance which should be found with such bonds is given by the sum
of the Pauling covalent radii as 2.02 kX. Now, as pointed out above, it is an
experimental fact that the Ti occupies an abnormally large volume for an ionic
structure, the Ti-O distances being 2.01 and 1.99 kX as compared with 1.96 kX
for the interionic distance normally to be expected. The bond orbitals are very
sensitive to distance; and it seems probable that this abnormally large volume may
favour the formation of a covalent bond system which is associated with longer
Ti-O distances than the ionic. Because of this same sensitiveness, small changes
in the interatomic distances due to thermal expansion may suffice to bring about
the changes in the bond system which are needed to account for the change of
structure.

There is no previous evidence whatever for the occurrence of square bonds with
Ti, though they are of course well known for the transition elements. The square
complex for Ti(IV) may only be made possible by the very abnormal volume available
to the atom.

On the other hand, it is possible that BaTiO₃ is not stoichiometric, but contains
a small number of Ti³⁺ with corresponding vacant oxygen sites. It has not been
possible to analyse the material with the necessary accuracy to test this. The large
Ti volume would favour the occurrence of Ti³⁺, which has a radius of 0.69 Å as
compared with 0.64 for Ti⁴⁺.

Nothing is known about the covalent bond system of Ti(III). However, a different
kind of bond postulated by Klemm (1939) to occur between cations with
incomplete outer shells, attributable to anti-parallel spins, has been invoked by Ehrlich
(1939) to explain the magnetic susceptibility of TiO₂ on reduction. These bonds
are not formed by Ti⁴⁺, but as soon as reduction has given rise to some Ti³⁺ bond
formation is possible. If there are very few Ti³⁺ ions, they will be so widely separated
that the number of bonds is extremely small; as reduction increases the number
of Ti³⁺'s, the proportion of bonds increases and the proportion of 'free' Ti³⁺'s
drops. This accounts satisfactorily for the variation of paramagnetic susceptibility
with the composition. The amount of reduction required to produce an appreciable
number of bonds, while only a few per cent, is accompanied by very drastic colour
changes.

If there is free Ti³⁺ in normal BaTiO₃ it should be shown by its paramagnetism,
but so far no study of this is available. The colour of the material, which is typically
pale yellow, does not suggest that there is very much reduction.




Suggested explanation of the transition 

Without making any detailed assumptions about the origin of the directed bonds, 
it is possible to use the idea to explain the temperature changes of structure. A 
system is postulated of directed bonds superimposed on a mainly ionic structure, 
and only making a small contribution to the lattice energy; it is further postulated 
that these directed bonds will tend to weaken or disappear at high temperatures. 
To fit the experimental observations, it is assumed that at low temperatures the 
direction of the bonds is octahedral, and that the breakdown of the system takes 
place in two steps, giving as an intermediate stage a square bond complex. 

Consider first the high-temperature transition, which has been the more fully
examined. The directed bonds in the room-temperature structure lie entirely in
the (001) plane, and as the temperature rises they break. There is no discontinuous
change in lattice energy, and hence this disappearance of the directed bonds means
a redistribution of the attractive potential, the loss in the part attributable to
directed bonds being compensated by a gain in the ionic part, which is effective
in all directions and thus tends to reduce the c spacing. This accounts for the
abnormal linear expansions in the two directions, while the volume expansion
remains roughly normal. It also suggests an explanation for the change in mean
linear expansion coefficient at the transition point; the same temperature-increase
of energy can bring about a greater increase of interatomic distance when acting
against the purely ionic forces above the transition point than when it has to
overcome the stronger, partly covalent, forces below it.

At low temperatures the directed bonds coincide with the ionic, both being 
satisfied by a cubic structure. The low-temperature transition is brought about by 
the breaking of those directed bonds which lie in the c direction. 

Origin of anomalous intensities

Consider now the anomalous intensity changes accompanying the transitions. 
For the high-temperature transition, this was shown to involve changes in the 
distribution of scattering matter in the c direction but not in the a direction; while 
for the low-temperature transition it was concluded tentatively that the reverse 
was the case. 

Displacements of either Ti or O atoms of the kind suggested earlier in the paper
would bring about intensity changes in the right direction but of insufficient
magnitude. For example, displacement of the Ti atoms by 0.06 Å (the full amount
allowed in the room-temperature structure by the Goldschmidt radii) would only
reduce the intensity of the 105 line by about 5%; and the distance available for
such displacement becomes rapidly less with temperature, just over the region
where the intensity differences increase most markedly.
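The order of magnitude quoted above can be checked with a rough structure-factor sketch. Everything numerical here is an assumption made for illustration and is not taken from the paper: ideal perovskite positions, scattering powers set equal to atomic numbers, a c spacing of 4.03 Å, and the indices of the line taken as (105).

```python
import cmath
import math

# Assumed ideal perovskite cell (fractional coordinates) with crude
# scattering powers equal to atomic numbers; illustrative values only.
ATOMS = [
    ("Ba", 56.0, (0.0, 0.0, 0.0)),
    ("Ti", 22.0, (0.5, 0.5, 0.5)),
    ("O", 8.0, (0.5, 0.5, 0.0)),
    ("O", 8.0, (0.5, 0.0, 0.5)),
    ("O", 8.0, (0.0, 0.5, 0.5)),
]
C_AXIS_ANGSTROM = 4.03  # assumed c spacing, not quoted in this section


def intensity(hkl, ti_shift_angstrom=0.0):
    """Return |F|^2 for reflexion hkl with Ti displaced along c."""
    h, k, l = hkl
    total = 0j
    for name, power, (x, y, z) in ATOMS:
        if name == "Ti":
            z += ti_shift_angstrom / C_AXIS_ANGSTROM
        total += power * cmath.exp(2j * math.pi * (h * x + k * y + l * z))
    return abs(total) ** 2


def fractional_reduction(hkl, ti_shift_angstrom):
    """Fractional loss of intensity caused by the Ti displacement."""
    return 1.0 - intensity(hkl, ti_shift_angstrom) / intensity(hkl)


if __name__ == "__main__":
    print(f"{fractional_reduction((1, 0, 5), 0.06):.3f}")
```

Under these assumptions the loss for a 0.06 Å shift comes out at a few per cent, and a reflexion with l = 0 is unaffected, which is the qualitative behaviour the argument relies on.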

There are four other possible explanations of the weakening of reflexions of
high l.




(1) The amplitude of thermal vibrations parallel to the c-axis becomes very
large.

(2) There is a random displacement in the c direction of individual atoms from
their lattice positions.

(3) The single crystals may break up into small crystallites or 'islands' which
are plate-like parallel to (001), their thickness perpendicular to this being small
enough to lead to diffraction broadening.

(4) Individual crystals may show a range of c spacings brought about by local
stresses, with consequent broadening of the 105 line. Since the stresses are likely
to be roughly isotropic, this must imply that the Young's modulus in the c direction
is very much smaller than in the a direction (a not unreasonable supposition for a
material where the two thermal expansions are of opposite sign).

These four mechanisms are not entirely distinct. The last three differ from one
another chiefly in the size of the regions throughout which they require the c
displacements to be of the same kind; even in (2) there must be some sort of local
order. For individual crystals which are large enough to give discrete spots in
photographs with a stationary specimen, they should, in their extreme forms, be
distinguishable experimentally by the character of the distortion of spots on the
105 line at a temperature just below the transition point. However, in practice the
inhomogeneity in texture of the material masks this. All three effects may be
operative, but to a different degree in each crystallite, depending on its size and
the local stresses.

It is possible to put forward a crude picture to show why the random displacements
postulated under (2) should be in the c direction at the upper transition
point. Suppose that there is originally a square network of a-bonds, and that one
such bond is broken. The repulsive forces tend to make the atoms move farther apart,
but they cannot move in the plane because of the rigidity of the square network
which remains. However, there is only a very small component of the directed
bond preventing their movement out of the plane, and as such movement is
favoured by the increase in ionic attractive force in the c direction it accordingly
takes place.

The occurrence of thermal vibrations of large amplitude would lead to much 
the same physical picture. These imply a weakened restoring force in the c direction, 
a result which would be expected from the breaking of the c-bonds. Since there is 
not room for large enough amplitudes to be achieved by individual atoms independently
of their neighbours, the instantaneous description of the crystal
would show some degree of order, as would also occur with static displacements. 
So far as the present work goes, both these possibilities remain open. 

As increasingly large numbers of bonds are broken, a progressive crumpling-up 
of the (001) plane may ensue, in which the atoms are displaced, or vibrate, perpendicular
to the plane with only local order; or the displacements within a given
region may all be of the same kind, resulting in an island of tetragonal structure 
as under (3). 




The existence of such islands has been shown by Bradley (1940) for certain
Fe-Cu-Ni alloys cooled too quickly for equilibrium. In these, the equilibrium
structure consists of two face-centred cubic phases with slightly different dimensions.
The partly quenched material contains islands sharing a common (001)
plane with the parent structure, but having different axial ratios, 1.0095 and 0.992
in a typical example. These islands are platelets about 100 Å thick in the c direction
which have not yet broken away from the parent lattice to assume their own
characteristic a-spacing. There is a similarity here to the mechanism postulated by
Dinichert & Weigle (1944) for NH₄Cl. Owing to the thinness of these plates in the
c direction, they show diffraction broadening for lines of high l, the 004 line being
washed out completely against the background. The experimental evidence very
closely resembles that for BaTiO₃, except that in the latter case there is only one
tetragonal lattice involved, with an axial ratio greater than unity. There is, however,
one important difference in the conditions: in the Fe-Cu-Ni alloy, the effect
is essentially one of incomplete equilibrium, the partial separation having been
'frozen in' by subsequent cooling, whereas in BaTiO₃ as in NH₄Cl there appears
to be at least quasi-equilibrium.
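To give a feel for how much broadening platelets of this thickness would produce, the following is a minimal sketch using the Scherrer relation, which is a standard result and not a formula given in the paper. The shape factor 0.9, the Cu Kα wavelength of 1.5418 Å and the Bragg angle are assumptions chosen for illustration, and the thickness is taken along the scattering vector, as for a 00l reflexion.

```python
import math

CU_K_ALPHA_ANGSTROM = 1.5418  # assumed mean Cu K-alpha wavelength


def scherrer_breadth_rad(thickness_angstrom, bragg_angle_deg,
                         wavelength_angstrom=CU_K_ALPHA_ANGSTROM,
                         shape_factor=0.9):
    """Line breadth in 2-theta (radians) from the Scherrer relation.

    thickness_angstrom is the crystallite size measured along the
    scattering vector; shape_factor is the usual constant near unity.
    """
    theta = math.radians(bragg_angle_deg)
    return shape_factor * wavelength_angstrom / (
        thickness_angstrom * math.cos(theta))


if __name__ == "__main__":
    breadth = scherrer_breadth_rad(100.0, 60.0)
    print(f"{math.degrees(breadth):.2f} degrees")
```

For a 100 Å platelet the breadth at a Bragg angle of 60° is of the order of a degree or more in 2θ, and it grows as 1/cos θ towards the high angles where lines of high l are found.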

For the low-temperature transition, any suggestions must remain very tentative,
as the experimental evidence is so incomplete. However, a picture analogous to
that for the high-temperature transition appears to be plausible. The breaking of
the c-bonds increases the strength of the a-bonds. These will not allow movement
out of the (001) plane, which would increase their length, but instead give local
displacements in the plane, regions where the shorter spacing predominates alternating
with others where the c-bonds are unbroken and the a-bonds not shortened.
This results in weakening of the hk0 reflexions, while the 00l are unaffected.

I should like to express my thanks to Dr D. F. Rushman in connexion with
whose work this investigation arose, and I am greatly indebted to Sir Lawrence
Bragg for permission to use the high-temperature camera in the Crystallographic
Laboratory, Cambridge, for a part of the work.

Finally I wish to thank Mr van Moll and the Directors of Philips Lamps Ltd.,
in whose research laboratory this work was done, for permission to publish this
paper.


References

Bantle, W. 1942 Helv. Phys. Acta, 15, 373.

Barth, T. 1925 Norsk geol. Tidsskr. 8, 201.

Bradley, A. J. 1940 Proc. Phys. Soc. 52, 80.

Dinichert, P. 1944 Helv. Phys. Acta, 17, 389.

Dinichert, P. & Weigle, J. 1944 C.R. Soc. Phys. Genève, 61, 173.

Ehrenfest, P. 1933 Comm. Leiden Suppl. 75b.

Ehrlich, P. 1939 Z. Elektrochem. 45, 362.

Evans, R. C. 1939 Crystal chemistry. Camb. Univ. Press.

Huggins, M. L. 1937 J. Chem. Phys. 5, 527.

Jay, A. H. 1933 Proc. Roy. Soc. A, 142, 237.

Klemm, W. 1939 Z. Elektrochem. 45, 583.

Megaw, H. D. 1945 Nature, 155, 484.

Megaw, H. D. 1946a Nature, 157, 20.

Megaw, H. D. 1946b Proc. Phys. Soc. 58, 133.

Náray-Szabó, S. 1943 Naturwissenschaften, 31, 202.

Pauling, L. 1939 Nature of the chemical bond. New York: Cornell Univ. Press.

Pauling, L. & Huggins, M. L. 1934 Z. Kristallogr. 87, 205.

Quervain, M. de 1944 Helv. Phys. Acta, 17, 609.

Rooksby, H. P. 1945 Nature, 155, 484.

Rushman, D. F. & Strivens, M. A. 1946 Faraday Society Discussion on Dielectrics. (In the Press.)

Ubbelohde, A. R. & Woodward, I. 1945a Nature, 155, 170.

Ubbelohde, A. R. & Woodward, I. 1945b Nature, 156, 20.

Ubbelohde, A. R. & Woodward, I. 1946 Proc. Roy. Soc. A, 185, 448.

Wilson, A. J. C. 1941 Proc. Phys. Soc. 53, 235.

Wul, B. 1945 Nature, 156, 480.

Wul, B. 1946a Nature, 157, 808.

Wul, B. 1946b J. Phys. U.S.S.R. 9, 95.

Wul, B. M. & Goldman, I. 1945a C.R. Acad. Sci. U.R.S.S. 46, 139.

Wul, B. M. & Goldman, I. 1945b C.R. Acad. Sci. U.R.S.S. 49, 177.

Wul, B. M. & Vereshchagin, L. F. 1945 C.R. Acad. Sci. U.R.S.S. 48, 634.


Description of Plate 5

Figure 2. High angle lines on powder photographs of BaTiO₃ (Cu Kα radiation). (a) 20° C,
(b) 100° C, (c) 117° C, (d) 200° C.

Figure 9. Powder photographs of BaTiO₃, to show low temperature changes. (Back reflexion
on flat film with Cu Kα radiation.) (a) Specimen stationary, temperature about 20° C;
(b) same specimen as (a), stationary, temperature about −78° C; (c) same specimen as (a),
stationary, temperature about −183° C; (d) specimen rotated, temperature about 20° C
(a tungsten wire was embedded in the specimen, and gives a line overlapping with the outermost
line of the group N=26 due to BaTiO₃); (e) same specimen as (d), rotated, temperature
about −183° C (the line due to the tungsten wire is visible, marked W).



Visual adaptation in relation to brief conditioning stimuli 

By B. H. Crawford, M.Sc.

(Communicated by T. Smith, F.R.S. Received 24 July 1946)

(The full text of this paper is printed in Proc. Roy. Soc. B, 134, 283.)

Abstract

Measurements are made of the changes in state of visual adaptation before and 
after a change in conditioning stimulus. Liminal test stimulus is taken as a measure 
of visual adaptation. 

1. The liminal test stimulus starts to rise sharply about 0.1 sec. before the eye is
exposed to the conditioning stimulus. There is also a corresponding, but less marked,
rise before the conditioning stimulus is cut off. The remaining sections refer to
recovery of dark adaptation after conditioning stimuli of brief duration.

2. The course of recovery of dark adaptation is the same for both positive and
negative contrast of the test stimulus against the general background, the numerical
values of the contrast being equal.

3. The effect on recovery of dark adaptation of a steady illumination of the field 
of view is found to be nil until the state of adaptation corresponding to the field 
illumination is reached, when recovery ceases. 

4. The effect of area of test stimulus on the shape of curves of recovery of dark 
adaptation may be eliminated by the equivalent background brightness transformation;
the curves of recovery of equivalent background brightness are coincident
for all areas of test stimulus. 

5. Recovery of dark adaptation was measured under standard conditions for
twenty-six subjects. Personal variation was not reduced by making the equivalent
background transformation, indicating that the variation is due to real differences
in rate of recovery of the photochemical mechanism of the eye.

6. Recovery curves are given for two subjects for a wide range of intensities of
initial conditioning stimulus. 

7. A comparison of recovery curves after the two types of conditioning stimulus
(a flash covering a large central area of the field of view; a flash covering a small
excentric area of the field of view) shows that the equivalent background formula,
β = ..., already well established for static conditions, applies to this case also.

(β is equivalent background brightness, E is illumination at the subject's eye, θ is
the angle subtended at the eye between conditioning and test stimuli.)


8. The effect of a succession of conditioning stimuli has been reduced to a simple
formula, at least up to a total duration of the succession of 5 sec. Up to 0.3 sec. the
eye integrates perfectly the series of conditioning stimuli. Afterwards, integration
becomes progressively less perfect according to the relation

where e is the intensity of the equivalent single stimulus, E is the total intensity of
the multiple stimulus, p₀ is 0.3, and p is the total duration in seconds of the multiple
stimulus.

9. Experiments with conditioning stimuli of different colours show that, within
the range of these experiments, it is the scotopic brightness of stimulus which is 
related to subsequent recovery of dark adaptation, not the photopic brightness. 

10. Experiments in the laboratory with natural test objects (lantern slides of
various scenes) show that, when an object is on the limit of visibility, the average 
background brightness in the immediate neighbourhood of the object may be taken 
as an equivalent background brightness for the calculation of recovery times: a 
detailed consideration of the pattern of the object itself is unnecessary. 

11. The effect of the area of the conditioning stimulus, when this is neither very
large nor very small, is investigated. It is found that, under given conditions of
observation, each subject has a characteristic critical angle by which he can deflect 
his fixation from the test object and still perform the visual task required of him. 
In the experiments here described the visual task was to find the test object and 
re-align upon it some form of gun sight. 




Effect of crossed electric and magnetic fields on the 
helium spectrum* 

By J. S. Foster, F.R.S., McGill University, and
E. R. Pounder, Ph.D., Royal Canadian Air Force

(Received 26 February 1946)

[Plate 6]

The authors describe a double canal-ray source, and discuss the general features of greatly
improved analyses obtained through its use.

A small canal-ray tube is placed along the axis of a hollow core of the magnet, the pole tip
itself serving as the cathode with one small central perforation. A similar tube is symmetrically
pierced in the other core so that the double canal-ray tube provides opposing beams in the
region between the pole faces. Here small field plates are set up to establish strong electric
fields perpendicular to the canal rays. Light is taken out at right angles to both fields and
analyzed by a six-prism glass spectrograph.

Displacements and qualitative intensities are tabulated mainly for components of three
line groups built round and including λλ4922, 4472 and 4026 respectively. H = 25,800
oersteds while E = 10 to 131 kV/cm.

Resolved components remain sharp at all field strengths. A few components correspond to
Δm = ±2; but there is no evidence for Δm = ±3. Intensities of many components are
critically dependent upon electric field strength, and in at least one notable case (2³P-5³G)
displacements are similarly dependent. The latter is attributed to repulsion between levels
of different m values which nevertheless are forced to cross over each other.


Introduction

Effects due to uniform external electric fields are best seen in the parhelium spectrum.
The observed patterns (Foster 1927) are constant within each spectral series. As
interpreted by quantum mechanics, they show that pairs of levels (n, l, ±m) are
coincident and that the selection rule Δl = ±1 no longer applies, since all electric
combination lines appear at suitable fields, while Δm = 0, ±1 still holds.

Displacements and intensities are in rather good agreement with the theory, 
but some intensities show variations, probably due to unequal populations in 
initial states, and absorption within the source, which are not theoretically considered.

In the present work, a helium source is subjected to uniform electric and magnetic
fields crossed at right angles, while the light is taken to the spectrograph in a direction
perpendicular to each field. The observations show that the selection rule Δm = 0, ±1
is now extended, since transitions occur for which |Δm| = 2. It is equally clear,
from the absence of certain components, that |Δm| ≠ 3 in any known case.

Professor Bohr (1918) interpreted certain outer components in a supposedly
pure Zeeman analysis by Paschen and Back, as actually arising from a component of

* The research described in this paper was completed in 1937, prior to service in the Royal
Canadian Air Force by one of the authors.


Vol. 189. A. (1 May 1947)



electric field directed at right angles to the applied magnetic field. This has been
experimentally confirmed. It was not, however, until the introduction of quantum
mechanics that uniformly sharp components were theoretically expected to appear
in crossed fields of any strength. On this basis the theory of the present problem
has been worked out at Professor Heisenberg's Institute.*

On the experimental side, the problem is to establish and maintain steady electric 
and magnetic fields in a source of sufficient strength to permit analysis by an optical 
system of high resolution. Provided the design of the source is such as to give a 
suitable magnetic field, there is no further difficulty in keeping the field steady. 
On the other hand, electric fields in all strong sources depend to some extent upon 
space charges which may vary during the exposure. A major part of the experimental
problem is therefore similar to that encountered in the application of electric
fields alone. 

From earlier experiments with crossed fields (Foster 1931) it is already known that
in the sharp and principal series, where the effect of the electric field is small, there
is a nearly normal Zeeman separation. In the P-P series, however, the observed
separations were much less than normal while the diffuse series show more and the
P-P series much more than normal Zeeman splitting in low electric fields. These
wider structures are obviously complex and only partially resolved. They are
asymmetric in intensities and show large changes in relative intensities with
increasing electric field.

Steubing & Redpenning (1935) have published a more extended analysis of the 
helium spectrum in crossed fields. Owing to the presence of unsteady electric fields, 
these authors have found diffuse components at high electric fields where many lines 
fade out. Since these components are still sharp in fields higher than those established 
by Steubing & Redpenning, no confirmation of this work can be claimed. 


The light source and experimental procedure

Obviously, from what has been said, the design of a suitable discharge tube is an
important matter. The tube used in the present experiments ran along the axis of
the magnetic field. Briefly, it consisted of two canal-ray tubes opposing each other
with the result that the two streams of canal rays (after passing through the pole
pieces) ran in opposite directions through the central region between the poles.
Here separate field plates were used to establish an electric field at right angles to
the streams. Light was conveniently taken out through a side tube at right angles
to both electric and magnetic fields. Under operating conditions, each canal-ray
beam was actually projected for a short distance into the other discharge tube. The
oscillating positive ions thus aided ionization and the production of light.

Looking now at the central portion of the tube in more detail (figure 1), the two 
pole tips are soldered to an intermediate section of brass so shaped as to form with 


* Private communication. 




the pole tips a cylindrical unit in which the pole faces have a fixed separation of
7 mm. Steel shoulders at either end of this part of the tube prevent any motion
which otherwise might arise from unbalanced forces on the pole tips. A 0.06 in. hole
drilled along the axis of this central structure clears the canal-ray beams which are
limited to 0.04 in. diameter by perforated aluminium cathode disks. Owing to the
small hole through the steel, as required on this plan, the magnetic field realized in
the experiments is about 80% of that obtained with solid poles. The aluminium
disks not only limit the diameter of the beams (and thereby prevent bombardment
of the canal wall and electric field plates) but also minimize sputtering. Resting
on these disks, and closely fitted into the steel sleeve extensions from the pole tips,
are the glass tubes which support the anodes.



Figure 1. Double canal-ray tube (anodes not shown).

Returning now to the centre of the tube, a 0.200 in. hole is drilled in the brass
cylinder along a diameter. This hole is threaded for a short distance at each end, and 
is designed to receive the electric field plates and the gas connexion. At one end the 
I in. positive field plate is fitted with its insulating glass sleeve, and the whole is 
supported by an outer brass pipe threaded into the main tube. Into the other end 
the grounded plate is fitted, after having been channelled to permit flow of helium. 
This plate is also supported by an outer pipe threaded into the main tube. In this 
case, however, the rather heavy outer pipe is tapered to receive a ground glass 
connexion to the vacuum system. The grub screw which fixes the position of the 
field plate is sealed by the tapered joint. 

Finally, a central 0.080 in. hole is bored perpendicular to the one which mounts
the field plates and this is suitably reamed to form a conical hole through which the
light passes out to the spectrograph. Thus the observations are made in a direction
perpendicular to both applied fields. The side tube is sealed off by a glass plate
fitted over a flattened portion of the brass cylinder. Seals were made with suitable
grades of wax and cement.



On the way to a six-prism glass spectrograph the light passed through a
Wollaston prism. A quarter-wave plate was inserted into the beam with vertical
electric vector in order to reduce losses at the prism faces.

With the magnetic axis vertical, it was convenient to lift the upper core of the
magnet with block and tackle and insert the tube. The anodes were connected in 
parallel to a Kipp and Zonen rectifier (‘proton apparatus’) with resistance control 
in the primary. The electric field plates were attached to a second rectifier with 
Variac control. 

The tube could be sufficiently outgassed in a few hours. It was then possible to keep
a steady voltage of 10,000 to 12,000 on the anodes, while the electric field plates,
with 0.75 mm. separation, drew no measurable current at 10,000 V. Since the tube
could be operated at 0.7 mm. of purified helium, the double canal-ray feature provided
light of sufficient intensity even though the current did not exceed 1 mA.

Exposures varied from 1 to 3 hr. The magnetic field was 25,800 oersteds, while the
electric field varied in different experiments with a maximum of 131,000 V/cm.
The tubes were commonly good for a few runs but eventually failed owing to disintegration
of the glass tubes near the cathode surfaces. Other occasional failures
arose from breakdown of electric field plate insulation, or melting of wax seals. 
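As a rough cross-check on the figures in this section, the nominal field between parallel plates is simply the voltage divided by the gap. The sketch below is illustrative only: the function name is hypothetical, and the uniform-field formula ignores the space charges which, as the paper itself stresses, modify the actual field.

```python
def nominal_field_kv_per_cm(voltage_v, gap_mm):
    """Uniform-field estimate E = V/d, returned in kV/cm.

    Ignores space charge and edge effects, so it is only a nominal value.
    """
    gap_cm = gap_mm / 10.0
    return voltage_v / gap_cm / 1000.0


if __name__ == "__main__":
    # 10,000 V across a 0.75 mm gap, the figures quoted in the text.
    print(f"{nominal_field_kv_per_cm(10_000, 0.75):.1f} kV/cm")
```

The nominal value for 10,000 V across 0.75 mm is about 133 kV/cm, of the same order as the maximum field quoted above.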

Determination of field strengths

Owing to the method of fixing the separation of the pole pieces, there is no reason 
to expect the magnetic field to vary except through the current in the coils. This 
was kept constant, with the result that the normal Zeeman effects showed a field 
of 25,800 oersteds with a possible error of 2%. 
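The relation between a normal Zeeman separation and the field can be sketched as follows. The Lorentz unit used here, about 4.6686 × 10⁻⁵ cm⁻¹ per oersted, is the standard textbook value and is not quoted in the paper; the function names are hypothetical.

```python
# Normal Zeeman (Lorentz) unit in cm^-1 per oersted; standard value,
# assumed here rather than taken from the paper.
LORENTZ_UNIT_CM1_PER_OE = 4.6686e-5


def normal_zeeman_splitting(field_oe):
    """Separation in cm^-1 between adjacent normal Zeeman components."""
    return LORENTZ_UNIT_CM1_PER_OE * field_oe


def field_from_splitting(splitting_cm1):
    """Magnetic field in oersteds implied by a measured normal splitting."""
    return splitting_cm1 / LORENTZ_UNIT_CM1_PER_OE


if __name__ == "__main__":
    print(f"{normal_zeeman_splitting(25_800):.4f} cm^-1")
```

On this basis a field of 25,800 oersteds corresponds to a normal separation of about 1.2 cm⁻¹, and the quoted 2% uncertainty in the field to roughly 0.02 cm⁻¹ in that separation.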

Electric fields cannot be so directly determined, since the experiments show that 
space charges appreciably modify the field one might expect from the voltage across 
plates of known separation. Since, however, in most of our experiments the displacements
due to the electric field are very large compared with those due to the
magnetic field, one may consider the centre of gravity of the simpler complex structures
as having the theoretical displacement arising from electric field alone. This
assumption leads to rather small variations in calculated field when it is applied 
to a number of complex structures within the same group of lines. The mean value 
so obtained is therefore accepted, and is believed to have no greater possible error 
than that of the magnetic field. 


Results

What appear to be the more important measurements obtained in the experiments
are recorded in tables 1, 2 and 3. They represent effects in three line groups,
2P-4SDFP (parhelium) (λ4922), 2³P-4³SPDF (orthohelium) (λ4471) and
2³P-5³SPDFG (orthohelium) (λ4026) respectively. The intensities, as indicated
by the length of the lines in figure 2, are very rough estimates. Selected plates
are reproduced on figure 3, plate 6. It should be noted that these reproduced


Figure 2. Upper: displacement-field strength plot for λ4922 group.
Lower: same for λ4472 group. (Abscissa: displacements in cm.⁻¹)











photographs cannot be compared exactly since the dispersion on various plates
ranged between 14 and 27 cm.⁻¹/mm. Further, the enlargement factor is not
exactly constant for all prints.
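A dispersion stated in cm.⁻¹ per mm can be turned into the more familiar Å per mm through dλ = λ² dν̃. The sketch below is a unit conversion only, with a hypothetical function name; the example figures (23.8 cm.⁻¹/mm at 4922 Å) are those given in the heading of table 1.

```python
def linear_dispersion_angstrom_per_mm(wavelength_angstrom,
                                      dispersion_cm1_per_mm):
    """Convert a plate dispersion from cm^-1/mm to angstrom/mm.

    Uses |d(lambda)| = lambda^2 * |d(wavenumber)| with lambda in cm.
    """
    wavelength_cm = wavelength_angstrom * 1e-8
    dlambda_cm = wavelength_cm ** 2 * dispersion_cm1_per_mm
    return dlambda_cm * 1e8


if __name__ == "__main__":
    print(f"{linear_dispersion_angstrom_per_mm(4922.0, 23.8):.2f} A/mm")
```

At 4922 Å a dispersion of 23.8 cm.⁻¹/mm is thus a little under 6 Å per mm on the plate.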

In the tables and reproductions of the photographs the two polarizations are
referred to somewhat arbitrarily as perpendicular and parallel. The 's' polarization
designates that polarization which is perpendicular to the magnetic field and
parallel to the electric field. The 'p' polarization is the reverse of this.


Table 1. Displacement of components of 2P-4SDFP (parhelium) group
from normal D line in cm.⁻¹ at several electric field strengths


polarization 


P 


magnetic field strength H = 25,800 oersteds
normal D line is 4921.926 I.A. = 20,311.695 cm.⁻¹
dispersion at 4922 Å = 23.8 cm.⁻¹/mm.

field strength in kV/cm. 



68*7 

78-8 

120-2 

line 


displacements in cm.~^ 


2P--4S 

+ 509*3 

— 

— 


606-9 


— 

2P-4D 

28-4 

- 36*9 

33-0 

55-6 

50-6 


+ 26*6 

30-7 

— 

2P-4F 

-15*9 

16-8 

19-9 


17-4 

18*7 

21-9 

2P-4P 

61-2 

»66*0 

83-5 


65*6 

72-0 

94-0 

\ 

-67-5 



"2P-4S 

+ 607-9 

--- 


2P~4D 

29*6 

33*9 

66-6 


26*9 

31-6 

52-4 


24*4 

29*7 

49-5 


157^ 

1 



14*5 

V 18-6 

28-1 


. +13*6 

1 - 


2P-4F 

— 

17-7 

— 


-18*0 

19-0 

19-9 


19*5 

20-3 

•21-9 


21-5 

23*7 

32-8 

2P~4P 

59*5 

— 

82*6 


61*8 

— 


Discussion of results 

The electric field brings out the well-known new electric combination lines which
join with lines of the normal spectrum to form groups with a marked overall symmetry.
This symmetry arises from interaction (repulsion) between sub-levels with
the same n, m values in electric field alone. In crossed electric and magnetic fields




there is interaction also between levels of different m values. These interactions 
produce separations between certain levels which are quite comparable to Zeeman 
separations. On this basis, normal Zeeman separations no longer are uniformly 
maintained. Even in high electric fields the somewhat complex structures continue 
to show rapid variations of relative displacements with electric field. The resulting 
structures in most cases are beyond complete resolution in these experiments. 
Parenthetically, it should be noted that the weakest member of the orthohelium 
triplet failed to appear. 

Table 2. Displacement of components of 2³P-4³SPDF (orthohelium) group
from normal D line in cm.⁻¹ at several electric field strengths

magnetic field strength H = 25,800 oersteds
normal D line is 4471.48 I.A. = 22,357.707 cm.⁻¹
dispersion at 4471 Å = 20.3 cm.⁻¹/mm.

field strength in kV/cm.

17 56.2 70.4 116.1 131


polarization 


line 


displacements in cm." 

-1 




r2P-4S 

— 

— 

— 

1150*6 

— 




— 

— * 

— 

1148-1 

— 



2P-4P 

— 

— 

— 

235-5 

— 

5 


2P~4D 

+ 3-7 

14-1 

18-7 

30-0 

35-6 



+ 0-4 

10*7 

15-1 

28-8 

26-3 

32-8 



2P~4P 

-6-1 

18-4 

— 

— 

— 




-8-4 

22-6 

28-8 

45-6 

— 




-12-2 

24-9 

31-2 

49-2 

56-6 



r2P~4S 

— 


— 

1149-5 

— 



2P-4P 

— 

— 

— 

234-4 

— 



2P--4D 

+ 2-9 

+ 12-2 

10-0 

+ 16-3 

13-9 

+ 28-0 
26-6 

33-6 

P 


2P-4F 

-6*7 

17-8 

22-6 

-33-3 

34-5 

40-0 




-10-1 

19-0 

21-6 

27-6 

44-0 

46*4 


1 




— 

30-4 

50-1 

54-3 




Proof that |Δm| = 0, 1, 2




Structures of the simplest type offer the best opportunity for a clear interpretation
of some of the results. These are the components arising from transitions in which
the initial states have m values ±2. For known reasons which cannot be briefly
stated, some of these stand well apart from their neighbours under electric fields
alone, and this feature is retained in the present photographs. Here they may be
expected to provide two components only (m = 2 → m = 1; m = −2 → m = −1) on
the basis of the usual selection rule Δm = 0, ±1.





Actually, only three examples of such components with suitably clear separations
and intensities are found. The corresponding observations are set in heavy type
in the tables. Each structure contains at least three members of different wavelengths.
Except in the first example, the polarization is not the same for the three
components. These observations mean that in all cases where clear examination is
possible there are transitions in which m changes from ±2 to 0. It may be concluded
that in crossed fields


|Δm| = 0, 1, 2.
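The counting behind this argument can be made explicit with a small enumeration. It is only a bookkeeping sketch: the function name is hypothetical, the final 2P state is assumed to offer m = −1, 0, +1, and no intensities or energies are modelled.

```python
def allowed_transitions(initial_ms, final_ms, max_delta_m):
    """List (m_initial, m_final) pairs with |delta m| <= max_delta_m."""
    return [
        (m_i, m_f)
        for m_i in initial_ms
        for m_f in final_ms
        if abs(m_i - m_f) <= max_delta_m
    ]


FINAL_P_STATES = (-1, 0, 1)  # assumed m values of the final 2P level

if __name__ == "__main__":
    # Usual rule: two components only from initial m = +2 and -2.
    print(allowed_transitions((2, -2), FINAL_P_STATES, 1))
    # Extended rule: the transitions from m = +2 and -2 to m = 0 appear as well.
    print(allowed_transitions((2, -2), FINAL_P_STATES, 2))
```

With the usual rule the initial states m = ±2 give two transitions; admitting |Δm| = 2 adds the two transitions to m = 0, which is the kind of extra member the three observed structures require.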


Table 3. Displacement of components of 2³P-5³SPDFG (λ4026) (orthohelium)
group from normal D line in cm.⁻¹ at several electric field strengths


polarization 


8 


P 


magnetic field strength H = 25,800 oersteds
normal D line is 4026.190 I.A. = 24,830.38 cm.⁻¹
dispersion at 4026 Å = 15.6 cm.⁻¹/mm.

field strength in kV/cm.



41*4 

56*0 

64*0 

81 

line 


displacements in cm.”^ 


2P-5S 

— 

— 

+ 576*66 

— 


__ 

— 

574*24 

— 

2P-5P 

— 

120*7 

139*8 



— 

— 

— 

133*8 


— 

— 

— 

121*8 


— 

— 

— 

117*2 

2P-5D 

26*2 

35*4 

33*6 

40*1 

38*6 

50*9 


-j- 24*2 

30*9 

+ 35*8 

46*8 

2P^5F 

— 

— 

-1*79 



-2*31 

2*98 

2*70 

— 


4*60 

7*47 

7*50 

8*91 


6*94 

10*81 

10*52 

12*10 

2P-50 

31*4 

— 

52*3 

67*5 


34*0 


55*0 

71-9 




56*1 

I* 

— 

— 

-58*4 

— 

r2P-5S 

— 

— 

+ 575*58 


2P-5P 

— 

122*0 

125*3 



— 

119*8 

123*0 

— 


— 

— 

— 

119*4 

2P-5D 

4-22*8 

32*4 

+ 37*2 

48*8 

2P-5F 

-2*00 

1*89 

-1*90 

_ 


3*44 

3*52 

3*38 

— 


— 

5*52 

5*93 

— 


— 

6*72 

7*36 

— 

2P~5G 

— 

39*6 

43*6 

_ 


— 

48*6 

52*3 ^ 

-54*3 

— 


— 

— 

' — 

-78*0 



Foster and Pounder, Proc. Roy. Soc. A, volume 189, plate 6

Panels: 2P-4SDFP (λ4922) and 2³P-4³SPDF (λ4471), each labelled with the field strength in kV/cm.

Figure 3. H = 25,800 oersteds in all cases. E values as indicated. In each set the arrow
indicates the position of the undisplaced line.

(Facing p. 294)





Effect of crossed electric and magnetic fields on helium spectrum

It is, however, equally clear that there are no observed components for which
|Δm| = 3. Obviously these could possibly arise only from initial levels 4F, 4³P, 6³G
without electrical displacement, and 5³FG with relatively small displacements.
There is no question of adequate resolution of the components, should they appear.
The photographs show that the transitions |Δm| = 3 do not take place and
incidentally that the closely associated transitions (m = ±1; m = …)
where |Δm| = 2 are also missing.

We shall next consider more carefully a second new feature in crossed fields, viz.
a continuous interaction between terms, notably within the same close structure,
which leads to many striking changes in intensities and displacements with increasing
electric fields. An outstanding example is found in the polarization of 2³P-5³G.
While the plates show that the analysis obviously changes greatly with large
variations of the electric field, the observations near 64 kV/cm. are of particular interest.

In this photograph (the bottom one on figure 3, plate 6) it would at first seem that
the electric field had varied abruptly during the exposure and that a pair of spurious
components were so produced. This is certainly not the case; neither may the
effect be attributed to final states, since similar features are not found for other lines
of comparable displacement and intensity. Indeed some well-displaced components
are so sharp that on second consideration it seems certain that the source was
exceptionally steady, with only an unimportant small variation in electric field from
point to point along the axis of the tube.

From the above one may conclude that this particular effect is mainly in the 5³G
states and at the same time observe that the components from initial sub-levels
m = ±2 are well separated in polarization and therefore are not of prime
importance in the discussion. This leaves mainly the interaction between 5³G
sub-levels with m = 0, ±1. Indeed, the facts strongly suggest that the m = 0 level moves
from an inside position across the m = -1 level, to the outside as a result of the very
small range of electric fields represented.

Assuming the above qualitative explanation to be correct, it is clear that such 
effects will be detected on comparatively few plates and only under the most exact 
controls. This case emphasizes the ease with which components may be blurred in 
crossed fields. Evidently the demands upon the experimenter are here somewhat 
greater than is encountered in work with either field alone. 

The remainder of the fine details are left to the reader for examination. It may be
remarked that the present method is suitable for extensions. It would help to have
higher magnetic fields, and slightly inclined electric field plates, to give some
variations which would assist one in making positive identification of many more
components.

References

Bohr, N. 1918 K. Danske vidensk. Selsk. Skr. ser. 8, 1, 1.

Foster, J. S. 1927 Proc. Roy. Soc. A, 117, 137-163.

Foster, J. S. 1931 Proc. Roy. Soc. A, 131, 133-146.

Steubing & Redepenning 1935 Ann. Phys., Lpz., 24, 161.



The surprising amount of energy which can be collected 
from gases after the electric discharge has passed 

By Lord Rayleigh, F.R.S.

(Received 14 June 1946)

It was found in earlier work (Rayleigh 1940b) that a piece of gold sheet, say 1 cm. square,
hung up in a stream of glowing active nitrogen became red hot, and collected an amount of 
energy which was very surprising. It was necessary to assume that every molecule of 
nitrogen in the stream imparted 10 eV to the gold plate. 

In this form of experiment it was impossible to avoid large and undetermined losses of 
energy; thus the energy actually given up by each molecule was probably much larger still. 

A new form of experiment is now described in which this loss is avoided. A platinum strip
is kept hot by periodic discharges, and by making the experiment (1) with the platinum
exposed to the gas and (2) with the platinum protected by a thin glass sheath it is possible to
determine what part of the total energy is to be attributed to catalytic action of the discharge
products. This amount of energy is found to be very great. Reckoning in electron-volts per
molecule of gas present it increased rapidly as the gas pressure was lowered and at the lowest
pressure used it rose as high as 233 eV/mol. Results of the same order were obtained with
other gases, so it is not clear that the glow of active nitrogen is the essential condition. This
great liberation of energy much exceeds what can be explained by dissociation of the
molecules and single ionization of every atom which results, which would only afford 36 V.

In a former paper (Rayleigh 1940 b) I showed that metal plates, notably gold, silver, 
copper and platinum, hung up in a bulb containing glowing active nitrogen, but 
right away from the discharge which maintained the activity of the gas, would become 
red-hot, and that plates 1 cm. square of gold, silver and copper could even be made to 
melt, and run down.* This was made the basis of a minimum estimate of the energy
contained in active gas. A stream of it was maintained by an air pump, so that the 
volume passed could be compared with the energy liberated on the metal. The work 
was of an exploratory nature, and the methods used were crude, but they indicated 
that the energy which was collected from the gas was so great that every molecule 
in it must be excited to the extent of at least 10 eV. 

It was shown further that this energy was of the order of a thousand times the 
energy radiated under the most favourable conditions as visual nitrogen after-glow. 

As regards the energy collected by the metal, the form of experiment then used has 
obvious weak points. The gas has to travel a considerable distance before it gets to 
the metal where its energy is given up, and in the course of doing so it is exposed to 
considerable areas of glass wall, with which its molecules may make repeated
collisions. There is, moreover, a time lag while it travels considerable distances, and
during this period decay of activity may occur. [If the reader is inclined to complain 
that the word activity is here used somewhat vaguely it may be defined simply as 
the capacity to impart energy to metals like gold and platinum initially cold.] 
Another weak point was that the attempt to use the visual incandescence of the
metal as an indicator of temperature was crude. It became obvious that a method of
electrical resistance thermometry would be much better, and, indeed, this would
have been used had it not been that the uncertainties of the time (1939-40) made it
desirable to achieve what progress I could at once, without taking time to change
the method.

* Note added in proof. I have since melted a fine platinum wire in the same way.

In planning further experiments, it was desired to bring the metal catalyst as 
near as possible to the discharge where the gas was made active, so that it would 
have the minimum opportunity of losing its activity. At the same time it was 
necessary to make allowance for the direct-heating effect of the discharge on the
catalytic metal, which cannot be altogether avoided without making the gas travel 
considerable distances—and this was inadmissible. 

To produce the electrodeless discharges, the same oil condenser was used as in 
my previous work. It was charged by an induction coil with a slow mercury break. 
This break was generally of a design due to the late Sir James Mackenzie Davidson. 
An ebonite wheel with a conducting segment is mounted on an axle which is inclined 
at 45° to the horizontal. This wheel is immersed in mercury and makes and breaks 
contact once a revolution. The mercury is covered with a layer of alcohol. 





Figure 1. Reduction |. 


The discharge vessel is as shown in figure 1. Along the axis there is stretched a 
platinum strip, over which a glass sheath can be slipped. The platinum can be heated 
by a current from a battery, and the tube is wound with a wire coil, which is used to 
excite the electrodeless discharge. When the discharge passes, the platinum strip 
gets heated, and its resistance increases. This is attributed partly to the direct- 
heating action of the discharge, and partly to the action of the platinum strip in 
catalyzing the discharge products (ions and dissociated atoms?). 

During an experiment, the platinum strip is kept at an arbitrary but constant
resistance of R ohms. For this purpose it is made one arm of a Wheatstone's bridge,
and the current adjusted to heat the strip until the given resistance is attained. The
current ($C_1$ amp.) is measured. Then the glass sheath is slipped into position, so as
to screen the wire from dissociation products. A larger current ($C_2$ amp.) must now
be passed to restore the resistance to its standard value of R ohms. The energy
given up to the platinum wire by the dissociation products is

$(C_2^2 - C_1^2)\,R\,t$ W-sec.,

where t is the time interval in seconds from one discharge to the next.




This energy is derived from, and therefore is contained in, the gas volume v at
pressure p cm. of mercury.

Thus the energy per unit volume reckoned as at atmospheric pressure is

$\dfrac{76\,(C_2^2 - C_1^2)\,R\,t}{v\,p}$ W-sec./c.c.

In a typical experiment, when the conditions were adjusted to get the best
effect at the chosen pressure, $C_1$ = 0.78 amp., $C_2$ = 1.73 amp., R = 0.470 ohm,
t = 0.25 sec., v = 19 c.c., p = 2.6 × 10⁻³ cm. of mercury. In this case the energy is
432 W-sec./c.c. as at N.T.P., or 98.5 eV/mol. of nitrogen present.
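The arithmetic of the typical experiment can be retraced in a few lines. The following is only a sketch: the factor 76 (one atmosphere in cm. of mercury) is taken from the reduction to atmospheric pressure, while the number of molecules per c.c. at N.T.P. and the joule equivalent of the electron-volt are modern values assumed here, so the last figure agrees with the quoted 98.5 eV/mol. only to within a few per cent. The function names are illustrative.

```python
# Sketch of the energy reckoning for the typical experiment.
# Measured quantities, as quoted in the text.
C1 = 0.78    # amp., heating current with the strip exposed to the gas
C2 = 1.73    # amp., heating current with the glass sheath in place
R = 0.470    # ohm, resistance at which the strip is held
T_INTERVAL = 0.25   # sec., time from one discharge to the next
VOLUME = 19.0       # c.c., gas volume of the vessel
PRESSURE = 2.6e-3   # cm. of mercury

# Constants assumed for this sketch (modern values, not from the paper).
ATMOSPHERE_CM_HG = 76.0
MOLECULES_PER_CC_NTP = 2.687e19
JOULES_PER_EV = 1.602e-19


def energy_per_discharge(c1, c2, resistance, interval):
    """Extra electrical energy (W-sec.) needed when the catalytic heating is screened off."""
    return (c2 ** 2 - c1 ** 2) * resistance * interval


def energy_density_at_ntp(energy, volume, pressure_cm_hg):
    """Energy per c.c. of gas, reckoned as if compressed to atmospheric pressure."""
    return ATMOSPHERE_CM_HG * energy / (volume * pressure_cm_hg)


def ev_per_molecule(energy_density):
    """Convert W-sec./c.c. at N.T.P. to electron-volts per molecule present."""
    return energy_density / MOLECULES_PER_CC_NTP / JOULES_PER_EV


energy = energy_per_discharge(C1, C2, R, T_INTERVAL)
density = energy_density_at_ntp(energy, VOLUME, PRESSURE)
per_molecule = ev_per_molecule(density)

print(f"energy per discharge : {energy:.4f} W-sec.")
print(f"energy density (NTP) : {density:.0f} W-sec./c.c.")
print(f"energy per molecule  : {per_molecule:.0f} eV")
```

The first two figures depend only on the quantities given in the text; the third depends on the assumed constants.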

All the molecules in the vessel play their part, and must travel from the point in 
the discharge vessel where they receive energy to a point in the central strip where 
they give it up. They will not in general take the shortest path, that is, the path along 
the radius of the vessel. It is not worth while to enter upon elaborate calculations 
of the most probable length of path. I shall take it as 1 cm., about the same as the 
radius of the vessel. 

If the molecule is to receive energy and give it up more than once during a discharge,
it would in general have to travel some 2 cm. while the discharge is actually passing.
The duration of discharge (several oscillations) was estimated by photographing a
slit backed by its light, in a revolving mirror, and found to be 3.76 × 10⁻⁶ sec. This
requires the molecule considered to have a velocity of (2/3.76) × 10⁶ cm./sec. or say
5 × 10⁵ cm./sec. Now the molecular velocity of nitrogen is 5 × 10⁴ cm./sec. at 273° K.
To raise it 10 times, the absolute temperature must be increased 100 times, i.e. to
the temperature of the hottest stars. It seems clear, therefore, that a molecule
could not travel fast enough to give up energy to the platinum strip more than once
during the short duration of a discharge. If not, it must carry 98.5 eV at one time,
and it is not easy to see how according to current ideas it could do this. The energy of
dissociation of nitrogen is 7.4 (?) V, and this, together with the ionization of both the
atoms, would only afford 7.4 + 2 × 14.5 V or 36 V. Considering that the discharge gives
a band spectrum and not a line spectrum, the idea of complete dissociation and
ionization is somewhat fantastic; but even if we make this assumption the difficulty remains.
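The two estimates in this argument can be checked numerically. The sketch below assumes the r.m.s. speed from kinetic theory as the "molecular velocity" and uses modern constants; both are assumptions made here, and the resulting speed is only of the same order as the figure quoted in the text. The ionization potential of 14.5 V per atom is likewise taken as the intended value.

```python
import math

# Assumed modern constants (not from the paper).
BOLTZMANN = 1.380649e-23        # J/K
ATOMIC_MASS_UNIT = 1.66054e-27  # kg
N2_MASS = 28.0134 * ATOMIC_MASS_UNIT

# Quantities quoted in the text.
PATH_CM = 2.0              # cm., out to the discharge and back to the strip
DURATION_S = 3.76e-6       # sec., duration of one discharge
DISSOCIATION_EV = 7.4      # energy of dissociation of the molecule
IONIZATION_EV = 14.5       # ionization of each atom (assumed reading)
OBSERVED_EV = 98.5         # energy collected per molecule present

# Speed needed to make a second trip while the discharge lasts.
speed_needed = PATH_CM / DURATION_S                      # cm./sec.

# r.m.s. molecular speed of nitrogen at 273 K, in cm./sec.
speed_273 = 100.0 * math.sqrt(3.0 * BOLTZMANN * 273.0 / N2_MASS)

# Speed scales as the square root of absolute temperature.
temperature_factor = (speed_needed / speed_273) ** 2

# Energy available from complete dissociation plus single ionization.
available_ev = DISSOCIATION_EV + 2.0 * IONIZATION_EV

print(f"speed needed       : {speed_needed:.2e} cm./sec.")
print(f"speed at 273 K     : {speed_273:.2e} cm./sec.")
print(f"temperature factor : {temperature_factor:.0f}")
print(f"available energy   : {available_ev:.1f} eV against {OBSERVED_EV} eV observed")
```

On these assumptions the required temperature factor is of the order of one hundred, as stated, and the energy available falls well short of the energy collected.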

There appears to be no advantage in increasing the number of turns to the
maximum, and close winding was inconvenient, because the coil showed a tendency
to spark over. A coil having 26.5 turns was used in further experiments, as being
about the useful maximum.

The pressure was then varied, leaving the other conditions unaltered:

pressure (cm.)     W-sec./c.c.     eV/mol. present
26 × 10⁻³          6.32            2.16
12.4 × 10⁻³        19.1            4.37
5.2 × 10⁻³         92.3            21.1
2.6 × 10⁻³         309             70.6
1.5 × 10⁻³         643             147
1.1 × 10⁻³         1020            233




It appears, therefore, that the energy per molecule increases very greatly as the 
pressure is diminished. 

It was of interest to see whether these very large values of the energy per molecule 
are peculiar to nitrogen, or whether they would be found in other gases also. A few 
results only are available at present: 


gas          pressure (cm.)     W-sec./c.c.     eV/mol.
oxygen       2.6 × 10⁻³         432             98.5
hydrogen     4.3 × 10⁻³         470             107
nitrogen     2.0 × 10⁻³         309             70.6


These are of the same order of magnitude as the results for nitrogen, one of which 
is repeated for convenience of reference. 


Conclusion

If a platinum surface is exposed to nitrogen through which a discharge has
passed, the energy which it collects from the gas is greatly in excess of what would
be expected if the nitrogen were completely dissociated into atoms, and every
resulting atom was singly ionized. It is difficult to see how the result can be explained
by current conceptions. Similar results have been obtained with oxygen and
hydrogen.


References

Rayleigh, 1940a Proc. Roy. Soc. A, 176, 1.
Rayleigh, 1940b Proc. Roy. Soc. A, 176, 16.



Fragmentation of shell cases 

By N. F. Mott, F.R.S. 

(Received 14 December 1946)


A theory is given of the break-up of the metal case of a cylindrical ring-bomb, in which the
lines of fracture perpendicular to the axis of the bomb are predetermined. From the theory
an expression is given for the length of the average fragment; this is shown to depend, if
certain hypotheses are made, on the radius and velocity of the case at the moment of
break-up, and on the mechanical properties of the metal.

1. Introduction

This paper is the outcome of attempts made by the present author to find a theoretical 
basis for the prediction of the distribution in weight of the fragments of shell or 
bomb cases after detonation of the filling. Little attempt will be made here to relate 
the theory to experiment, because experimental determinations of the distribution 
of the weights of fragments have not usually been made under the conditions best 
suited for theoretical interpretation. The purpose of this paper is to present a theory, 
and to state the kind of experimental work which if carried out would show whether 
or not it is correct. It is suggested here that the average size of the fragments from 
a shell case of a ductile metal depends on a property of the metal which is not usually 
measured, namely, the scatter in the values of the reduction in area at which fracture 
occurs in a tensile test. Determination of this quantity is of interest for a theory of 
fracture in metals, and measurements of the fragment weight distribution are there¬ 
fore capable of giving information of fundamental interest about the properties of 
metals. 

When the explosive filling of a shell or bomb detonates, the case is subject initially
to an extremely high pressure from the gaseous products of detonation; under this
pressure it begins to move rapidly outwards. If the case is made of a ductile material
such as steel, very considerable plastic expansion, as much as 50 %, occurs before
the case breaks; this is most easily seen by examination of the larger fragments from
a shell which include parts of the original inner and outer surfaces; the distance
between these surfaces will be less than in the original case. By the time that fracture
occurs the velocity with which the case is moving will be nearly equal to the final
velocity with which the fragments are projected, and the gas pressure will have
dropped to a small fraction of its original value. The fragmentation of a shell case
must thus be thought of as the tearing apart of a rapidly expanding tube when the
material of the tube reaches the limit of its ductility.

Fracture in forged steel shell cases occurs normally by the formation of cracks
parallel to the axis of the cylindrical part of the case, giving the familiar long thin
fragments from bursting A.A. shells. A complete theory of fragmentation would
have to account for both the width and length of fragments. If, however, the wall
of the shell consists of a number of coaxial circular rings, all with the same inner and
outer radii, stacked one above the other round a thin steel inner lining, the problem
simplifies; each ring will break into a number of pieces, and the planes of fracture
will be parallel to the axis of the cylinder. Ring-bombs of this type have been made,
and are found to break up in the way stated here.



Figure 1. Case of shell before detonation and at moment of fracture.

Figure 2. Types of fracture of bomb casings.


The experimental arrangement to which the present theory will be applied will
thus be as follows: The explosive is in the form of a cylindrical tube with a thin metal
wall, of length several times the diameter to ensure that a steady detonation wave
is set up. The tube is encased in a series of metal rings all of the same dimensions,
which fit it closely. The explosive is detonated and the fragments collected. The
theoretical problem is to calculate the distribution of lengths (AB in figure 1) which
the fragments of the rings will be found to have.

Two main types of fracture have been observed in bomb and shell casings: shear
fracture, approximately at 45° to the circumference of the case, as in figure 2a,
and a combination of fibrous fracture (of the same type as at the bottom of the cup
and cone fracture in a tensile test) and shear fracture as in figure 2b. In § 3
consideration is given to the mechanism of fracture; for the purpose of the mathematical
theory to be developed here, it need only be assumed that fracture, once it has started,
takes place so rapidly that it can be considered instantaneous compared with the
rate of strain of the metal.



2. Theory of fragment length

The tensile strength of a brittle material such as glass shows a very considerable
scatter, of the order ±20 %, from specimen to specimen. This is because fracture
begins at one of a number of weak spots on the surface (the Griffith cracks); the
strength is believed to depend on the depth of the deepest crack, and this varies
from one specimen to another. The true stress at fracture of ductile materials such
as steel shows much less variation, as does also the reduction in area at which fracture
occurs. No systematic investigation of the magnitude of the scatter has been
made, but for specimens cut from the same bar it can be less than ±1 % for a steel
showing a total reduction in area of 50 %. Initiation of fracture in metals is not
necessarily a surface phenomenon; in a tensile test it starts in the interior of the
specimen and little is known of the nature of the points of weakness responsible.
Nevertheless, it is unlikely that even in a homogeneous material there will be
no scatter. In any case it will be assumed that there does exist a scatter
characteristic of the homogeneous material, and not due to inaccuracies in the apparatus.
It will be found that the magnitude of this scatter determines the fragment size.

Denote by s the plastic strain of the specimen, defined in terms of the cross-sectional
area A and the original area $A_0$.

Assume that the chance that an unfractured specimen of unit length will fracture
when the strain increases from s to s + ds is

$C e^{\gamma s}\,ds,$  (1)

where C and $\gamma$ are constants. The exponential expression is chosen as the simplest
form which gives a rapid increase from negligible to large values as s increases. With
this assumption, the chance p that the specimen breaks before a strain s is reached
is given by

$\dfrac{dp}{ds} = (1 - p)\,C e^{\gamma s},$

whence

$p = 1 - \exp\left\{-\dfrac{C}{\gamma}\,e^{\gamma s}\right\}.$

The average strain for fracture $s_0$ is given by

$s_0 = \dfrac{1}{\gamma}\left\{\log\dfrac{\gamma}{C} + \int_{-\infty}^{\infty} x\,e^{x}\exp(-e^{x})\,dx\right\},$

where

$\int_{-\infty}^{\infty} x\,e^{x}\exp(-e^{x})\,dx = -0.577,$

and the r.m.s. of the scatter in the strains at fracture is

$\dfrac{1.28}{\gamma}.$

A value of $\gamma$ equal to 128 would thus give a r.m.s. scatter of 0.01 in the strain at
which fracture occurs.
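The two numerical constants can be checked by direct quadrature. The sketch below assumes the substitution x = γs - log(γ/C), under which the distribution of fracture strains implied by (1) has the density e^x exp(-e^x) in x; the function names, integration limits and step count are choices made here for illustration only.

```python
import math


def density(x):
    """Density of x = gamma*s - log(gamma/C) implied by p = 1 - exp(-(C/gamma) e^(gamma s))."""
    return math.exp(x - math.exp(x))


def moments(lower=-40.0, upper=6.0, steps=200_000):
    """Trapezoidal estimates of the total probability, the mean and the r.m.s. scatter of x."""
    h = (upper - lower) / steps
    total = first = second = 0.0
    for i in range(steps + 1):
        x = lower + i * h
        weight = 0.5 if i in (0, steps) else 1.0
        f = weight * h * density(x)
        total += f
        first += f * x
        second += f * x * x
    mean = first / total
    rms = math.sqrt(second / total - mean * mean)
    return total, mean, rms


total, mean_x, rms_x = moments()

print(f"total probability : {total:.6f}")
print(f"mean of x         : {mean_x:.4f}")
print(f"r.m.s. scatter    : {rms_x:.4f}")
print(f"scatter in strain for gamma = 128 : {rms_x / 128:.4f}")
```

Dividing the mean and the scatter of x by γ gives the corresponding quantities for the strain, since s differs from x/γ only by a constant.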

The argument now is, that if y were infinite so that the strain at fracture were 
perfectly definite, the expanding shell case would break in all points at once. With 
the assumption (1), however, there is in any length of the material a finite probability 
of fracture which increases rapidly as s approaches the critical value Sq. As soon as 
fracture takes place at one point, stress is released in the neighbourhood, and the 
unstressed regions spread with a velocity which can be calculated. This is shown in 
figure 3; a fracture is supposed to have occurred at A and stress has been released 
in the regions AB and A'B' which are shaded. Fracture can no longer take place in 
the shaded regions; on the other hand, in the unshaded regions plastic flow is still 
going on, s is increasing and according to formula (1) fracture becomes more and 
more likely. The average size of fragment will be determined by the rate at which
the shaded regions in figure 3 spread and so prevent further fracture. Thus this
rate must be calculated.

Figure 3. Release of strain round a tensile fracture.


Let r be the radius of the case and v its velocity outwards at the moment of
break-up; it will be supposed that the thickness of the case is small compared with r, and
that the whole process of break-up takes place in a time interval during which there
is little change in r or v. Then the rate of strain ds/dt of the material is given by

$\dfrac{ds}{dt} = \dfrac{v}{r}.$

If in figure 3 an arbitrary point C be taken on the circumference of the case at a
distance a from the point of fracture, then relative to C the velocity of the whole
stress-free region AB is

$\dfrac{v}{r}(a - x),$

where x is the length AB. If $\rho$ is the density of the material and $P_F$ is its flow stress
in tension at the moment of fracture for the rates of strain concerned, then the
equation of motion of AB is

$\rho x\,\dfrac{d}{dt}\left\{\dfrac{v}{r}(a - x)\right\} = -P_F,$

which gives on integrating (v/r being treated as constant)

$x^2/t = 2rP_F/\rho v.$  (2)

The width x of the unstressed region thus increases as the square root of the time.
This solution treats the metal as inelastic, and gives an infinite value for the
initial velocity of the boundary B. A more exact solution* gives a value of the
velocity equal initially to that of sound in steel, but except in the opening stages
approximating closely to that given by (2).

Making use of (1) and (2), the distribution of fragment lengths can be found. In
a ring of circumference l (= 2πr), suppose that at any moment n fractures have formed.
Then the rate of increase of n with increasing strain is given by the equation

$\dfrac{dn}{ds} = l f C e^{\gamma s},$  (3)

where f is the proportion of the ring which is still stressed. It is convenient to
introduce a new variable

$\sigma = \gamma s.$

Then (3) becomes

$\dfrac{dn}{d\sigma} = \dfrac{l f C}{\gamma}\,e^{\sigma},$  (4)

and if a fracture occurs when $\sigma = \sigma_1$ the region round it which is unstressed and thus
safe from further fracture (shaded in figure 3) is at any subsequent value of $\sigma$, by (2),
equal to

$x = x_0(\sigma - \sigma_1)^{1/2},$  (5)

where $x_0 = (2P_F/\rho\gamma)^{1/2}\,r/v.$

The length $x_0$ is on dimensional grounds obviously proportional to the average
fragment length. The distribution of average fragment lengths was found empirically
as follows: Using equation (4) it was assumed that the first crack would form when
n was unity and thus for a value $\sigma_0$ of $\sigma$ given by

$\dfrac{lC}{\gamma}\,e^{\sigma_0} = 1.$

A line of length l (say 10 cm.) was ruled on paper to represent the circumference of
the cylinder and a mark, representing a fracture, made on it at a point P chosen at
random. The next fracture will occur when n = 2; before the line is cut at random a
second time to represent this happening, a range Δx must be marked off round the
first cut to represent the shaded region where the strain has disappeared. By (4)
the increase in $\sigma$ is unity, so the interval marked off on each side is $x_0$. The second
cut must now be placed on any part of the line excluding the shaded region. f is
now $(l - 2x_0)/l$; the increase in $\sigma$ before the third cut is made is by (4) given by

$f\,e^{\sigma - \sigma_0}\,\Delta\sigma = 1.$


* E. H. Lee (to be published). 



Round the second cut a shaded zone of $x_0(\Delta\sigma)^{1/2}$ must be drawn in, and the first must
be widened according to equation (5). The process was repeated until the whole line
was covered.

By repeating the whole procedure a number of times and measuring the lengths 
of the intervals between adjacent cuts on the line, a histogram was drawn giving 
the numbers of intervals between 0-4:Xq and and so on. This is shown in figure 4. 

The curve drawn through the histogram should give approximately the numbers 
of fragments with lengths between x and x + dx. 



Figure 4. Histogram of the intervals between adjacent cuts, plotted against $x/x_0$.

The calculations were made with $l/x_0 = 20$; the distribution would not be sensibly
different for larger values.
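The ruled-line procedure lends itself to a short program. The following is a sketch of one reading of it, not a reproduction of the original calculation: lengths are measured in units of x_0, σ is measured from σ_0, each new cut is made after an increase Δσ satisfying f e^(σ - σ_0) Δσ = 1 with f and σ taken at the start of the step, and the cut is placed at random on the part of the line still unshaded. The function names, the number of repetitions and the random seed are assumptions of the sketch.

```python
import math
import random


def shaded_intervals(cuts, sigma, length):
    """Merged stress-free intervals on [0, length]; cuts holds (position, sigma at cut)."""
    spans = []
    for pos, sigma_cut in cuts:
        half = math.sqrt(max(sigma - sigma_cut, 0.0))  # half-width in units of x_0, eq. (5)
        spans.append((max(0.0, pos - half), min(length, pos + half)))
    spans.sort()
    merged = []
    for lo, hi in spans:
        if merged and lo <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], hi))
        else:
            merged.append((lo, hi))
    return merged


def free_segments(merged, length):
    """Parts of [0, length] not covered by the merged shaded intervals."""
    free, edge = [], 0.0
    for lo, hi in merged:
        if lo > edge:
            free.append((edge, lo))
        edge = max(edge, hi)
    if edge < length:
        free.append((edge, length))
    return free


def simulate_ring(length, rng):
    """Cut one line until it is wholly shaded; return the intervals between adjacent cuts."""
    sigma = 0.0
    cuts = [(rng.uniform(0.0, length), 0.0)]  # first cut, made when n = 1
    for _ in range(10_000):  # safety guard only
        free = free_segments(shaded_intervals(cuts, sigma, length), length)
        free_len = sum(hi - lo for lo, hi in free)
        if free_len <= 0.0:
            break
        # Increase in sigma before the next cut: f * exp(sigma) * d_sigma = 1.
        sigma += length / (free_len * math.exp(sigma))
        # Widen the shaded zones before the next cut is placed.
        free = free_segments(shaded_intervals(cuts, sigma, length), length)
        free_len = sum(hi - lo for lo, hi in free)
        if free_len <= 0.0:
            break
        target = rng.uniform(0.0, free_len)
        for lo, hi in free:
            if target <= hi - lo:
                cuts.append((lo + target, sigma))
                break
            target -= hi - lo
    positions = sorted(pos for pos, _ in cuts)
    return [b - a for a, b in zip(positions, positions[1:])]


def pooled_fragments(n_rings=200, length=20.0, seed=1):
    """Repeat the procedure and pool the fragment lengths, in units of x_0."""
    rng = random.Random(seed)
    lengths = []
    for _ in range(n_rings):
        lengths.extend(simulate_ring(length, rng))
    return lengths


fragments = pooled_fragments()
mean_length = sum(fragments) / len(fragments)
print(f"{len(fragments)} fragments, mean length {mean_length:.2f} x_0")
```

A histogram of `fragments` plays the part of figure 4; the line is treated as having free ends, as on the ruled paper, so the two end pieces are not counted as fragments.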

It will be seen that:

(1) The fragments have lengths most of which lie between $x_0$ and $2x_0$, and that the
average length is about $1.5x_0$.

(2) $x_0$ is proportional to r, so that if the linear proportions of the bomb are scaled
up by a given factor, the average fragment length is changed by the same factor.

(3) $x_0$ is inversely proportional to v, the velocity of the case at the moment of
break-up.

As a numerical example, consider a bomb of diameter 3 in. so that r will be about
2 in. at the moment of break-up. Suppose that the wall thickness is such that
v = 2000 ft./sec., and that a steel case with $\rho$ = 480 lb./cu.ft. has a flow stress in the
work-hardened state of 50 tons/sq.in. Then we find

$x_0 = 1.6/\sqrt{\gamma}$ in.

Thus if $\gamma$ = 100, the average fragment length is about 0.24 in.
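The numerical example can be retraced as follows. This is a sketch under stated assumptions: the density is taken as 480 lb. per cubic foot (the usual figure for steel), the ton as the long ton of 2240 lb., and the density is converted to mass units by dividing by standard gravity. On these assumptions the coefficient comes out a little under 1.5, against the 1.6 quoted, so the agreement is only to within about ten per cent. The average length is taken as 1.5 x_0.

```python
import math

# Assumed conversion constants (not from the paper).
GRAVITY_IN_PER_SEC2 = 386.09   # standard gravity in in./sec.^2
CUBIC_IN_PER_CUBIC_FT = 1728.0
LB_PER_LONG_TON = 2240.0

# Quantities of the example.
RADIUS_IN = 2.0                          # in., radius at the moment of break-up
VELOCITY_IN_PER_SEC = 2000.0 * 12.0      # 2000 ft./sec.
DENSITY_LB_PER_CU_FT = 480.0             # assumed unit: lb. per cubic foot
FLOW_STRESS_TONS_PER_SQ_IN = 50.0


def x0_coefficient():
    """Value of x_0 * sqrt(gamma) in inches, from x_0 = (2 P_F / rho gamma)^(1/2) r / v."""
    mass_density = DENSITY_LB_PER_CU_FT / CUBIC_IN_PER_CUBIC_FT / GRAVITY_IN_PER_SEC2
    flow_stress = FLOW_STRESS_TONS_PER_SQ_IN * LB_PER_LONG_TON  # lb. wt./sq.in.
    return math.sqrt(2.0 * flow_stress / mass_density) * RADIUS_IN / VELOCITY_IN_PER_SEC


def average_fragment_length(gamma):
    """Average fragment length in inches, taken as 1.5 x_0."""
    return 1.5 * x0_coefficient() / math.sqrt(gamma)


coefficient = x0_coefficient()
average_100 = average_fragment_length(100.0)

print(f"x_0 = {coefficient:.2f} / sqrt(gamma) in.")
print(f"average fragment length for gamma = 100: {average_100:.2f} in.")
```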

3. A theoretical estimate of the constant $\gamma$ which defines the
scatter in the values of strain at fracture

In the theory of A. A. Griffith of the strength of brittle solids, it is assumed that
fracture starts at the deepest of a number of surface cracks. If c is the depth of a
crack, then according to Griffith the stress at which the crack will spread is

$P = \text{const.} \times (ET/c)^{1/2},$  (6)

where E is the elastic modulus and T the surface tension. The variation in the depth
of the deepest crack determines the variation in the tensile stress from specimen to
specimen.

For ductile metals no satisfactory theory exists which accounts for the initiation
of fracture at a given tensile stress. It will, however, be assumed that fracture
begins, as in brittle materials, at one of a number of 'weak points' distributed
throughout the material, and that the relation between the 'size' c of a weak point
and the stress required to start a fracture there is of the same type as (6), namely,

$P = \text{const.}/c^{n}.$  (7)

Fracture will then occur when, owing to the work-hardening of the material, the
maximum principal stress reaches a value great enough to start a crack at the weak
point for which c is largest. It will be assumed, moreover, that the values of c are
distributed about a mean value $c_0$ according to the Gaussian distribution function;
thus the chance per unit volume that there is a weak point with parameter between
c and c + dc will be taken to be equal to

$B \exp\{-(c - c_0)^2/2t^2\}\,dc.$  (8)

Thus, if a specimen of volume V has not fractured for a smaller stress, the chance
that it will fracture for a stress between P and P + dP is

$BV \exp\{-(c - c_0)^2/2t^2\}\,dc,$

where c is given in terms of P by (7).

Interest is centred only in a small range of values of c in the neighbourhood of the
largest. Therefore let

$c = c_B + x,$

where $c_B$ is defined by

$\int_{c_B}^{\infty} BV \exp\{-(c - c_0)^2/2t^2\}\,dc = 1.$  (9)

Neglecting terms in $x^2$, it is found that

$(c - c_0)^2 = (c_B - c_0)^2 + 2x(c_B - c_0),$  (10)

and, using this approximation, (9) gives

$\dfrac{BVt^2}{c_B - c_0}\exp\{-(c_B - c_0)^2/2t^2\} = 1.$

Now from (8), integrating over all values of c, it is clear that tB, apart from a
numerical factor of order unity, is equal to the total number of weak points per unit
volume. This is denoted by N, and N is assumed to be a very large number. Thus the
error will be small if

$(c_B - c_0)^2/2t^2 = \log_e(NV).$  (11)

If (10) is substituted in (8), then

const. $dx \exp\{-x(c_B - c_0)/t^2\}.$  (12)

A broad distribution of values of c will be assumed, so that t and $c_0$ are of the same
order of magnitude. Thus from (11) $c_B \gg c_0$. Hence approximately

$c_B(c_B - c_0)/t^2 = 2\log_e(NV).$

Thus the chance that the material contains a weak point with parameter x between
x and x + dx is, if x is small,

const. $dx \exp\{-2\log_e(NV)\,x/c_B\}.$

If the stress-strain curve for the material in tension is

$P = P(s),$

the constant $\gamma$ is thus

$\gamma = -2\log(NV)\,\dfrac{1}{c}\dfrac{dc}{ds},$

which gives from (7)

$\gamma = \dfrac{2}{n}\log(NV)\,\dfrac{1}{P}\dfrac{dP}{ds}.$

If the stress-strain curve for large strains is of the type

$P = P_1 + P_2\log(1 + s),$

this gives

$\gamma = \dfrac{2}{n}\log(NV)\,\dfrac{P_2}{P_F(1 + s_F)},$

where $P_F$ is the true stress and $s_F$ the plastic strain at fracture.

For n the most likely hypothesis will be to take n = ½, as for Griffith cracks, but
obviously other values are possible. The value of N, also, can only be guessed; it


may be bound up with the disloca|iions in cold-worked metals, which are of the 
order em. apart. Fortunately, since N occurs in a logarithm only, the exact 
value is not important. If iV = 10^^, then, if-F does not differ by more than a few 
powers of 10 from 1 c.c., 

$\gamma \approx 160\,P_2/\{P_F(1 + s_F)\}.$

Some values of $P_F$, $P_2$, $\gamma$ have been deduced below from curves given by Körber &
Rohland (1924):

               reduction     true U.T.S.
               in area       P_F (kg./mm.²)     P_2     γ
iron           0.83          54                 34      20
steel 0.1 C    0.70          70                 42      42
      0.25 C   0.63          80                 45      53
      0.45 C   0.57          82                 38      67


For mild steel, then, according to the formulae of § 2, a bomb of the design
suggested there would give fragments of average length 0.6 in.

Referring to formula (5), and assuming that log N is about the same for all
materials, it is seen that the average fragment length is proportional to the following
term which depends on the properties of the material:

$P_F\{(1 + s_F)/P_2\}^{1/2}.$

Thus a material with a high stress at fracture gives large fragments, and a high
ductility ($s_F$ is the strain at fracture) also gives large fragments. A rapid rate of
hardening near the fracture point (i.e. a large value of $P_2$) will lead to small fragments.
Since most metals harden much more rapidly in the initial stages of cold work, low
ductility will for this reason also lead to small fragments.

This work was carried out while the author was working in the Ministry of Supply 
and he wishes to thank the Director-General for Scientific Research and Develop¬ 
ment for permission to publish it. 


Reference

Körber, F. & Rohland, W. 1924 Mitt. K.-Wilh.-Inst. Eisenforsch. 5, 55. (See also Körber, F.
& Krisch, A. 1939 Handbuch der Werkstoffprüfung, 2, 44.)



On a method of increasing contrast in microscopy 

By Sir Thomas Merton, Treas.R.S.

(Received 29 August 1946. Read 14 November 1946)

[Plate 7]

When an object is viewed under the microscope, its visibility depends on its opacity, 
its colour and the extent to which it differs in its refractive index from the surrounding 
medium. There are a number of methods by which the contrast of a transparent 
object, whose refractive index differs slightly from the surrounding medium, can be 
enhanced. These include dark-ground illumination and, in the special case of bi- 
refringent materials, the use of polarized light. The most recent and successful 
addition to these methods is the phase-contrast technique which owes its inception 
to Zernike ( 1934 ) and has since Jbeen the subject of investigation by a number of 
other workers. 

It has occurred to me that a powerful method of increasing contrast in the case of 
transparent substances, in particular certain specimens of biological interest, would 
be provided by placing the object under investigation between two half-silvered or 
half-metallized surfaces which would thus constitute an interferometer. Our knowledge 
of the interference phenomena which occur under such conditions has been 
greatly enlarged in a recent series of papers by Tolansky (1943a, b, 1944a, b, 1945, 
1946). 

For the present purpose it may be noted that a large number of effective reflexions 
between the plates, giving rise to very narrow fringes, is not essential. The first 
experiments were carried out by half-silvering two plates of glass (which were not 
optically flat) so that the amount of light reflected from a surface was about 40 to 50 %. 
A small amount of pyroxyline was dissolved in amyl acetate and a fine spray was 
allowed to fall on one of the silvered surfaces by the use of an atomizer. When the 
amyl acetate had evaporated, the surface of the plate was covered with a large 
number of thin flat disks of pyroxyline. A drop of water was put on the plate and the 
second plate was firmly pressed down, the water which was driven out being removed 
with a piece of filter paper. The plates were put on the stage of a microscope and 
examined with a low-power objective, the source of light being a mercury arc lamp. 
The two yellow rays were absorbed by a thick sheet of didymium glass, the visible 
rays being in effect limited to the green line at λ 5461 A and the violet line at λ 4358 A. 
If the two plates had been optically flat and parallel to one another the field would 
(in the absence of the pyroxyline disks) have been uniform in colour, but as they 
were not perfectly flat the field was crossed by a number of broad bands which formed 
a kind of contour map showing the distance between the plates. With glass of 
reasonably good quality these bands were for the most part broad in comparison 
with the objects under investigation. At many parts of the field the colour alternated 
from green to violet, whilst at some points there was an approximate coincidence 
between the maxima, and somewhat neutral bands with intervening dark spaces 
were seen. The pyroxyline disks presented a remarkable appearance, being in general 
conspicuously different in colour or intensity to the particular part of the background 
against which they were seen. A disk might be seen as a green circle on a violet background 
or vice versa, and the thickening which occurred at the edges of the disks 
was manifest in many cases from the appearance of a narrow circle of different 
intensity or colour. These pyroxyline disks, when examined in the same way on 
unsilvered glass, were difficult objects to see. At this stage of the investigation the 
use of silvered surfaces was abandoned. Whilst these half-silvered surfaces are of 
the highest efficiency from an optical point of view they were so easily scratched or 
otherwise damaged that it was found preferable to use half-platinized surfaces in 
which the platinum had been 'burnt' into the glass and which were not impaired 
by any ordinary usage. The half-platinized surfaces were prepared by a slight 
modification of the method described by Rheinberg (1920). The plates were flowed 
over with a solution consisting of chloroplatinic acid 2.1 g., collodion 2 g., ethyl 
alcohol 77 c.c., acetone 33 c.c., which was compounded by dissolving the chloroplatinic 
acid in the alcohol and the collodion in the acetone and then mixing the two 
solutions. When the plate had been allowed to dry in a warm place it was put on 
a flat plate of stainless steel about J in. thick, a foot long and 4 in. in width. One 
end of the plate was heated by means of a bunsen burner to a temperature estimated 
at 500 to 600° C and the plate gradually pushed from the cold end to the hot end until 
the collodion had burned off and the platinum was left as a mirror on the glass. After 
cooling, the plate was flowed over with a similar solution in which about 0.2 % of 
bismuth chloride with a few drops of hydrochloric acid was substituted for the 
chloroplatinic acid. On burning off the collodion a slight bloom was left on the 
platinum, but this bloom was removed by waving the bunsen flame over the surface 
until it disappeared. With a little practice the thinnest microscope cover-glasses 
(about 0.17 mm. in thickness) can be successfully coated without distortion and the 
surfaces are remarkably durable and permanent. 

Certain difficulties arise when objectives of short focus and large N.A. are used. 
Since the rays pass through the plates in a wide cone, and since the difference of 
path of the successively reflected beams is 2dμ cos θ, where d is the distance between 
the plates, μ the refractive index of the medium and θ the angle which the rays make 
with the normal, it follows that the colours are diluted to a degree depending on the 
angle of the cone subtended and the distance between the plates. Moreover, since 
the interferometer constitutes an angular filter it follows that the objective cannot 
be uniformly illuminated, with the consequent loss of some resolving power. There 
are several methods which might be adopted to mitigate this difficulty, and it is 
proposed to make further experiments in this direction; possibly by the use of an 
annular stop or a zone plate immediately below the substage condenser. 
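
The dilution of the colours by a wide cone of rays can be illustrated numerically from the path difference 2dμ cos θ. The following is a rough sketch only: the function names are hypothetical, and the water film (μ = 1.33), the 20° half-angle of the cone (taken as measured inside the medium) and the plate separations are assumed values chosen for illustration, not figures from the experiments.

```python
import math

def path_difference(d, mu, theta_deg):
    """Path difference 2*d*mu*cos(theta) between successively reflected beams.

    d is the plate separation, mu the refractive index of the medium and
    theta_deg the angle in degrees between the ray and the normal
    (assumed here to be measured inside the medium).
    The result is in the same unit as d.
    """
    return 2.0 * d * mu * math.cos(math.radians(theta_deg))

def order_spread(d, mu, wavelength, half_angle_deg):
    """Change in the order of interference between the axis and the edge of the cone."""
    on_axis = path_difference(d, mu, 0.0) / wavelength
    at_edge = path_difference(d, mu, half_angle_deg) / wavelength
    return on_axis - at_edge

# Assumed illustrative values: water film, green mercury line, 20 degree half-angle.
GREEN_A = 5461.0  # wave-length in angstrom units
for d_microns in (1.0, 5.0, 20.0):
    d_angstrom = d_microns * 1.0e4  # 1 micron = 10^4 angstrom units
    spread = order_spread(d_angstrom, 1.33, GREEN_A, 20.0)
    print(f"d = {d_microns:5.1f} micron: order changes by {spread:.2f} across the cone")
```

With these assumed figures the order of interference changes by a fraction of an order across the cone when the separation is about a micron, and by more than a whole order when it is several microns, so that the colours from different directions overlap.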









I have found, however, that the following rather inelegant and unorthodox but 
very simple method gives excellent results. A piece of ground glass is placed on the 
stage of the microscope with the ground surface uppermost, or alternatively the 
underside of the platinized slide is ground. The image of a mercury lamp after 
traversing the didymium glass filter and after reflexion from the mirror of the microscope 
is brought to a focus on the ground-glass surface by means of a lens of If in. 
focus working at f/3.1. This results in the objective being illuminated in a manner 
determined by the polar curve of the ground glass, the greatest intensity being along 
the optical axis. This method has been used with objectives down to 4 mm. focus, and 
highly coloured fringes with good definition have been obtained. 

As a test object I have, following Burch & Stock (1942), used wet epithelial cells 
from the dorsum of the tongue. The cells are easily detached from the surface of the 
tongue, and a drop of the saliva containing them (with as few bubbles as possible) 
is laid on the platinized slide. The cover-glass is laid down on the drop and gently 
squeezed down under a piece of filter paper which removes the liquid which exudes 
from the sides. Many of these cells show very poor contrast when viewed in the usual 
manner by transmitted light, and indeed some of them can only be found with 
difficulty, but when seen between the half-platinized plates they are even more 
striking than the pyroxyline disks; the structure can be seen in great detail, and 
variations in thickness are clearly shown as changes in colour or intensity. I have not 
yet had the opportunity to try these methods with a 2 mm. oil-immersion objective, 
but there seems to be no reason to suppose that good results will not be obtained. 

For practical purposes the method seems to be limited to specimens not 
exceeding a few microns in thickness. For greater thicknesses the loss either of 
contrast or of resolving power seems prohibitive. For the highest degree of contrast 
it would be necessary to illuminate the specimen with parallel or nearly parallel 
light. This would reduce the resolving power to a degree which would seriously 
limit the application of the method. The distribution of intensity over the objective 
resulting from the use of the ground glass provides a useful compromise between 
loss of contrast and loss of resolving power: but it is essential that the distance 
between the semi-reflecting surfaces should be small. 

There is another method which may be useful occasionally for the study of very 
small particles in the case in which they can be examined in a dry state. 

When a film of cellulose nitrate or cellulose acetate is laid on a plate of glass with 
a film of water between the cellulosic film and the glass, the water will evaporate 
through the film, which is left in optical contact with the glass. Owing to the contraction 
of the film on drying it is necessary to paint over the junction of the film 
and the bare glass with a solution of gelatine which prevents the film from curling 
up and detaching itself from the glass on drying. 

It is possible to prepare films of silvered or platinized cellulosic material by 
silvering or platinizing a sheet of glass (in the case of platinizing the treatment with 
bismuth must be omitted) and by pouring over the surface, for example, a ^ % 
solution of cellulose nitrate in amyl acetate. When the solvent has evaporated the 
film can be detached from the glass and usually carries the silver or platinum with it. 
In a similar way metallized films of cellulose acetate can be prepared by the use of a 
16% solution of cellulose acetate in acetone which is spread over the silvered or 
platinized glass with a steel straight-edge with copper wires of 30 to 40 s.w.g. as 
distance pieces along the edges of the glass. 

If a piece of half-metallized film prepared in this way is laid on a half-metallized 
sheet of glass with a film of water between the surfaces and the water is allowed to 
evaporate, the two metallic surfaces are left in optical contact. If the water between 
the film and the glass contains any solid particles the cellulose film is raised by these 
particles on drying, and on examining the plate through a microscope with a mercury 
arc as a source of illumination, each particle is seen to be surrounded by a concentric 
series of interference fringes. By counting the number of rings and estimating the 
fraction of a ring, the thickness of the particle can at once be determined, N rings 
denoting a thickness Nλ/2. 

In the same way a fragment of a fibre will be flanked by a series of linear fringes 
which follow the contour of the fibre and which provide a measure of its thickness. 

The fringes which are produced in this way differ conspicuously from the Newton 
fringes which are seen when a half-silvered convex surface is laid on a half-silvered 
plate. Owing to the manner in which the cellulosic film is distorted by the particle, 
the fringes are closest together at the centre of the system and get wider and wider 
apart until zero difference of path is reached at a distance from the centre at which 
the two metallic films pass into metallic contact. 

It will be seen, however, that the exact law governing the separation of the rings 
is irrelevant in so far as the determination of the thickness of the particle is concerned, 
since it is only their number and an estimate of a fraction of a ring which is 
required. If the mercury line λ = 4358 A is used the first maximum would denote a 
thickness of 0.00022 mm., whilst a particle which raised the film to a height such 
that the first minimum fell on the centre of the particle would have a thickness of 
half this value or 0.00011 mm. The method provides a simple means of measuring 
very small particles and fibres with a considerable degree of precision. 
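
The arithmetic of the ring count can be sketched as follows. The figures quoted above (0.00022 mm. for the first maximum with the line at 4358 A) imply that each complete ring corresponds to half a wave-length of separation; the function name and the conversion constant below are merely illustrative and form no part of the method as described.

```python
def thickness_from_rings(rings, wavelength_angstrom=4358.0):
    """Thickness in mm for a given number of rings (fractions allowed).

    Each complete ring is taken as half a wave-length of separation
    between the two metallic surfaces, as implied by the quoted figures.
    """
    angstrom_to_mm = 1.0e-7
    return rings * (wavelength_angstrom / 2.0) * angstrom_to_mm

print(f"first maximum: {thickness_from_rings(1):.5f} mm")
print(f"first minimum: {thickness_from_rings(0.5):.5f} mm")
print(f"ten rings:     {thickness_from_rings(10):.5f} mm")
```

The first two values reproduce the thicknesses quoted in the text when rounded to five places; the third shows the order of size of a particle surrounded by ten complete rings.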

There is, however, a further consideration which may prove to be of greater importance. 
A perceptible darkening can be observed at a separation which is much 
smaller than the separation corresponding to the first minimum, and it seems possible 
that with a sufficiently powerful source of illumination and films of high reflecting 
power particles as small as 10“® mm. might be detected. 

It is hoped that these methods may find some useful application in the biological 
field. 

Note added December 1946. Since this paper has gone to the printers my attention 
has been called by Dr C. R. Burch, F.R.S., to the work of A. M. Frederikse (Acta Brev. 
Neerl. III, 8/9 (finit 10.x.1933)) in which the interferometer has been applied to the 
study of biological specimens. Frederikse’s paper covers a part of the work described 
in this communication and I am glad to be able to call attention to his work. 





References 

Burch, C. R. & Stock, J. P. P. 1942 J. Sci. Instrum. 19, 71. 
Rheinberg, J. 1920 Discussion on making reflecting surfaces. London: Physical Society. 
Tolansky, S. 1943a Nature, 152, 722. 
Tolansky, S. 1943b Phil. Mag. 34, 656. 
Tolansky, S. 1944a Nature, 153, 106. 
Tolansky, S. 1944b Phil. Mag. 35, 120. 
Tolansky, S. 1945 Proc. Roy. Soc. A, 184, 41. 
Tolansky, S. 1946 Proc. Roy. Soc. A, 186, 261. 
Zernike, F. 1934 Mon. Not. R. Astr. Soc. 94, 377. 


Description of Plate 7 

Figure 1. Epithelial cells from the tongue with groups of bacteria between half-platinized 
surfaces photographed in the light of the two mercury yellow lines. These cells are 
colourless and transparent (imperfect focusing). 

Figure 2. The same cells photographed in mercury green light (bacteria nearly invisible). 

Figure 3. The same cells photographed in mercury violet light (× 450). 

Figure 4. Two particles of rouge, showing ten and one complete rings respectively, between 
a platinized pellicle of cellulose acetate and a platinized plate. 


The spectra of flames containing oxides of sulphur 

By A. G. Gaydon, D.Sc., Chemical Engineering Department, Imperial College, 
and G. Whittingham, Ph.D., British Coal Utilization Research Association 

(Communicated by Sir Alfred Egerton, F.R.S. Received 30 May 1946) 

[Plates 8 and 9] 

The spectra and characteristics of hydrocarbon and related flames containing small amounts 
of SO2 and SO3 have been studied; observations have been made with a view to elucidating 
the mechanism of carbon formation in flames. 

Band systems due to S2, SO, CS and SH occur; this is the first record of the SH band in 
emission. The mechanism of the formation of these radicals is discussed. The outer cones of 
flames containing SO2 show strong ultra-violet continuous emission, and this is provisionally 
attributed to direct association between SO2 and atomic oxygen to give SO3. Failure to 
obtain luminous reaction between SO2 and atomic oxygen in a discharge tube at room 
temperature indicates that this reaction requires an activation energy. The formation of 
SO3 cannot be used as a quantitative test for atomic oxygen in flames. 

SO3 increases carbon deposition in hydrocarbon flames and it is considered that it 
can induce chain reactions leading to the polymerization and decomposition of hydrocarbons. 
These chain reactions are maintained by free radicals which may be produced 
either by direct reaction of sulphur trioxide with hydrocarbons or indirectly through the 
formation of peroxides. SO2 and H2S, which reduce carbon deposition, either inhibit the 
chain processes or remove carbon as soon as it is formed. 




Introduction 

The formation of sulphur oxides during the combustion of fuels containing sulphur 
compounds is a subject of technical importance, and the oxidation of sulphur 
dioxide in diffusion flames of CO-air, H2-air, and CH4-air has been studied by 
Dooley & Whittingham (1946) as part of an investigation on the role of sulphur 
compounds in combustion sponsored by the Boiler Availability Committee. For a 
given concentration of SO2 the highest conversion to SO3 was in the CO-air flame, 
and the luminous, smoky flame of CH4-air was the least effective in this respect. It 
was considered that the association reaction 

SO2 + O = SO3 + 81 kcal. 

was responsible for the oxidation. 

The role of atomic oxygen in combustion reactions has been discussed by Gaydon 
(1944), and a sensitive technique was described which appears to give direct evidence 
of the existence of O atoms in various flames. It was shown that the yellowish-green 
continuum emitted by flames to which small amounts of nitric oxide had been added 
was identical with the spectrum of the air afterglow and could be attributed to the 
reaction of NO with atomic oxygen. Application of this technique showed that there 
was a relatively high concentration of free oxygen atoms in CO flames, but during 
the initial stages of CH4 combustion the concentration was low. These results were 
supported by the extent to which SO2 was oxidized in CO-air and CH4-air flames, 
and in the discussion on the paper by Dooley & Whittingham one of us (A. G. G.) 
suggested that a continuous emission might be detected in flames containing oxides 
of sulphur due to the reaction 

SO2 + O = SO3 + 81 kcal., 

the emission lying at wave-lengths rather longer than 3600 A. 

The luminosity of coal-gas flames has been found to be markedly influenced by 
the presence of added sulphur oxides. It was shown (Whittingham 1945) that in 
the presence of about 4 % SO2 the luminosity of an under-aerated coal-gas flame was 
greatly reduced, and the resultant flame was strongly reminiscent of burning 
sulphur. On the other hand, the addition of about 0.1 % by volume of SO3 rendered 
a Bunsen flame luminous. Although a brief spectroscopic examination of the flames 
was carried out then, a more complete investigation seemed desirable, as it was 
thought that the results might have a bearing on the mechanism of carbon deposition 
from flames, a process about which comparatively little is known. 

There appears to be little published information on the spectra of flames containing 
oxides of sulphur, although the flame spectra of other sulphur compounds 
have received some attention. Fowler & Vaidya (1931) studied the flame spectra 
of sulphur, hydrogen sulphide and carbon disulphide, and the principal features of 
these were a strong band system due to sulphur, S2, and a weaker band system of 
sulphur monoxide, SO. The cool-flame spectrum of CS2 was investigated by Emeleus 
(1926), and, in addition to the above band systems, weak bands due to the CS radical 
were found to be present. 

In the present investigation a study has been made of the influence of added 
sulphur oxides on the spectra of CO, H2, CH4, C2H4, CH3OH and coal-gas flames, 
and the results, taken in conjunction with those obtained from the chemical analysis 
of the products of combustion of flames containing added SO2, afford useful information 
on the role of sulphur compounds in combustion reactions. Some preliminary 
observations of the spectra of coal-gas flames containing H2S have also been made 
for comparison. 


Experimental 

Sulphur dioxide was obtained from a syphon, and hydrogen, carbon monoxide, 
methane and ethylene were obtained from cylinders and were used without further 
purification or drying. Coal gas was burnt in a Bunsen burner and the other gases 
burnt at a quartz jet. Small amounts of SO3 were introduced into the flames by 
passing a slow stream of air through oleum and allowing this to mix with the main 
gas stream. 

The flame spectra were photographed on a Hilger small quartz instrument, this 
giving adequate dispersion for the purposes of this work, and a fairly wide slit was 
usually found adequate. 

Spectra and other characteristics of flames containing SO2 

Coal gas. An ordinary fully aerated Bunsen flame shows the usual C2, CH and 
OH bands in the blue-green inner cone, and the OH band and weak CO flame 
spectrum in the pale blue-violet outer cone. When sulphur dioxide up to about 
4 % by volume is introduced into a flame of this type the inner cone elongates 
slightly, due to a reduction in the flame speed, and becomes more blue-violet in 
colour, while the outer cone becomes much brighter. The spectrum of the inner cone 
now shows fairly strong SO bands (λλ 2877, 2968, 3164, 3271, 3383, 3502, etc.) and 
the CS bands around 2575 A, while the C2 and CH bands are weakened; the C2 bands are 
probably weakened to a greater extent than the CH (plate 8a and b). The spectrum 
of the outer cone is continuous apart from the OH bands, the continuum extending 
from the blue to well down in the ultra-violet. 

Analysis of the interconal gases in this type of flame has shown that about 25 % 
of the original SO2 is reduced to H2S and another 10 % to free sulphur. Small 
amounts of CS2 and COS are also produced, but the amount of SO3 formed in the 
inner cone is negligible. 

An under-aerated Bunsen flame is, of course, luminous, due to deposition of solid 
carbon, and the inner and outer cones are not clearly differentiated. If a Bunsen is 
adjusted so that the top of the flame is just luminous and then a little SO2 is added 
to the gas supply the luminosity, due to solid carbon, disappears completely. The 
spectrum of a flame of this type containing SO2 then shows fairly strong CS bands 
and some S2 bands in the violet and near ultra-violet and perhaps a trace of the SO 
bands. There is also a weak double-headed band, very similar in general appearance 
to the OH band, with heads at about 3236 and 3278 A; this is presumably to be 
identified with the SH band which has been reported in absorption by Lewis & 
White (1939) and has heads at 3236.6 and 3279.1 A. This band has never been 
recorded in emission before, despite many attempts to observe it. It does not occur 
in the fully aerated Bunsen flame, in the flame of hydrogen burning in air when SO2 
is present, nor in a flame of burning H2S. Previous failures to obtain this band in 
emission have been attributed to predissociation. These weak bands in the under-aerated 
flame are superposed on a relatively strong continuous emission extending 
from the blue to around 2500 A, but the continuum in the visible due to incandescent 
carbon particles is suppressed by the addition of SO2 (see plate 8c and d). 

Hydrogen. A simple flame of hydrogen burning in air at a quartz jet is practically 
non-luminous, the spectrum showing only the strong ultra-violet OH bands. On 
addition of a few per cent of SO2 the flame becomes blue-violet in colour and a weak 
blue inner cone is faintly visible. The spectrum of this faint inner cone shows fairly 
strong S2 bands (see plate 8e), but the SO bands have not been observed in this 
source. The outer cone of this flame (plate 8f) shows continuous emission and the 
OH bands; the maximum of the continuum is around 3400 A, and it extends from 
the blue to around 2800 A (the limits are not, of course, sharp, and can be considerably 
extended by increasing the exposure time). An aerated hydrogen flame shows 
a similar spectrum and again there is no sign of SO band structure. 

Carbon monoxide. The normal blue flame of carbon monoxide burning in air shows 
a continuous spectrum with maximum around 3700 A; the continuum extends from 
the green to about 2800 A, or further with long exposures. A weak band structure is 
superposed on this, and with undried gases the OH band at 3064 A is always strong. 

On addition of SO2 the flame becomes rather more violet in colour and there is some 
alteration in the distribution of the continuous spectrum. The maximum appears to 
shift slightly towards shorter wave-lengths and the ultra-violet end is definitely 
extended. The OH band is practically eliminated by introduction of SO2. Plate 8, 
g and h, shows the comparison between the normal CO flame and that with SO2 
added. With SO2 present the spectrum appears to be a summation of the continuum 
due to the CO flame and that due to SO2 as shown in the H2 + SO2 flame. 

Methane, CH4. The effect of SO2 on methane is very similar to that on coal gas. 
In the inner cone of the fully aerated flame, bands of SO and CS appear but not S2, 
and the C2 and CH bands are weakened (see plate 9b). With the under-aerated 
flame the SO2 has again a marked effect in reducing carbon formation. 

Ethylene, C2H4. The observations were again similar to those with coal gas. 
Introduction of SO2 into the aerated flame lengthened the inner cone slightly, 
weakened the C2 and CH bands and produced CS and SO bands and weak SH. In 
this case the S2 bands are fairly well developed in the outer cone. The hydrocarbon 
flame bands (HCO ?) which are relatively strong in this flame are probably suppressed 
by the addition of SO2, but it is rather difficult to be certain of this as the strong 
SO bands fall in the same region and mask the HCO bands when SO2 is introduced 
(see plate 9c and d). 

Methyl alcohol, CH3OH. A methyl alcohol-air flame was maintained by bubbling 
air through a gas washing bottle fitted with a sintered disk and containing methyl 
alcohol warmed to about 35° C, and burning the saturated vapour at a Pyrex jet. 
The normal flame showed chiefly OH and CO flame spectrum with some CH and 
weak C2 bands. The addition of about 4 % SO2 considerably strengthened the 
continuous emission and shifted it towards shorter wave-lengths and produced 
weak SO bands. The OH, CH and C2 bands were weakened. 

The effect of H2S on coal-gas flames 

When hydrogen sulphide is added to coal gas burning in a Bunsen burner the effects 
are generally very similar to those observed when SO2 is added. With a luminous 
under-aerated flame the amount of carbon formed is reduced by the addition of 
H2S. The spectrum of an under-aerated coal-gas flame, the luminosity of which has 
been removed by means of H2S, is generally similar to that of the corresponding flame 
with SO2. There is considerable continuous emission, upon which faint bands of S2, 
SH and CS are superposed. The S2 bands are probably stronger than in the corresponding 
flame containing SO2 while the CS and SH bands are less strong, the 
latter being barely discernible. 

With a fully aerated, i.e. Bunsen, flame the addition of H2S lengthens the inner 
cone, due to reduction in the flame speed, and the colour changes from blue-green 
to blue-violet. CS, SO and SH bands appear and C2 and CH are reduced in intensity. 
By comparison with the fully aerated flame containing SO2, the CS band produced 
by H2S is relatively strong. The SH is largely masked by the SO bands but the first 
head at 3236 A is clearly present; comparison of plates 8b and 9e shows the presence 
of the head when H2S is added but not with SO2. 

Spectra and other characteristics of flames containing SO3 

Coal gas. The most striking effect of the introduction of small amounts (of the 
order 0.1 % by volume) of sulphur trioxide into a Bunsen flame is the change from 
a clear to a luminous flame. This is most marked when the flame is initially adjusted 
to be just non-luminous, but formation of incandescent carbon particles occurs even 
with a fairly highly aerated flame. Separation of the two cones of the flame in a 
Smithells separator shows that the carbon formation occurs in the inner cone, and 
that the inner cone is lengthened slightly when SO3 is added. The spectrum of this 
inner cone shows, in addition to the continuum in the visible due to the hot particles, 
some SO bands and a trace of CS. The C2 and CH bands are somewhat weakened by 
the introduction of SO3, but the effect is less marked than with SO2; the OH bands 
are not appreciably reduced in intensity (see plate 9f and g). There is a slight 
increase in the intensity of the ultra-violet continuous emission from the outer cone, 
but this is very much less striking than with SO2. 




Hydrogen. The addition of SO3 to a hydrogen flame does not produce any marked 
change in the appearance of the flame. It becomes slightly more violet in colour, and 
the spectrum shows weak S2 bands. 

Carbon monoxide. The addition of SO3 produces little effect on the carbon monoxide 
flame. There may be some strengthening of the short-wave end of the continuous 
emission spectrum, but the effect is less marked than when SO2 is added. There is 
no very obvious reduction in the strength of the OH band, as with SO2, and no new 
bands appear in the spectrum; the concentration of SO3 was, however, much less 
than that of SO2. 


The reaction between sulphur dioxide and atomic oxygen 

It has been shown by Dooley & Whittingham (1946) that the oxidation of SO2 to 
SO3 in flames is probably due to reaction between atomic oxygen and the SO2. The 
reaction seems to bear some resemblance to that between nitric oxide and atomic 
oxygen to form nitrogen peroxide. This latter reaction appears to occur as a direct 
association at room temperature and is accompanied by the emission of a continuous 
spectrum with maximum in the yellow-green. It therefore appeared of 
interest to see whether there was any reaction between atomic oxygen and sulphur 
dioxide at room temperature. 

A simple discharge tube of the same general type as that used by Gaydon (1944) 
for studying the NO-O reaction was used. Commercial oxygen passed through the 
discharge at a pressure of a few mm. of mercury and then mixed with a stream of 
SO2 issuing from a narrow jet. A liquid-air trap served to protect the pump, and the 
tube was observed in a completely darkened room. The discharge through the 
oxygen is not very luminous, but quite a well-developed yellow-green air afterglow 
(from traces of nitrogen in the oxygen) was observed. The addition of a little SO2 
did not produce any further glow or have any marked effect on the air afterglow. 
A fast stream of SO2 did appear partially to suppress the air afterglow, although this 
effect may have been due, at least in part, to the raising of the gas pressure in the 
discharge tube by the addition of SO2, thus rendering the conditions less favourable 
for the production of an afterglow of long duration. Slight heating of the tube at 
the point of entry of the SO2 failed to produce any new glow, but did slightly suppress 
the air afterglow. 

Thus it is clear that there is no evidence for a luminous reaction between SO2 
and atomic oxygen at room temperature. It was, however, noticed that droplets 
were formed on the wall of the tube and tests showed these to consist of sulphuric 
acid. This presumably indicated formation of SO3 either by three-body collisions or 
by direct association without emission of light; the former seems more probable, 
but the SO3 molecule has six vibrational degrees of freedom and the possibility of the 
excess energy being stored in these, thus permitting direct two-body association 
without emission of light, cannot be entirely excluded. The formation of sulphuric 
acid at room temperature agrees with Geib & Harteck's (1934) observation that in 
the presence of atomic oxygen about 8 % of sulphur dioxide was oxidized to the 
trioxide at liquid-air temperatures. 

However, although there is no luminous reaction between SO2 and atomic oxygen 
in the cold, it is possible that such a reaction may take place at high temperatures. 
It seems that in a collision between NO and O the particles may run together on a 
stable potential surface (i.e. one with a minimum), and that before the particles 
swing apart there is a small chance that a transition may occur to the ground state 
of NO2 with the emission of radiation and stabilization to NO2. In the case of SO2 
and O the collision may occur on a repulsive potential surface and the particles will 
then approach sufficiently closely to give an appreciable chance of a transition to 
the ground state of SO3 with emission of radiation only if they collide with fairly 
high kinetic energy. 

The outer cones of flames containing SO2 emit strong continuous radiation, the
maximum intensity being around 3400 A and the continuum extending well towards
shorter wave-lengths. The reaction SO2 + O = SO3 yields about 81 kcal., which
corresponds to a wave-length of around 3500 A. Clearly radiation shorter than this
could be emitted only if the colliding particles possessed kinetic energy. It is noteworthy
that the flame in which the continuous emission is strongest, the CO flame,
is that in which most oxidation of SO2 occurs. This flame is also slightly hotter than
the hydrogen flame, and the continuum definitely extends further to short wave-lengths
(2500 A) in the CO flame than in the hydrogen flame (2800 A).
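
The correspondence between a heat of reaction and a limiting wave-length can be rechecked numerically. The following is a minimal sketch in Python and is not part of the original paper: the function names are hypothetical, and the rounded modern physical constants are an assumption, since the paper does not say which values it used.

```python
# Convert a molar reaction energy (kcal per mole) into the wave-length of a
# photon carrying the same energy per molecule, and back again.
# Assumption: modern rounded constants; the paper does not state its own.
PLANCK_J_S = 6.62607e-34      # Planck constant, J s
LIGHT_M_S = 2.99792e8         # speed of light, m/s
AVOGADRO = 6.02214e23         # molecules per mole
JOULES_PER_KCAL = 4184.0      # thermochemical kilocalorie


def wavelength_angstrom(kcal_per_mol: float) -> float:
    """Wave-length (angstroms) of a photon of energy kcal_per_mol / N_A."""
    energy_per_molecule = kcal_per_mol * JOULES_PER_KCAL / AVOGADRO  # joules
    wavelength_m = PLANCK_J_S * LIGHT_M_S / energy_per_molecule
    return wavelength_m * 1e10


def energy_kcal_per_mol(angstrom: float) -> float:
    """Molar energy (kcal) equivalent to light of the given wave-length."""
    joules_per_mol = PLANCK_J_S * LIGHT_M_S * AVOGADRO / (angstrom * 1e-10)
    return joules_per_mol / JOULES_PER_KCAL


if __name__ == "__main__":
    # Heat of the association reaction, as read from the text (about 81 kcal).
    print(round(wavelength_angstrom(81.0)), "A")
    # Energy needed to reach the short-wave limits quoted for the two flames.
    for limit in (2500.0, 2800.0):
        print(limit, "A ->", round(energy_kcal_per_mol(limit), 1), "kcal")
```

With these constants a heat of about 81 kcal. corresponds to roughly 3530 A, in line with the figure of around 3500 A, and emission at 2500 A needs roughly 114 kcal., so that the difference would have to come from the kinetic energy of the colliding particles.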

Thus it is possible to make out quite a good case for the process responsible for
the emission of the continuum from the outer cones of flames being identified with
the SO2 + O = SO3 + hν reaction. Against this it should be pointed out that discharge-tube
sources containing oxides of sulphur also emit strong continuous radiation, and
that the afterglow of SO2 (Gaydon 1934) also emits a strong continuum which was
believed to be due to the association process SO + O = SO2. On the whole it seems
likely that the continuum is due to reaction between SO2 and O to form SO3, but
that the formation of SO3 in flames is not necessarily dependent on this reaction
with emission of radiation, as there is evidence that combination can also occur in
the cold, either as the result of three-body collisions or as direct association without
radiation. Assuming that the continuum is emitted by this reaction, its strength
would be expected to increase with rising temperature, so that, even apart from the
possibility of three-body collisions forming SO3, one could hardly expect the rate
of formation of SO3 in flames to lead to a strictly quantitative measure of the
concentration of atomic oxygen.


Discussion of the band spectra

In discussing the spectroscopic observations it must first be remembered that the 
observations are of emission from electronically excited molecules or radicals. This 
emission may, in part, be thermal in origin, but there is good reason to believe that 
chemiluminescence is also important in determining the emission from flames. Thus


Vol. 189. A. 




A. G. Gaydon and G. Whittingham 

suppression of a band system, as, for example, the OH band when SO2 is introduced
into a flame of moist CO, might mean removal of the radical by some chemical
reaction, e.g.

SO2 + OH = SO3 + H - 20 kcal., (1)

but it might equally well be explained as deactivation of the electronically excited
radical without chemical action. Production of the band spectrum of a molecule
or radical may, similarly, be due to its formation in an excited state (chemiluminescence)
or to its formation in the normal state followed by thermal excitation.

In order to explain the various conceivable reactions which might account for the
formation or removal of molecular species or radicals some knowledge of the thermochemistry
of the process is desirable. Unfortunately, the thermochemistry of many
of the less stable compounds of sulphur is still in doubt, e.g. the dissociation energies
of S2 and CS. In the following discussion the values recommended by Gaydon
(1947) for the heats of formation (from free atoms) are used. These are (expressed
in kcal.): C2 83?, CH 80, CO 256, H2 103, O2 117, OH 101, S2 101, SO 119, CS 166,
H2O 219, H2O2 263, CO2 381, SO2 252, SO3 333, SH 70?. The energies of excitation
used are: CH* 67, OH* 93, C2* 55, CS* 104, SO* 103, SH* 88 (throughout the
discussion an asterisk denotes an electronically excited molecule).

One of the most striking experimental observations is the effect of SO2 on hydro¬ 
carbon flames. Carbon deposition in an under-aerated flame is suppressed and CS 
bands are emitted with a fairly high intensity. In a fully aerated flame the intensities 
of the C2 and CH bands in the inner cone are reduced and bands due to SO and 
CS appear, the latter being weaker than in the under-aerated flame. 

Among the possible reactions leading to the formation of CS and SO are: 


C + SO2 = CO + SO* + 20 kcal., (2)

C + SO2 = CS + O2 + 31 kcal., (3)

C2 + SO2 = CS* + CO2 + 108 kcal., (4)

C2* + SO2 = CS* + CO2 + 163 kcal., (5)

C2 + SO2 = C + CO + SO + 40 kcal., (6)

C2* + SO2 = C + CO + SO* - 8 kcal., (7)

CH* + SO2 = CS + H + O2 + 18 kcal., (8)

CH* + SO2 = CS + HO2 ± ?, (9)

CH* + SO2 = CO + H + SO + 110 kcal., (10)

CH* + SO2 = CO + H + SO* + 7 kcal., (11)

CH + SO2 = CO + H + SO + 43 kcal. (12)


Thus the emission of the CS band from hydrocarbon flames is more readily
accounted for in terms of reactions (4) and (5), as some degree of thermal excitation
would be required in (3). The only reactions producing SO* either require free carbon
atoms or else are slightly endothermic. The fact that the SO bands are strong in the
hot inner cones of fully aerated flames, where the C2 and CH bands are intense,
suggests that the SO bands are due to reactions (7) and (11) rather than (2).

Reactions such as (6) and (12), which form normal SO, followed by thermal
excitation, seem less likely as the SO bands are largely restricted to the reaction zone
of the inner cone.

The reduction in the strength of the OH bands when SO2 is present may be explained
by reaction (1). It is of interest to note here that all the bimolecular reactions
which lead to the formation of SO or SO* involve the formation of CO as well.
With the high value for the dissociation energy of CO used here (256 kcal.) this
presents no difficulty, but if a lower value for the dissociation energy were used, all
these reactions would liberate much less energy, becoming mostly very endothermic.
This constitutes an indirect argument in favour of the high value for the dissociation
energy of CO.

The occurrence of the weak SH bands in the under-aerated flames of hydrocarbons
is interesting, especially because the band is not known to occur in the flame of
H2 + SO2 or in the simple flame of burning H2S. It is thus improbable that it is due
to reaction between H, H2 or OH and SO2. It may be formed by some subsequent
reaction of either CS or SO formed during the combustion of hydrocarbons in the
presence of SO2. Possible reactions are:

SO* + H2 = OH + SH* - 36 kcal., (13)

SO + CH = CO + SH* + 39 kcal., (14)

CS + OH = CO + SH* - 29 kcal., (15)

CS* + OH = CO + SH* + 75 kcal., (16)

CS* + CH* = C2 + SH* - 10 kcal. (17)


Since the SH band occurs under the conditions in which the CS bands are strongest,
i.e. an under-aerated coal-gas flame containing SO2 and an aerated flame containing
H2S, and the SO bands are not present with appreciable strength, reactions (15) or
(16) seem the most likely.
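
The heats attached to reactions (1) to (17) can be rechecked from the heats of formation and excitation energies listed earlier. The sketch below is an illustration in Python, not the authors' procedure. The function and table names are hypothetical. Several digits and signs are hard to read in the printed source, and the readings adopted here (for instance SO2 252, C2* 55 and CH* 67 kcal.) are assumptions chosen because they make all the quoted heats mutually consistent.

```python
# Heats of formation from free atoms (kcal), as read from the text. Only the
# species entering reactions (1)-(17) are included; free atoms count as zero.
# The values for C2 and SH are marked doubtful ("?") in the source itself.
ATOMIZATION = {
    "H": 0, "C": 0, "O": 0,
    "C2": 83, "CH": 80, "CO": 256, "H2": 103, "O2": 117, "OH": 101,
    "SO": 119, "CS": 166, "CO2": 381, "SO2": 252, "SO3": 333, "SH": 70,
}
# Electronic excitation energies (kcal) of the starred species.
EXCITATION = {"CH": 67, "OH": 93, "C2": 55, "CS": 104, "SO": 103, "SH": 88}


def binding_energy(species: str) -> int:
    """Energy below the free atoms; an excited species is less tightly bound."""
    if species.endswith("*"):
        base = species[:-1]
        return ATOMIZATION[base] - EXCITATION[base]
    return ATOMIZATION[species]


def heat_released(reactants, products) -> int:
    """Heat liberated in kcal; a negative value means endothermic."""
    gained = sum(binding_energy(s) for s in products)
    spent = sum(binding_energy(s) for s in reactants)
    return gained - spent


# Reaction (9) is omitted: its heat is given as unknown in the text.
REACTIONS = {
    1: (["SO2", "OH"], ["SO3", "H"]),
    2: (["C", "SO2"], ["CO", "SO*"]),
    3: (["C", "SO2"], ["CS", "O2"]),
    4: (["C2", "SO2"], ["CS*", "CO2"]),
    5: (["C2*", "SO2"], ["CS*", "CO2"]),
    6: (["C2", "SO2"], ["C", "CO", "SO"]),
    7: (["C2*", "SO2"], ["C", "CO", "SO*"]),
    8: (["CH*", "SO2"], ["CS", "H", "O2"]),
    10: (["CH*", "SO2"], ["CO", "H", "SO"]),
    11: (["CH*", "SO2"], ["CO", "H", "SO*"]),
    12: (["CH", "SO2"], ["CO", "H", "SO"]),
    13: (["SO*", "H2"], ["OH", "SH*"]),
    14: (["SO", "CH"], ["CO", "SH*"]),
    15: (["CS", "OH"], ["CO", "SH*"]),
    16: (["CS*", "OH"], ["CO", "SH*"]),
    17: (["CS*", "CH*"], ["C2", "SH*"]),
}
# Heats as read from the text (kcal), for comparison.
QUOTED = {1: -20, 2: 20, 3: 31, 4: 108, 5: 163, 6: 40, 7: -8, 8: 18,
          10: 110, 11: 7, 12: 43, 13: -36, 14: 39, 15: -29, 16: 75, 17: -10}

if __name__ == "__main__":
    for number, (lhs, rhs) in REACTIONS.items():
        print(number, " + ".join(lhs), "=", " + ".join(rhs),
              heat_released(lhs, rhs), "kcal; quoted", QUOTED[number])
```

Under these readings every computed heat agrees with the quoted one, which supports in particular the signs adopted for reactions (13) to (17).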

The origin of the S2 bands in the hydrogen flame is not entirely clear. They are
known to be a persistent impurity in certain flames containing hydrogen and
appear especially if oxygen is burnt in an atmosphere of hydrogen containing a trace
of sulphur, or if an impure coal-gas flame is brought into contact with a cool surface.
Brinsley & Stephens (1946) have reported that as little as one part by weight of
sulphur in 4 X 10 ® parts of hydrogen is sufficient to produce the S2 bands. Most of
the bimolecular reactions which could form S2 in the H2 flame seem too endothermic
to appear likely. Brinsley & Stephens suggest the equilibrium

2SO ⇌ SO2 + ½S2

as responsible for the formation of sulphur.

A suggestion made to us by Mr H. G. Crone is that any sulphur present might be
strongly excited by the reaction

H + H + S2 = S2* + H2.



There would be sufficient energy from the combination of H atoms to excite S2,
but the margin is not very great, and this may account for the fact that only transitions
from a few of the lower vibrational levels of the excited state are observed in
this flame. A similar reaction with SO as the third body would not lead to formation
of SO* because the energy is insufficient. SO may be present in H2 + SO2 flames but
in an unexcited state, for example, through

H + SO2 = SO + OH,

a reaction which is slightly endothermic.

The introduction of small amounts of SO3 into hydrocarbon flames, in addition
to producing solid carbon, causes slight weakening of the C2 and CH bands and the
appearance of some SO bands. Deactivation of excited molecules and radicals may
play some part in reducing the C2 and CH bands, as SO3 has a large collision diameter,
but the SO emission and also this reduction in the strength of the C2 and CH
bands may be attributed to the action of SO2 arising out of the thermal decomposition
or reduction of SO3.


The mechanism of carbon formation in hydrogen flames 

Before discussing the role of sulphur oxides in relation to the problem of carbon
formation in flames, a subject which has a direct bearing on 'coking' in engines, it
is relevant to review previous work on the conditions under which C2 and CH bands
occur in combustion processes.

Not only are C2 and CH bands given by the flames of hydrocarbons burning at
atmospheric pressure, but they also appear when hydrocarbons (other than methane)
are subjected to the action of atomic hydrogen (Bonhoeffer & Harteck 1928) and
atomic oxygen (Geib & Vaidya 1941) at low pressure and room temperature. It is
considered that successive stripping of hydrogen atoms from the hydrocarbon
molecule will lead to the formation first of free radicals and finally of CH and C2.
As an alternative possibility the CH may be formed from C2 by reaction with either
OH or H atoms. We have also observed that when chlorine is added to a Bunsen
flame there is a very marked strengthening of the C2 emission, accompanied by an
increase in the flame speed, but the effect on the amount of carbon formation is not
great; this strengthening of C2 could again be explained as resulting from stripping
of hydrogen atoms from the hydrocarbon by chlorine.

Hsieh & Townend (1939) found that mixtures of hydrocarbons and air burning
under reduced pressure gave intense green flames, the colour being due to emission
from C2, and the excitation of CH and C2 was greatly dependent on the pressure and
composition of the mixture. The emission of C2 was related to the initial stages of
hydrocarbon combustion and also depended on the extent to which oxygen was
deficient. Smith (1940) followed this work up by observing that there was an abrupt
change from this green-flame condition to that of carbon formation in which the
C2 band emission disappeared.




It seems highly probable that the polymerization of C2 and of hydrocarbons has
some bearing on the problem of solid carbon deposition. The high-temperature
coefficient of the polymerization of C2 to solid carbon was remarked on by Klemenc,
Wechsberg & Wagner (1934) in their study of the decomposition of carbon suboxide

C3O2 = C2 + CO2.

Some observations on the pyrolysis and polymerization of hydrocarbons have been 
made which seem relevant to this discussion. In the pyrolysis of methane the 
influence of free radicals in inducing this process has been demonstrated by Hessels, 
van Krevelen & Waterman (1939). One important fact arising from their work was 
that the ratio of inductor to methane was very critical in determining the course of 
the reaction. The polymerization of ethylene is very sensitive to the presence of 
oxygen (Lenher 1931, 1932), and Storch (1934) has shown that one oxygen molecule
leads to the polymerization of 85 molecules of ethylene, thus indicating some chain 
process in the polymerization. It was also found by Taylor & Jones (1930) that 
alkyl radicals, formed by decomposition of metal organic compounds, induce 
polymerization in ethylene. 

In Storch's later work (1935) it was found that the polymerization of ethylene
was inhibited by small traces of sulphur compounds, particularly H2S, this effect
being attributed to the anti-oxidant effect of the sulphur compounds. It has been
suggested that the inducing effect of oxygen might be due to the formation of
peroxidic bodies or complexes which subsequently decompose to free radicals which
maintain the polymerization chains. With methane there is some evidence, from the
work of Waterman & van Vlodrop (1939) on the influence of H2S on the pyrolysis,
that at temperatures above 1000° C carbon deposition is very small and H2S is
converted to CS2.

The suppression of carbon formation in luminous flames by the addition of SO2
or H2S can be attributed to either interference with the sequence of reactions
leading to the formation of carbon aggregates, or reaction with the carbon aggregates
once formed, or both. The reaction of solid carbon with SO2 has been studied by
Lepsoe (1940), and is extremely rapid above 900° C; this, and the observation that
the CS band is produced strongly when carbon deposition is being suppressed,
suggests that removal of solid carbon is at least contributory to the reduction of
luminosity when SO2 or H2S is added to a flame. On the other hand, the reduction
in flame speed, noticeable by the lengthening of the inner cone, indicates that SO2
and H2S do also interfere with the combustion reactions. Consequently, removal of
radicals by SO2 or H2S is probable, and this may reduce polymerization and carbon
formation.

The amount of SO3 required to produce a marked increase in carbon deposition is
very small, about 0.1% by volume. This clearly could not produce a significant change
in flame temperature and so could not have a purely thermal effect on polymerization
of C2 or hydrocarbons. Furthermore, since SO3 is likely to be rapidly destroyed
in the inner cone of a Bunsen flame its effect must be confined to the early stages of
combustion. This appears to indicate that SO3 must in some way initiate some chain
process the end-product of which is solid carbon. The effect of SO3 on the C2 bands is
not very marked, and it has already been seen that polymerization of hydrocarbons
such as methane and ethylene tends to proceed by a chain mechanism in which
free radicals are probably important. Hence it seems most probable that the action
of SO3 is to initiate polymerization of hydrocarbons, the higher polymers of which
subsequently decompose thermally to solid carbon. It is uncertain whether the SO3
first reacts with the hydrocarbon to form a peroxide, which subsequently decomposes
to a radical which initiates the polymerization, or whether the SO3 attacks the
hydrocarbon with formation of OH or H2O, SO2 and a free radical. It may be noted
in passing that the effect of SO3 in inducing carbon formation is so much greater
than that of SO2 in suppressing it, that the second-order effect of the SO2 formed by
reaction of SO3 will be practically negligible.

Thus one may conclude that present observations of the influence of SO2, H2S
and SO3 on carbon formation in flames of burning hydrocarbons are best explained
by assuming that the carbon formation in flames depends on chain reactions,
maintained by free radicals, which lead to polymerization of the hydrocarbons,
followed by their thermal decomposition. SO2 and H2S reduce carbon deposition
partly by removing the solid carbon already formed and partly by interfering with
the radicals (or the peroxides from which they are formed) which maintain the
polymerization chains. The strong effect of very small amounts of SO3 in inducing
carbon deposition may be ascribed to its action in initiating the polymerization of
the hydrocarbons either by direct formation of free radicals from the hydrocarbon
or by indirect formation of these radicals via a peroxide.

One of us (A. G. G.) is grateful to the Royal Society for the award of a Warren
Research Fellowship, and our thanks are due to Sir Alfred Egerton, Dr D. T. A.
Townend and Dr D. H. Bangham for their interest in and helpful criticism of the
work.


References

Bonhoeffer, K. B. & Harteck, P. 1928 Z. phys. Chem. 139, 64.

Brinsley, F. & Stephens, S. 1946 Nature, 157, 622.

Dooley, A. & Whittingham, G. 1946 Trans. Faraday Soc. 42, 364.

Emeléus, H. J. 1926 J. Chem. Soc. p. 2948.

Fowler, A. & Vaidya, W. M. 1931 Proc. Roy. Soc. A, 132, 310.

Gaydon, A. G. 1934 Proc. Roy. Soc. A, 146, 910.

Gaydon, A. G. 1944 Proc. Roy. Soc. A, 183, 111.

Gaydon, A. G. 1947 Dissociation energies. London: Chapman and Hall.

Geib, K. H. & Harteck, P. 1934 Trans. Faraday Soc. 30, 134.

Geib, K. H. & Vaidya, W. M. 1941 Proc. Roy. Soc. A, 178, 361.

Hessels, W. J., van Krevelen, D. W. & Waterman, H. I. 1939 J. Soc. Chem. Ind., Lond.,
58, 323.

Hsieh, M. S. & Townend, D. T. A. 1939 J. Chem. Soc. p. 332.

Klemenc, A., Wechsberg, R. & Wagner, G. 1934 Z. Elektrochem. 40, 488.







[Plate, facing p. 324. Strip (e): (inner cone) + SO2, with the CS, OH, SH, S2 and CH bands marked. Strip (f): H2 (outer cone) + SO2, with the OH and S2 bands marked. Wave-length scale 3000 to 4500 A.]























Lenher, S. 1931 J. Amer. Chem. Soc. 53, 3752.

Lenher, S. 1932 J. Amer. Chem. Soc. 54, 1830.

Lepsoe, B. 1940 Industr. Engng Chem. 32, 910.

Lewis, N. & White, J. U. 1939 Phys. Rev. 55, 894.

Smith, E. C. W. 1940 Proc. Roy. Soc. A, 174, 110.

Storch, H. 1934 J. Amer. Chem. Soc. 56, 374.

Storch, H. 1935 J. Amer. Chem. Soc. 57, 2698.

Taylor, H. S. & Jones, W. H. 1930 J. Amer. Chem. Soc. 52, 1111.

Waterman, H. I. & van Vlodrop, C. 1939 J. Soc. Chem. Ind., Lond., 58, 109.

Whittingham, G. 1945 Nature, 156, 207.


Description of Plates 8 and 9

All spectrograms were taken on a small quartz spectrograph and except for strips a, b, c,
d of plate 8, a wide slit was used throughout.

Plate 8

Strip                                                          Exposure   Type of plate
a. Inner cone of Bunsen flame                                  60 min.    Ilford Golden Iso-Zenith
b. Inner cone of Bunsen flame containing 4% SO2                60 min.
c. Luminous coal-gas flame (under-aerated Bunsen)              30 min.    ,,
d. Luminous coal-gas flame (under-aerated Bunsen) + 4% SO2     4 hr.
e. Inner cone of aerated H2 flame containing 2% SO2 and
   burning at a quartz jet                                     20 min.    Zenith
f. Tip of outer cone of aerated H2 + 2% SO2 flame.
   Fe arc comparison                                           16 min.    ,,
g. Flame of CO burning at a quartz jet. Fe arc comparison      J min.
h. Flame of CO + 2% SO2 burning at a quartz jet                IJ min.

Plate 9

Strip                                                          Exposure   Type of plate
a. Inner cone of aerated methane flame                         10 min.    H.P.3
b. Inner cone of aerated methane flame + 2% SO2                10 min.
c. Aerated ethylene flame, inner and outer cones               3 min.
d. Aerated ethylene flame, inner and outer cones + 2% SO2      3 min.
e. Inner cone of Bunsen flame containing H2S                   4 min.     Zenith
f. Inner cone of Bunsen flame                                  3 min.     H.P.3
g. Inner cone of Bunsen flame + 0.1% SO3                       3 min.


On the simple group of order 25920 

By J. A. Todd

(Communicated by H. F. Baker, F.R.S. Received 4 June 1946)

Explicit expressions for the operations of the simple group of order 25920 as products of five
or fewer operations of period two are obtained, leading to a determination of the conjugate
sets of operations contained in the group. The results are applied to the determination of the
nature of certain important subgroups, and to some questions relating to the generation of
the group by two operations.


Introduction

The simple group G of order 25920 has formed the subject of a large number of
investigations. It was first considered by Jordan (1870) as the group of even substitutions
of the 45 tritangent planes of a general cubic surface. Jordan proved that
the group was simple, and identified it with the group arising in the problem of
trisection of the periods of quadruply periodic hyperelliptic functions. From this
latter point of view the group was studied at length by Burkhardt, who obtained
various representations of the group by linear transformations. In particular, he
showed (Burkhardt 1890) that the group is simply isomorphic with a collineation
group in space of four dimensions leaving a certain quartic primal invariant, and
obtained a set of four particular operations which generate the group. The primal
in question possesses 45 nodes, which are in (1,1) correspondence with the 45
tritangent planes of a general cubic surface, and form a remarkable configuration.
The geometry of this configuration has been developed by Baker in his recent tract
(Baker 1946). Baker has drawn attention to the importance of 45 particular operations
of the group, which he has termed projections, each of which is a harmonic
inversion having as centre one of the 45 nodes. He proves, among many other interesting
results, that Burkhardt's four generators can all be expressed in terms of
these projections, which accordingly generate the whole group.

The importance of these projections rests on the fact, proved below, that they
form a complete set of conjugate operations in G. It will be proved in the present
paper that every operation of the group can be expressed as a product of five or
fewer of these projections, and every operation of the group will be expressed in such a
form. It will be seen how this method of expressing the operations enables one to
enumerate the various conjugate sets belonging to the group, confirming the results
obtained (from a very different point of view) by Frame (1936); to determine readily
the periods of the different types of operation; to enumerate the operations of
certain important subgroups of G; and to verify results obtained by Coxeter,
Brahana and Frame (referred to in § 8 below) concerning various simple ways in
which G can be generated by a small number of operations, and to obtain other
results of a similar nature.



1. Description of a fundamental configuration in four dimensions

This paper is concerned with the group G of collineations in four dimensions which
leaves invariant a certain configuration of 45 points, and it is the purpose of the
present section to describe this figure, and certain subsidiary figures contained in it,
in sufficient detail to make the rest of the paper intelligible. Proofs of all the statements
made here will be found in Baker's tract (Baker 1946), or may easily be
deduced from Baker's results.

Take a coordinate system in $S_4$ in which a point is specified by the ratios of six
numbers $x_1, x_2, \ldots, x_6$ whose sum is zero, and consider the 30 + 15 points (ij, kl, mn),
(ij) defined, respectively, by

$$x_i = x_j = 1, \quad x_k = x_l = \epsilon, \quad x_m = x_n = \epsilon^2,$$

and

$$x_i = 1, \quad x_j = -1, \quad x_k = x_l = x_m = x_n = 0,$$

where (ijklmn) is a permutation of (123456) and $\epsilon$ is a primitive cube root of unity.
It is easily seen that

$$(ij, kl, mn) = (ji, kl, mn) = (kl, mn, ij) = \text{etc.}, \quad (ij) = (ji).$$

These 45 points are the nodes of the quartic primal whose equation is

$$\sum x_i x_j x_k x_l = 0,$$

the summation extending over the 15 sets of four taken from among the suffixes
1, 2, ..., 6; and it will be convenient to refer to them throughout the paper as nodes,
although explicit reference to the primal will not be made in the sequel.
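
The statement that the 45 points are nodes of the quartic primal can be checked numerically. The sketch below is an illustration in Python and is not part of the paper. All function names are hypothetical, floating-point complex arithmetic is used in place of exact arithmetic, and a point of the prime $\sum x_i = 0$ is treated as a node when the quartic vanishes there and its gradient is proportional to (1, 1, 1, 1, 1, 1), which is the usual Lagrange condition for a singular point of a section.

```python
import cmath
from itertools import combinations, permutations

EPS = cmath.exp(2j * cmath.pi / 3)  # a primitive cube root of unity
TOL = 1e-9


def build_nodes():
    """Coordinates of the 15 points (ij) and the 30 points (ij, kl, mn)."""
    nodes = []
    for i, j in combinations(range(6), 2):
        x = [0j] * 6
        x[i], x[j] = 1 + 0j, -1 + 0j
        nodes.append(tuple(x))
    seen = set()
    for perm in permutations(range(6)):
        pairs = tuple(tuple(sorted(perm[k:k + 2])) for k in (0, 2, 4))
        # (ij, kl, mn) = (kl, mn, ij): keep one cyclic rotation of each symbol.
        if pairs[0] != min(pairs) or pairs in seen:
            continue
        seen.add(pairs)
        x = [0j] * 6
        for pair, value in zip(pairs, (1 + 0j, EPS, EPS ** 2)):
            for index in pair:
                x[index] = value
        nodes.append(tuple(x))
    return nodes


def quartic(x):
    """Sum of the products of the coordinates taken four at a time."""
    return sum(x[i] * x[j] * x[k] * x[l]
               for i, j, k, l in combinations(range(6), 4))


def gradient(x):
    """Partial derivatives of the quartic: products of the others, three at a time."""
    grad = []
    for r in range(6):
        others = [c for c in range(6) if c != r]
        grad.append(sum(x[i] * x[j] * x[k]
                        for i, j, k in combinations(others, 3)))
    return grad


def is_node(x):
    """True when x lies in the prime sum(x) = 0 and is a singular point there."""
    g = gradient(x)
    return (abs(sum(x)) < TOL and abs(quartic(x)) < TOL
            and all(abs(c - g[0]) < TOL for c in g))


if __name__ == "__main__":
    nodes = build_nodes()
    print(len(nodes), all(is_node(x) for x in nodes))
```

For contrast, the point (2, -1, -1, 0, 0, 0) lies on the primal but fails the gradient test, so it is an ordinary point.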

The group G of collineations, which keeps the set of nodes as a whole fixed, permutes
the nodes transitively (that is, it contains operations which change any given node
into any other), and it also permutes transitively the different configurations derived
from these nodes which will now be described.

With each node is associated a definite prime, which will be called the polar prime
of the node. The polar primes of the nodes (ij, kl, mn), (ij) have respective equations

$$(x_i + x_j) + \epsilon^2 (x_k + x_l) + \epsilon (x_m + x_n) = 0, \quad x_i - x_j = 0.$$

The 45 primes so arising will be called, following Baker, Jordan primes.

With respect to any given one of the 45 nodes, A say, the remaining nodes fall
into two sets, comprising 12 and 32 nodes respectively. The 12 nodes in the former
of these two sets lie in the polar prime of A, and the line joining any one of these to
A contains no further node. Such lines will be called e-lines. There are ½ · 45 · 12 = 270
e-lines, each containing two of the nodes, 12 of these lines passing through each
node. The 32 nodes which do not lie on e-lines through A lie in pairs on 16 lines
through A. Such lines, which are called κ-lines, each contain three nodes, and there
are ⅓ · 45 · 16 = 240 of them, sixteen passing through each node. The polar primes of
three nodes lying on a κ-line meet in a plane containing a second κ-line; two such
κ-lines are symmetrically related and either is called the polar κ-line of the other.
A pair of polar κ-lines consists, necessarily, of two skew lines, and the nine lines
joining a node on one κ-line to a node on its polar are all e-lines.
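
The counts of 270 e-lines and 240 κ-lines can be confirmed by direct enumeration. The following Python sketch is an illustration only, with hypothetical function names. It assumes that a node B lies in the polar prime of a node A exactly when the sum of conj(a_i) b_i vanishes, which is how the polar-prime equations read when written out in coordinates, and it tests collinearity by requiring every 3 x 3 minor of the coordinate matrix to vanish.

```python
import cmath
from itertools import combinations, permutations

EPS = cmath.exp(2j * cmath.pi / 3)  # a primitive cube root of unity
TOL = 1e-9


def build_nodes():
    """Coordinates of the 15 nodes (ij) and the 30 nodes (ij, kl, mn)."""
    nodes = []
    for i, j in combinations(range(6), 2):
        x = [0j] * 6
        x[i], x[j] = 1 + 0j, -1 + 0j
        nodes.append(tuple(x))
    seen = set()
    for perm in permutations(range(6)):
        pairs = tuple(tuple(sorted(perm[k:k + 2])) for k in (0, 2, 4))
        if pairs[0] != min(pairs) or pairs in seen:
            continue  # keep one cyclic rotation of each symbol
        seen.add(pairs)
        x = [0j] * 6
        for pair, value in zip(pairs, (1 + 0j, EPS, EPS ** 2)):
            for index in pair:
                x[index] = value
        nodes.append(tuple(x))
    return nodes


def hermitian(a, b):
    """Value at b of the linear form defining the polar prime of a."""
    return sum(p.conjugate() * q for p, q in zip(a, b))


def det3(m):
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def collinear(u, v, w):
    """Three points are collinear when every 3 x 3 minor vanishes."""
    return all(abs(det3([(p[i], p[j], p[k]) for p in (u, v, w)])) < TOL
               for i, j, k in combinations(range(6), 3))


def count_lines(nodes):
    """Return (e_lines, kappa_lines) as lists of node-index tuples."""
    size = len(nodes)
    e_lines = [(a, b) for a, b in combinations(range(size), 2)
               if abs(hermitian(nodes[a], nodes[b])) < TOL]
    kappa_lines = [t for t in combinations(range(size), 3)
                   if collinear(*(nodes[k] for k in t))]
    return e_lines, kappa_lines


if __name__ == "__main__":
    e_lines, kappa_lines = count_lines(build_nodes())
    print(len(e_lines), "e-lines;", len(kappa_lines), "kappa-lines")
```

The enumeration also allows the per-node figures to be checked: each node index occurs in 12 of the e-line pairs and in 16 of the κ-line triples.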

It will be important in what follows to be able to recognize whether two nodes
whose symbols are given are joined by an e-line or a κ-line, and in the latter case to
determine the third node which lies on the κ-line. To this end the sets of three nodes
which lie on κ-lines are listed; all such sets have one of the three forms

(ij), (jk), (ki);

(ij), (ik, jl, mn), (jk, il, mn);

(ij, kl, mn), (il, kn, mj), (in, kj, ml).

It will be noticed, in particular, that the 12 nodes lying in the polar prime of (ij)
are those, other than (ij), whose symbols are unaltered by the interchange of
i and j, while pairs of nodes collinear with (ij) are such that their symbols are interchanged
when i and j are interchanged.

There are planes, of four distinct types, each containing three or more nodes which
are not collinear, as follows:

(i) There are 270 planes, which will be called f-planes, each containing three of
the nodes and the three lines, which are e-lines, joining them in pairs. Each of these
planes is the intersection of the polar primes of two other nodes, uniquely determined
by the plane, whose join is an e-line. These nodes, together with the three
that lie in the plane, form a symmetrical set of five nodes such that the polar prime of
any one of them contains the other four; the join of any two of these nodes is an
e-line and the plane containing any three is an f-plane. Such a set of five nodes is
called a Jordan pentahedron, and there are 27 such, three of which contain any given
node. In particular, the five nodes

(ij), (kl), (mn), (ij, kl, mn), (ij, mn, kl)

form a Jordan pentahedron, a fact which will be used in the sequel.

(ii) There are 720 planes, called d-planes, each of which joins a κ-line to a node
lying on its polar κ-line. Each d-plane contains four nodes, three e-lines and one
κ-line.

(iii) There are 540 planes, called cross-planes or c-planes, each containing six
nodes, which are the vertices of a quadrilateral formed by κ-lines. Each c-plane
contains six nodes, three e-lines (the diagonals of the quadrilateral) and four κ-lines.

(iv) There are 40 planes, called Jacobian planes or j-planes, each of which contains
nine of the nodes, and twelve κ-lines. The nine nodes form the well-known configuration
formed by the inflexions of a pencil of plane cubic curves; the join of any two
of them is a κ-line and contains a third node, and the four κ-lines which pass through
any one of the nodes form an equianharmonic pencil. The 12 κ-lines can be divided
into four sets of three, each set forming a triangle whose sides contain all the nine
nodes.




There are likewise primes, of four types, each containing four or more nodes which
are not coplanar, as follows:

(i) It can easily be shown that each cross-plane lies in a definite Jordan prime, and
that the polar κ-lines of the four κ-lines which lie in the plane pass through the pole
of the prime (i.e. the node whose polar prime is the Jordan prime in question).
The 540 primes, called x-primes, which join the cross-planes to the poles of the
Jordan primes in which they lie, each contain seven nodes, nine e-lines, four κ-lines,
three f-planes, four d-planes and one c-plane.

(ii) There are 216 primes, called π-primes, each containing ten nodes and ten
κ-lines which are the vertices and edges of a pentahedron of cross-planes. Each
π-prime contains 10 nodes, 15 e-lines, 10 κ-lines, 10 d-planes and five c-planes. The
prime containing the ten nodes

(12), (13), (14), (15), (23), (24), (25), (34), (35), (45)

is an example.

(iii) The 45 Jordan primes already mentioned each contain 12 nodes, which fall
into three sets of four, forming a triad of desmic tetrahedra, any two of which are in
perspective from each vertex of the third. Each of these tetrahedra, together with
the pole of the prime, forms a Jordan pentahedron. Each Jordan prime contains
12 nodes, 18 e-lines, 16 κ-lines, 12 f-planes and 12 c-planes.

(iv) There are 40 Steiner primes, each of which contains four Jacobian planes
which form a tetrahedron whose pairs of opposite edges are polar κ-lines; each edge
of the tetrahedron thus contains three nodes. A Steiner prime contains 18 nodes,
27 e-lines, 42 κ-lines, 18 d-planes, 27 c-planes and four j-planes.

Elementary combinatorial considerations show that the lines, planes and primes 
just enumerated comprise all the linear spaces defined by subsets of the 45 nodes. 
Some further properties of the configuration are added which will be of use later. 

There are four Steiner primes containing any given Jacobian plane. Each of the
four triangles formed from three κ-lines in the plane which contain between them the
nine nodes is a face of the tetrahedron of Jacobian planes in one of these primes.
And any prime which contains a Jacobian plane and a node not lying in this is a
Steiner prime. If Σ is a given Steiner prime, there are 12 other Steiner primes whose
intersection with Σ is a Jacobian plane. The remaining 27 Steiner primes meet Σ
in cross-planes, one such prime passing through each of the cross-planes in Σ. And
the 27 Jordan primes which contain the cross-planes of Σ are all distinct, and are the
polar primes of the 27 nodes which do not lie in Σ.

From the 27 Jordan pentahedra it is possible, in 72 ways, to select a set of six
pentahedra whose vertices are all distinct. Each such set determines uniquely a
second set whose pentahedra contain, collectively, the same 30 vertices. Thus there
are 36 double-sixes of pentahedra. The 15 nodes which are not included among the
vertices of a double-six are the vertices of a hexahedron of π-primes, and the (ten)
pairs of opposite edges of the hexahedron are polar κ-lines. Such a set of nodes, of
which there are 36, may be called a 15-set, a simple example being provided by the
15 nodes (ij).





2. The simple group G and the projections which generate it

It is known (Coble 1906, p. 366) that it is possible to set up a (1,1) correspondence
between the 45 nodes of the figure described in § 1 and the 45 tritangent planes of a
general cubic surface in such a way that the tritangent planes corresponding to two
nodes A, B meet in a line of the cubic surface if AB is an e-line and have no line of
the surface in common if AB is a κ-line. It follows that the group G* of automorphisms
of the 45 nodes (that is, the group of permutations of the nodes among
themselves which preserve incidence relations) is simply isomorphic with the group
of automorphisms of the tritangent planes. This group was studied by Jordan, who
showed (Jordan 1870) that it contains a self-conjugate subgroup G which is simple
and of order 25920; this group consists, in fact, of the group of automorphisms which
effect an even permutation of the 45 tritangent planes (Dickson 1901, p. 306). I
have shown elsewhere (Todd 1936, p. 173) that every operation of G* corresponds
to a collineation or anticollineation in $S_4$, which transforms the set of nodes into
itself. The subgroup formed by the collineations belonging to G* is of index 2, and
it is clearly self-conjugate. Since G* is known to contain only one such subgroup it
follows that the group of collineations, of order 25920, is simply isomorphic with G
(and will in future be denoted by G).

Baker has shown in his tract (Baker 1946, § 7) that the configuration is left unaltered
by the harmonic inversion with respect to any node A and its polar prime.
This harmonic inversion, which will be denoted by p(A) and called the projection
from the node A, is an operation of G which leaves unaltered the node A and the
12 nodes in the polar prime of A, and which interchanges in pairs the two nodes
other than A on each of the 16 κ-lines which pass through A. Thus p(A) effects an
even permutation of the 45 nodes.

Suppose, now, that B, C, D are further nodes such that B and C lie on a κ-line
through A, and D lies in the polar prime of A. Let T be any operation of G and let
A', B', C', D' be the nodes into which A, B, C, D are respectively transformed by T.
Then, since T is an automorphism of the configuration, A'B'C' is a κ-line and D' lies
in the polar prime of A'. Let T' = T p(A) T^{-1} be the transform of p(A) by T. Then,
since p(A) leaves A and D unaltered and interchanges B and C,

    T'[A'] = T p(A) T^{-1}[A'] = T p(A)[A] = T[A] = A',

    T'[B'] = T p(A) T^{-1}[B'] = T p(A)[B] = T[C] = C',

    T'[C'] = T p(A) T^{-1}[C'] = T p(A)[C] = T[B] = B',

    T'[D'] = T p(A) T^{-1}[D'] = T p(A)[D] = T[D] = D',

so that T' keeps fixed A' and every node in its polar prime, and interchanges the two
nodes, other than A', on each κ-line through A'. Thus T' is the projection p(A'), for
distinct operations of G effect distinct permutations of the nodes. It has thus been
proved that

    T p(A) T^{-1} = p(A'), where A' = T[A].





On the simple group of order 25920 

Thus any operation of G which is conjugate to a projection is itself a projection.
And, since G is transitive, there exist operations of G which transform A into any
one of the other nodes. Thus all the 45 projections are conjugate. Hence the 45 projections
form a complete conjugate set in G. The subgroup generated by them is thus
self-conjugate in G, and hence, as G is simple, coincides with G itself. Thus Baker’s
statement that G is generated by projections is verified; and since each projection
effects an even permutation of the nodes the same will be true for each operation of
G. Observing, further, that the anticollineation keeps fixed the 15 nodes (ij)
and interchanges the 15 pairs of nodes such as (ij, kl, mn), (ij, mn, kl), thus effecting
an odd permutation of the nodes, it is seen at once that every anticollineation in G*
(which is the product of the one considered with a collineation of G) effects an odd
permutation of the nodes, so that every even automorphism is determined by a
collineation.

The fundamental relations connecting the various projections are given by Baker
(1946, § 7) and may be expressed in the following form (in which the identical
operation of G is denoted by 1):

    [p(A)]^2 = 1                              for each node A,
    p(A)p(B) = p(B)p(A)                       if AB is an e-line,
    p(A)p(B) = p(B)p(C) = p(C)p(A)            if ABC is a κ-line,
    p(A)p(B)p(C)p(D)p(E) = 1                  if ABCDE is a Jordan pentahedron.     (1)

I propose to show how, by means of these relations, every operation of G can be
expressed as a product of five or fewer projections, to classify these operations and
obtain their periods, and to determine the various conjugate sets contained in G.
It will be seen that there are fifteen types of operation, the operations of any one
type all being conjugate in the group of automorphisms G*, and that in the case of
five of these types the operations form two conjugate sets in G. For purposes of
reference, these types will be denoted by Roman numerals, and I shall commence
with two types whose existence is already clear, namely,

I. The identical operation.

II. The 45 projections, each of period 2, which form a conjugate set.

In determining the arrangement of the operations of G in conjugate sets frequent
use will be made of the principle that if a set of nodes A_1, ..., A_k is transformed by
an operation T of G into a set A'_1, ..., A'_k, then the operations p(A_1)...p(A_k) and
p(A'_1)...p(A'_k), products of the projections from these sets of nodes, are conjugate,
a result which is immediately evident if one observes that

    T p(A_1)...p(A_k) T^{-1} = T p(A_1) T^{-1} ... T p(A_k) T^{-1}

                             = p(A'_1)...p(A'_k),

by the result proved above. Two sets of nodes A_1, ..., A_k and A'_1, ..., A'_k related in
this way will be called equivalent with respect to G, and the aggregate of operations
conjugate to a product of projections p(A_1)...p(A_k) consists of the products
of projections from all sets of nodes equivalent to A_1, ..., A_k.





In the course of subsequent work it will frequently be necessary to transform
products of projections, usually with the aim of simplifying the product under
discussion or of reducing it to some standard type. The transformations are based
entirely on equations (1). But one particular type of transformation is of such
frequent occurrence that it is described explicitly here. From (1) it follows that any
product p(A)p(B) of two projections can be expressed in each of the forms
p(B)p(A') or p(B')p(A), where, if AB is an e-line, A' = A and B' = B, while if AB
is a κ-line, A' = B' = C, where C is the remaining node of the κ-line. It follows
that in any product of projections any given factor p(A) may be moved one or more
places to the right (or left) without altering the number of projections in the product
and without altering any factor which is not passed over by p(A). More generally
two or more factors may be moved about in the same way (with consequent changes
only in the factors passed over) so long as they retain their original order. Any
transformation of this type will be referred to as an elementary transformation of the
product, and it will be assumed that such transformations will be recognized as
such by the reader after a few examples have been given.† The other transformations
involve either the cancelling of two consecutive factors p(A), p(A), which calls for
no comment, or the replacement of the product of projections from a set of nodes
belonging to a Jordan pentahedron by the product of projections from the remaining
nodes of the pentahedron. When this is likely to cause any difficulty to the reader
a footnote has been added by way of explanation.

By way of example two elementary results are proved. Calling a product of
projections reducible if it can be expressed as a product involving fewer factors,
the following theorems are proved:

(i) Any product which contains a repeated factor is reducible.

(ii) Any product which contains three distinct factors p(A), p(B), p(C), not
necessarily consecutive, is reducible if either ABC is a κ-line or ABC is an f-plane.

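To prove (i) it is sufficient to observe that by elementary transformations the
two occurrences of a repeated factor p(A) can be brought into consecutive positions,
and that, since p(A) is of period 2, these two factors can then be cancelled.
To prove (ii), observe that elementary transformations will reduce the product to one
in which the factors p(A), p(B), p(C) are consecutive. If ABC is a κ-line, then, by (1),
p(A)p(B)p(C) = p(A)p(A)p(B) = p(B); while if ABC is an f-plane, so that A, B, C
belong to a Jordan pentahedron ABCDE, then, by the last equation of (1),
p(A)p(B)p(C) may be replaced by p(E)p(D), and in each case the number of factors
is reduced.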

† The list of types of κ-line in § 1 will be found useful here.

3. Products of two projections

There are two cases to be distinguished, according as AB is an e-line or a κ-line.

If AB is an e-line, equations (1) show that p(A) and p(B) commute, so that from
each e-line a single product p(A)p(B) of this type arises, and this operation is of
period 2. Thus p(A) and p(B) generate an Abelian group of order 4, direct product
of the cyclic groups generated by p(A), p(B). And it is easy to see that p(A)p(B) is
an axial homology having for invariant spaces the line AB and the opposite plane
CDE of the Jordan pentahedron of which AB is an edge. Thus p(A)p(B) leaves the
nodes A, B, C, D, E unchanged, and only these nodes. Whence it is easily seen that
the operations of this type arising from different e-lines are distinct. And since G
permutes the e-lines transitively, every such operation is conjugate to p(A)p(B).
Hence, since there are 270 e-lines, G contains

III. A conjugate set of 270 operations of period 2, products of pairs of projections 
from nodes lying on an e-line. 

If AB is a κ-line, and if C is the third node lying on AB, then, from (1),

    p(A)p(B) = p(B)p(C) = p(C)p(A),

and so

    [p(A)p(B)]^3 = p(A)p(B).p(B)p(C).p(C)p(A) = 1,

so that p(A)p(B) is of period 3. Its square (or inverse) is p(B)p(A), and there are
just two products of pairs of projections from nodes on ABC which are distinct; the
projections p(A), p(B), p(C) generate a group of order 6, which, as the reader will
easily verify, is simply isomorphic with the symmetric group of degree 3 (and in fact
permutes the nodes A, B, C according to the operations of this group). The operation
p(A)p(B) keeps fixed the three nodes of the polar κ-line of ABC, and no other node.
Hence the operations arising from distinct κ-lines are distinct. Since G is transitive
on the κ-lines, any operation of this type is conjugate to one of the two operations
p(A)p(B), p(B)p(A). And these are clearly conjugate, since each is transformed into
the other by p(A), p(B) or p(C). Thus, since there are 240 κ-lines, G contains

IV. A conjugate set of 480 operations of period 3, each the product of projections
from two nodes of a κ-line.


4. Products of projections from coplanar sets of nodes

4.1. Next consider the products of projections from sets of coplanar nodes. In
examining such products one may, by the theorems proved at the end of § 2,
ignore products containing repeated factors or products involving projections from
three collinear nodes. There are four cases to consider, since there are four types of
plane containing three or more non-collinear nodes. In the case of the f-plane or
the d-plane the results are so easily seen that they are stated without proof, their
verification being left to the reader.

No new type of operation arises from an f-plane, since the product of the three
projections from the nodes lying in it is reducible (§ 2). The group generated by the
projections is an Abelian group of order 8, direct product of three cyclic subgroups
of order 2.





In the case of a d-plane, joining a κ-line ABC to a node P on its polar κ-line, it is
easily seen that the projections from A, B, C, P generate a group of order 12, the
direct product of the group of order 6 generated by p(A), p(B), p(C) and the cyclic
group generated by p(P), the projection p(P) commuting with every operation of
this group. There are just two operations of a new type, namely, p(A)p(B)p(P) and
its inverse p(B)p(A)p(P), and these operations are conjugate, being transformed
into each other by p(A), p(B) or p(C). The operation p(A)p(B)p(P) is of period 6,
its square being p(B)p(A) and its cube p(P), so that different d-planes give different
operations. Thus, since there are 720 d-planes, G contains

V. A conjugate set of 1440 operations of period 6, products of three projections,
whose squares are of type IV and whose cubes are of type II.

4.2. Next consider the operations derived from sets of nodes in a c-plane. Such a
plane, as was seen in § 1, contains six nodes forming the vertices of a complete
quadrilateral, whose sides are κ-lines and whose diagonals are e-lines. Denote these
nodes by A, B, C, A', B', C' in such a way that the sides of the quadrilateral are
A'BC, AB'C, ABC', A'B'C' and the diagonals are AA', BB', CC'.

It can be shown, in a variety of ways, that the projections from the six nodes in
a c-plane generate a group of order 24, simply isomorphic with the symmetric group
of degree 4. Thus, for instance, one such plane contains the nodes (12), (13), (14), (23),
(24), (34); and, since the effect of p(ij) on the nodes is represented by the interchange
of i and j in the symbols of the nodes, the operations of the group generated by the
six projections effect the 24 permutations of 1, 2, 3, 4 in the symbols. Alternatively,
since the projection from any node C' on a κ-line A'B'C' is expressible in the form
p(A')p(B')p(A'), the projections from the six nodes A, B, C, A', B', C' can all be
expressed in terms of p(A), p(B), p(A'); and these projections satisfy the relations

    [p(A)]^2 = [p(B)]^2 = [p(A')]^2 = [p(A)p(B)]^3 = [p(B)p(A')]^3 = [p(A)p(A')]^2 = 1,

which are defining relations for the symmetric group of degree 4 (Moore 1897; or
Burnside 1911, p. 464).

Of the 24 operations of this group all but six belong to types already considered,
for besides identity and six projections it is seen that there are three operations of
type III arising from the three diagonals of the quadrilateral and eight operations of
type IV, two arising from each of the four κ-lines. The remaining six operations may
thus be identified with the six operations of period 4 in the symmetric group. Each
of these operations can in fact be expressed, in a variety of ways, as the product
of three projections. I shall content myself by showing, directly from the geometry
of the figure, that they are in fact obtained by permuting A, B, C in all possible ways
in the product p(A)p(B)p(C). It is an easy exercise in the use of elementary
transformations to express such a product in terms of any other set of three projections
from nodes whose joins are three of the four κ-lines in the plane. Observe, first, that
p(A)p(B)p(C) permutes the six nodes according to the permutation (ACA'C')(BB'),
which shows that the six operations of this type are distinct. Further, by elementary
transformations,

    [p(A)p(B)p(C)]^2 = p(A)p(B)p(C).p(A)p(B)p(C)

                     = p(A)p(B)p(C).p(C)p(B')p(A') = p(A)p(B)p(B')p(A')

                     = p(B)p(C')p(C')p(B') = p(B)p(B'),

an operation of type III and period 2. Thus p(A)p(B)p(C) is of period 4. And the
transforms of p(A)p(B)p(C) by p(A) and p(A') are easily seen to be p(B)p(C)p(A)
and p(A)p(C)p(B). From this it follows that the six operations in question are all
conjugate in G. Hence, as G permutes the cross-planes transitively, the operations
of this type form a single conjugate set in G. Now the operations arising from
different cross-planes are necessarily distinct. For suppose, if possible, that A_1B_1...C'_1
is a second cross-plane, the arrangement of the nodes on κ-lines being the same as in
the plane AB...C' (with the suffix 1 of course), and that

    p(A_1)p(B_1)p(C_1) = p(A)p(B)p(C).

Then the squares of these operations, namely, p(B_1)p(B'_1) and p(B)p(B'), are
equal. Hence the e-lines B_1B'_1 and BB' are the same, and B_1, B'_1 coincide in some
order with B and B'. Since

    p(A_1)p(B_1)p(C_1) = p(A_1)p(C_1)p(A'_1) = p(C_1)p(B'_1)p(A'_1),

it may be assumed, without loss of generality, that B_1 = B, B'_1 = B'. But then,
since

    p(A_1)p(B_1)p(C_1) = p(A_1)p(A'_1)p(B_1),    p(A)p(B)p(C) = p(A)p(A')p(B),

it is seen that p(A_1)p(A'_1) = p(A)p(A'). Thus A_1, A'_1 coincide in some order with
A, A' and the two planes must be identical. Hence, since each of the 540 c-planes
gives rise to six operations of this kind, G contains

VI. A conjugate set of 3240 operations of period 4, products of projections from three
nodes of a cross-plane, whose squares are of type III.
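The symmetric-group description of a cross-plane lends itself to a quick mechanical check. The following Python sketch is illustrative only: it assumes the labelling A = (12), A' = (34), B = (13), B' = (24), C = (14), C' = (23) (one choice compatible with the sides A'BC, AB'C, ABC', A'B'C'; the text does not fix it), and it models p(ij) as the transposition of the symbols i and j, with products read from right to left as in § 2. The helper names are hypothetical.

```python
SYMBOLS = (1, 2, 3, 4)
IDENTITY = {s: s for s in SYMBOLS}

def transposition(i, j):
    """Permutation of the symbols interchanging i and j (models p(ij))."""
    perm = dict(IDENTITY)
    perm[i], perm[j] = j, i
    return perm

def compose(f, g):
    """Product fg: g acts first, then f."""
    return {s: f[g[s]] for s in SYMBOLS}

def key(perm):
    return tuple(perm[s] for s in SYMBOLS)

def period(perm):
    n, power = 1, perm
    while key(power) != key(IDENTITY):
        power, n = compose(power, perm), n + 1
    return n

# Assumed labelling of the six nodes of the cross-plane (not fixed by the text).
NODES = {"A": (1, 2), "A'": (3, 4), "B": (1, 3),
         "B'": (2, 4), "C": (1, 4), "C'": (2, 3)}
P = {name: transposition(*ij) for name, ij in NODES.items()}

def image_of_node(perm, name):
    """Name of the node into which `name` is carried by `perm`."""
    target = frozenset(perm[s] for s in NODES[name])
    return next(n for n, ij in NODES.items() if frozenset(ij) == target)

def generated_group(generators):
    """All products of the generators, found by closing up under multiplication."""
    seen = {key(IDENTITY): IDENTITY}
    frontier = [IDENTITY]
    while frontier:
        new = []
        for g in frontier:
            for h in generators:
                gh = compose(g, h)
                if key(gh) not in seen:
                    seen[key(gh)] = gh
                    new.append(gh)
        frontier = new
    return list(seen.values())

# x = p(A)p(B)p(C), the typical operation of type VI.
x = compose(P["A"], compose(P["B"], P["C"]))
node_permutation = {name: image_of_node(x, name) for name in NODES}
group = generated_group([P["A"], P["B"], P["A'"]])
period_four = [g for g in group if period(g) == 4]
```

In this model `node_permutation` reproduces the cycle structure (ACA'C')(BB'), the square of `x` agrees with p(B)p(B'), and the three generators p(A), p(B), p(A') already yield all 24 permutations, six of which have period 4.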

4.3. The situation arising in the case of a j-plane is not quite so simple. The nine
nodes belonging to a j-plane lie by threes on 12 κ-lines, and the join of any two of
these nodes passes through a third. It is familiar that the arrangement of such a
set of nine points in triads of collinear points may be represented by the scheme

    1 2 3
    4 5 6
    7 8 9

in which 1, 2, ..., 9 denote the nine points, and the triads of collinear points are given
by the rows, columns, diagonals and broken diagonals in the array. The nine nodes
of a Jacobian plane will be denoted by 1, 2, ..., 9 in the present section, and from the
array just given it is seen that the 12 κ-lines are

    123, 456, 789, 147, 258, 369, 159, 267, 348, 168, 249, 357.

It will also be convenient to denote the projection from the node i by {i}, the product
{i}{j} by {ij}, and so on.

Any irreducible product of three projections from nodes in the plane has the
form {ijk}, where i, j, k are three distinct nodes not lying on a κ-line. There are
9 · 8 · 6 = 432 such products, and it will be proved that these give rise to 18 distinct
operations of G, each product being expressible in 24 equivalent forms. In fact,
taking a typical product {124} and using the list of κ-lines just given, one can verify
without difficulty that the 24 products

    {124}, {234}, {314}, {479}, {512}, {719}, {837}, {928},

    {149}, {248}, {347}, {498}, {523}, {795}, {875}, {985},

    {192}, {283}, {371}, {487}, {531}, {751}, {853}, {952},

are all equal. Moreover, {124} and its inverse {421} keep fixed the node 6 and interchange
in pairs the nodes 1, 8; 2, 7; 3, 9; 4, 5 which lie on κ-lines through 6. Hence
the 432 products give rise to nine pairs of inverse operations such as {124} and {421},
each pair keeping fixed a definite one of the nine nodes and interchanging in pairs the
nodes lying on κ-lines through this. It remains to prove that {124} and {421} are
distinct. Now

    {124}^3 = {124}{479}{928} = {12728} = {168} = {688} = {6},

so that {124}^3 is the projection {6}. And if {124} = {421} then {124}^2 = 1, so that
{124} = {124}^3 = {6} and hence {12} = {64}, in contradiction with the result proved
in § 3 that products of projections from two nodes of different κ-lines are distinct.
Thus there are precisely 18 operations such as {ijk} which are distinct, and each of
these is an operation of period 6.
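The equivalent forms of {124} can be generated mechanically from the κ-line relation {ab} = {bc} = {ca}, where c is the third node of the κ-line ab. The sketch below is an illustration under stated assumptions, not the author's procedure: it takes the twelve κ-lines read off from the 3 × 3 array, treats a product as a string of node labels, and closes {124} up under elementary transformations of adjacent pairs by a breadth-first search. The function names are hypothetical.

```python
from collections import deque

# The twelve kappa-lines of a Jacobian plane: rows, columns, diagonals and
# broken diagonals of the array 123 / 456 / 789.
LINES = ["123", "456", "789", "147", "258", "369",
         "159", "267", "348", "168", "249", "357"]

def third(a, b):
    """Third node on the kappa-line joining the distinct nodes a and b."""
    for line in LINES:
        if a in line and b in line:
            return next(ch for ch in line if ch not in (a, b))
    raise ValueError(f"no kappa-line through {a} and {b}")

def elementary_neighbours(word):
    """Words obtained by rewriting one adjacent pair {ab} as {bc} or {ca}."""
    for i in range(len(word) - 1):
        a, b = word[i], word[i + 1]
        c = third(a, b)
        for pair in (b + c, c + a):
            yield word[:i] + pair + word[i + 2:]

def equivalent_forms(word):
    """Closure of `word` under elementary transformations (breadth-first)."""
    seen = {word}
    queue = deque([word])
    while queue:
        current = queue.popleft()
        for nxt in elementary_neighbours(current):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen

forms = equivalent_forms("124")
last_factors = {w[-1] for w in forms}
```

Under these assumptions the search stops at 24 words, none of which is the inverse product 421, and the last factor runs over every node except 6. Only the relation for κ-lines is used, since any two nodes of a Jacobian plane are joined by a κ-line.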

Next observe that in the 24 different expressions of {124} as a product of projections
{ijk} the last factor {k} takes every value except {6}. Thus the only possible
irreducible product of four projections of the form {124l} is {1246}. And since any
set of five or more of the nine nodes includes at least three which are collinear, no
product of more than four projections from the nodes in the plane can be irreducible.
Thus there remain for consideration 18 operations {ijkl}, where {ijk} is one of the
18 operations of period 6 just considered and l is the node left invariant by {ijk}.
It will be proved that in fact all these operations are equal either to {1246} or {4216}.
Considerations of symmetry show that it suffices to consider a single pair of operations
{ijkl}, {kjil}, where l ≠ 6.

Observe, first, that since {124}^3 = {6} and {421} = {124}^{-1}, then

    {1246} = {124}^4 = {421}^2,    {4216} = {124}^2,

so that {1246} and {4216} are inverse operations, each of period 3. It is thus merely
necessary to show that the squares of another typical pair of inverse operations
{ijk}, {kji} coincide with {1246} and {4216}. Taking, for instance, the operations {125},
{521}, which leave the node 4 unchanged, it is seen that

    {125}^2 = {125125} = {125598} = {1298} = {1428} = {4728}

            = {4268} = {4216},

    {521}^2 = {125}^{-2} = {4216}^{-1} = {1246}.

There are thus just two operations of this type, and the square of any one of the
18 operations of period 6 coincides with one of them. Hence the projections from
the nodes of a Jacobian plane generate a group of order 54, comprising identity, nine
projections, 24 operations of type IV and the 18 operations of period 6 and the two
operations of period 3 just obtained.

It may be remarked, in passing, that the operation 8 of period 3 which is one of
Burkhardt’s generating operations, is of the last type discussed, and several expressions
for it as a product of four projections are given on p. 77 of Baker’s tract.

Each of the operations {1246}, {4216} leaves invariant every node in the Jacobian
plane. It can leave no other node unaltered. For considerations of incidence show
that any operation of G which keeps fixed all the nodes in a j-plane and a node P
not lying in this plane keeps fixed each node in the Steiner prime joining the j-plane
to P, so that it keeps fixed each c-plane in this prime and hence the remaining
27 nodes, whose polar primes intersect the Steiner prime in these c-planes; such
an operation of G must therefore be the identity. It follows that the two operations
of period 3 derived from each of the 40 j-planes give 80 distinct operations. And
hence the operations of period 6 derived from different j-planes, whose squares
are operations of period 3 of the type just mentioned, must also be distinct.

It will now be proved that the operations of each of the two types mentioned
form two conjugate sets in G. For this observe, first, that as G is transitive on the
j-planes every operation (of either type) is conjugate to an operation of the same
type associated with any given j-plane. And since the group generated by the projections
{i} permutes the nine nodes i transitively, any operation of period 6 in the plane
is conjugate to one or other of the operations {124}, {421} which leave the node 6
unchanged. Now, if there exists an operation T of G such that T{124}T^{-1} = {421}, then,
also, T{124}^2 T^{-1} = [T{124}T^{-1}]^2 = {421}^2. Thus, if {124} and {421} are conjugate
in G, so also are {1246} and {4216}. It will now be shown that this is not the case.

In fact, if there exists an operation T in G such that T{1246}T^{-1} = {4216}, then
the nine nodes i' = T(i) lie in a Jacobian plane, and the set of four nodes 1'2'4'6'
is projective with 1246 and such that {1'2'4'6'} = {4216}. Since distinct Jacobian
planes give rise to distinct operations it follows that the plane of the nodes i' coincides
with that of the nodes i. Thus there is a projectivity in the Jacobian plane which
keeps the set of nine nodes unaltered and transforms i into i'. Now the group of
projectivities which leave the set of nine nodes unaltered is known to be of order 216,
and the group of automorphisms of the set of nodes is of order 432, and consists of
216 collineations and 216 anticollineations. There are precisely 432 ordered sets of
nodes ijk which are not collinear, and there exists a unique automorphism of the nine
nodes which transforms any one of these sets into any other. Each such triad of
nodes determines uniquely a fourth node l such that {ijkl} leaves each node unaltered.
Thus, if {ijkl} and {i'j'k'l'} are two expressions of this type, there is a unique
automorphism which transforms i, j, k, l respectively into i', j', k', l'. It is not difficult
to see that any two operations of this type which can be transformed into one
another by elementary transformations must be such that the corresponding
automorphism is a collineation. Hence, if {1246} and {4216} are conjugate, there is
a projectivity leaving the set of nodes invariant as a whole which transforms
1', 2', 4', 6' into 4, 2, 1, 6 respectively. Hence, combining this with the projectivity
induced among the nine nodes by T, one obtains a projectivity of the set of nodes
in which 1, 2, 4, 6 correspond respectively to 4, 2, 1, 6. In this projectivity the node 3
on the line 12 must correspond to the node 9 on the line 42. Hence the pencils 6(1243) and
6(4219) = 6(4213) are projective and consequently harmonic, whereas they are
known to be equianharmonic. Hence {1246} and {4216} are not conjugate in G
(although they are conjugate in G*, being transformed into each other by an
anticollineation). So, from the Jacobian planes, one obtains

VII. 80 operations of period 3, products of four projections, forming two conjugate
sets of 40 operations each,

and

VIII. 720 operations of period 6, products of three projections, forming two conjugate
sets of 360 operations each, whose squares are of type VII and whose cubes are of type II.

5. Products of projections from sets of nodes in a prime

5.1. Now consider the products of projections from sets of nodes lying in a prime,
and examine, in turn, the four types of prime described in § 1. The possibilities
presented by an x-prime are soon disposed of. For an x-prime is the join of a cross-plane
to the pole of the Jordan prime which contains it. If, as in § 4.2, the six nodes
of the cross-plane are denoted by A, B, C, A', B', C' and the pole of the Jordan prime
containing it by O, the projection p(O) is easily seen to commute with the projections
from the other six nodes. Hence the group generated by the projections from the
nodes of the prime is of order 48, and is the direct product of the symmetric group
of degree 4 generated by the projections from the nodes in the cross-plane and the
cyclic group of order 2 generated by p(O). And the only operations of this group
which do not clearly belong to types already obtained are the products of p(O) with
the six operations of period 4 (type VI) belonging to the cross-plane. But, in the
notation of § 4.2,

    p(A)p(B)p(C)p(O) = p(B)p(C')p(C)p(O),

and this product is reducible since the plane OCC' is an f-plane. Thus no new type
of operation arises. Incidentally, since G permutes the 540 x-primes transitively,
the group of order 48 just mentioned is just the subgroup of G which keeps the
x-prime fixed. It is easily seen that the 48 operations of this group comprise,
besides identity, 7 operations of type II, 12 of type III, 8 of type IV, 8 of type V,
and 12 of type VI.

The operations derived from the nodes of an n-prime are likewise easily discussed.
For a typical n-prime is that containing the ten nodes (ij) in which neither i nor j
is 6, and since the effect of p(ij) on the nodes is represented by the interchange of
i and j in the symbols of these nodes it follows that the projections from the ten
nodes (ij) generate a group of order 120, simply isomorphic with the symmetric
group of degree 5. And, since there are 216 n-primes permuted transitively by the
operations of G, this group is simply the subgroup of G which leaves the n-prime
invariant. It is easy to see that the only operations of this subgroup which are not
of a type already considered are the 24 operations of period 5 which permute the
symbols 1, 2, 3, 4, 5 in some cyclic order. And since any cyclic permutation (ijklm)
is equivalent to the product of four transpositions, for example, to the product
(ml)(mk)(mj)(mi), it will be seen that each of these operations is expressible as the
product of four projections, the product p(ml)p(mk)p(mj)p(mi) being a typical one.
These 24 operations are conjugate in the symmetric group of degree 5, and hence
are conjugate in G, which contains this symmetric group as a subgroup. Hence,
since G permutes the 216 n-primes transitively, all the operations of this type form
a complete set of conjugates in G. Each of these operations permutes the ten nodes
of the n-prime from which it arises in two cycles of 5, the five nodes in any cycle not
all lying in a plane. But it is easily seen, from the incidence relations of § 1, that any
two n-primes have at least one of the nodes in common. Whence it follows that no
two operations arising from different n-primes can be identical. Thus G contains

IX. A conjugate set of 5184 operations of period 5, each the product of four projections.

The 120 operations which keep the n-prime fixed comprise identity, ten operations
of type II, 15 of type III, 20 of type IV, 20 of type V, 30 of type VI, and 24
of type IX.
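The census of the 120 operations fixing an n-prime can be compared with the cycle types of the symmetric group of degree 5. The Python sketch below is an illustration only. The correspondence between cycle types and the Roman-numeral types is an inference from the stated counts and periods (transpositions for type II, pairs of disjoint transpositions for III, 3-cycles for IV, a 3-cycle with a disjoint transposition for V, 4-cycles for VI, 5-cycles for IX) and is not spelled out in the text. The symbols are numbered 0 to 4 for convenience, and products are read from right to left.

```python
from collections import Counter
from itertools import permutations

N = 5

def transposition(a, b):
    """Permutation of 0..4 (as a tuple) interchanging a and b."""
    perm = list(range(N))
    perm[a], perm[b] = b, a
    return tuple(perm)

def compose(f, g):
    """Product fg: g acts first, then f."""
    return tuple(f[g[s]] for s in range(N))

def cycle_type(perm):
    """Sorted lengths of the non-trivial cycles of perm."""
    seen, lengths = set(), []
    for start in range(N):
        length, k = 0, start
        while k not in seen:
            seen.add(k)
            k = perm[k]
            length += 1
        if length > 1:
            lengths.append(length)
    return tuple(sorted(lengths))

# Assumed dictionary between cycle types and the types of the text.
TYPE_OF = {(): "I", (2,): "II", (2, 2): "III", (3,): "IV",
           (2, 3): "V", (4,): "VI", (5,): "IX"}

census = Counter(TYPE_OF[cycle_type(p)] for p in permutations(range(N)))

# The cyclic permutation (ijklm) as the product (ml)(mk)(mj)(mi).
i, j, k, l, m = range(N)
five_cycle = compose(transposition(m, l),
                     compose(transposition(m, k),
                             compose(transposition(m, j), transposition(m, i))))
```

With these conventions `five_cycle` sends i to j, j to k, k to l, l to m and m to i, and `census` agrees with the tally quoted for the subgroup of order 120.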

5.2. Consider next a Jordan prime Π, whose pole is denoted by O. Then Π
contains 12 nodes forming the vertices of three desmic tetrahedra, any one of which,
together with O, forms a Jordan pentahedron. These tetrahedra will be denoted by
σ_i (i = 1, 2, 3) and the vertices of σ_i by A_i, B_i, C_i, D_i, and symbols such as P_i, Q_i, ...
will be used to refer to unspecified vertices of σ_i. The line joining any vertex of σ_1
to any vertex of σ_2 contains a vertex of σ_3, and the notation may be chosen in such
a way that the vertex of σ_3 is determined by the scheme

             A_2    B_2    C_2    D_2

    A_1      A_3    B_3    C_3    D_3

    B_1      B_3    A_3    D_3    C_3

    C_1      C_3    D_3    A_3    B_3

    D_1      D_3    C_3    B_3    A_3


Any product of projections from a set of non-coplanar nodes involves four factors
at least, and so, if the nodes lie in Π, must contain at least two projections from
nodes of the same tetrahedron. If it contains three such projections it is reducible.
Thus any irreducible product of projections from non-coplanar nodes in Π can be
reduced by elementary transformations to the form p(P_i)p(Q_i)T, where T is a
product of two or more projections from nodes of the tetrahedra σ_j, σ_k distinct from
σ_i. If T contains as factors two projections p(R_j), p(S_k) from nodes of different
tetrahedra, then since the κ-line R_jS_k contains a node belonging to σ_i one can
transform p(P_i)p(Q_i)T into a form involving three projections from nodes of σ_i,
and this product is accordingly reducible. It follows at once that every product of
five or more projections from nodes of Π is reducible, and that any irreducible
product of four projections must be expressible in the form

    p(P_i)p(Q_i)p(R_j)p(S_j),

where P_iQ_i, R_jS_j are non-intersecting edges of distinct tetrahedra σ_i, σ_j. Since the
edges of the tetrahedra are e-lines the projections from P_i, Q_i (or R_j, S_j) commute.
Thus there are essentially 18 · 8 = 144 products of this type to consider. It will be
shown that these determine 12 distinct operations.

Consider, for instance, the typical product p(A_1)p(B_1)p(A_2)p(C_2). Then, by
elementary transformation,

    p(A_1)p(B_1)p(A_2)p(C_2) = p(A_2)p(A_3)p(B_3)p(C_2),

and the latter product can be transformed, in two ways, to give

    p(A_2)p(C_2)p(C_1)p(D_1) and p(A_3)p(B_3)p(B_2)p(C_2).

By means of transformations of this type it is seen that in any product

    p(P_i)p(Q_i)p(R_j)p(S_j)

of the type considered the two tetrahedra σ_i, σ_j to which P_i, Q_i, R_j, S_j belong may be
assigned in advance. Moreover,

    p(A_1)p(B_1)p(A_2)p(C_2) = p(A_1)p(B_1)p(O)p(O)p(A_2)p(C_2)

                             = p(C_1)p(D_1)p(B_2)p(D_2),

since O forms a Jordan pentahedron with each of the tetrahedra σ_i, so that the same
operation arises if each of the edges P_iQ_i, R_jS_j is replaced by the opposite edge of
the tetrahedron to which it belongs. There are thus at most 12 distinct operations of
this type, all expressible in the form p(A_1)p(P_1)p(R_2)p(S_2), where P_1 is a vertex of
σ_1 distinct from A_1 and R_2S_2 is one of the four edges of σ_2 which do not meet A_1P_1.

These operations are easily seen to be distinct, for p(P_1)p(R_2)p(S_2) is an operation
of type VI, associated with the cross-plane P_1R_2S_2 (which does not contain A_1),
interchanging the two nodes of this cross-plane which belong to σ_1, and two distinct
(inverse) operations of this kind arise from each of the six cross-planes in Π which
do not contain A_1. Further, since

    [p(A_1)p(B_1)p(A_2)p(C_2)]^2 = p(A_1)p(B_1)p(A_2)p(C_2).p(A_2)p(C_2)p(C_1)p(D_1)

                                 = p(A_1)p(B_1)p(C_1)p(D_1) = p(O),

each of these operations is of period 4, and operations arising from different Jordan
primes have different squares and so are distinct. By using identities such as

    p(A_2).p(A_1)p(B_1)p(A_2)p(C_2).p(A_2) = p(A_1)p(B_1)p(B_2)p(C_2),

    p(B_1).p(A_1)p(B_1)p(A_2)p(C_2).p(B_1) = p(A_1)p(D_1)p(A_2)p(C_2),

it is easy to see that the 12 operations derived from Π are all conjugate. Hence, as
G is transitive on the Jordan primes, the operations of this type in G form a single
conjugate set, and hence G contains

X. A conjugate set of 540 operations of period 4, products of four projections, whose
squares are of type II.

The group generated by the projections from the nodes in Π may now be determined.
Since p(O) belongs to the group it is necessary to consider not merely the
operations of various types derived from nodes in Π but also the products of these
with p(O), when these products are irreducible. It is not difficult to verify that this
group is of order 192, and that its operations consist of (i) identity, (ii) the projection
p(O), (iii) the 12 projections from nodes in Π, (iv) the 12 products of these with
p(O), which are of type III, (v) the 18 products of projections from two nodes on an
edge of one of the tetrahedra in Π (type III), the product of such an operation by
p(O) being the corresponding operation derived from the opposite edge of the
tetrahedron, (vi) 32 operations of type IV, two arising from each of the 16 κ-lines in
Π, (vii) the 32 products of these with p(O), which are of type V, (viii) 72 operations
of type VI, six arising from each of the 12 c-planes in Π, the product of any one of
which with p(O) is another operation of the same set, (ix) the 12 operations of type X
arising from Π, the product of any one of which with p(O) is the inverse operation.
This group has the cyclic group generated by p(O) as a self-conjugate subgroup,
the two operations of which leave fixed each node in Π, and the nodes of Π are
permuted by the operations of the group of order 192 according to the 96 operations
of the factor group of this cyclic group. The group of order 192 is, as will be seen
presently, a subgroup of the group of order 576 consisting of all the operations of G
which leave Π and its pole O invariant.

5·3. Next consider the products of projections from nodes lying in a Steiner
prime $\Sigma$. Such a prime contains 18 nodes lying by threes on the edges of a tetrahedron
whose faces are Jacobian planes and whose edges form three pairs of polar $\kappa$-lines.
Baker has given, on p. 79 of his tract, explicit equations for the projections from the
18 nodes of one typical Steiner prime. It will be found convenient to denote the nodes
of $\Sigma$ by the symbols used by Baker to denote these projections. Accordingly, the
nodes will be denoted by $X, Y, Z, P, Q, R, U, V, W, X', Y', Z', P', Q', R', U', V', W'$,
the notation being such that $XYZ, X'Y'Z'$; $PQR, P'Q'R'$; $UVW, U'V'W'$ are the
three pairs $x, x'$; $p, p'$; $u, u'$ of opposite edges of the tetrahedron. Reference to Baker's
tract shows that the arrangements of the nodes in the four Jacobian planes are
given by the scheme

    X  Y  Z        X  Y  Z        X' Y' Z'       X' Y' Z'

    P  R  Q        P' R' Q'       P  Q  R        P' Q' R'

    W  V  U        W' V' U'       V' U' W'       V  U  W

where, in each set of nine nodes, the arrangement on $\kappa$-lines is determined, as in
§ 4·3, by the rule that three nodes on a $\kappa$-line occur either all in the same row or
column or else each in a different row and column.

The examination of sets of nodes of $\Sigma$ which give rise to irreducible products of
projections may be carried out by methods similar to those used in the discussion of
the Jordan prime, but this procedure is somewhat laborious and not particularly
interesting, although it was the method by which the types of operation obtained
in this section were originally derived. I shall proceed here in a different manner,
first examining certain products which prove to yield new types of operation, and
then showing that all irreducible types have been discovered by proving that all
the operations of the group generated by the projections from nodes of $\Sigma$ (which is
necessarily a subgroup of the group of operations of $G$ which keep $\Sigma$ fixed, and
proves in fact to coincide with this group) are accounted for by operations of types
already determined.

The first type of product considered is a product of four projections of the form
$T_1T_2$, where $T_1$ is the product of projections from two nodes on an edge of the
tetrahedron and $T_2$ is the product of projections from two nodes on the opposite edge.
Since opposite edges of the tetrahedron are polar $\kappa$-lines, $T_1$ and $T_2$ commute. Hence,
as each $\kappa$-line gives rise to two distinct products of two projections, there are 12
products of this type to consider (not regarding $T_1T_2$ and $T_2T_1$ as distinct). As a
matter of fact these 12 products give rise to six distinct operations, each being
expressible in two essentially different ways. In fact, as is stated by Baker, there are
identities of the form

$$p(Y)p(Z)p(Y')p(Z') = p(V)p(W)p(V')p(W'),$$

which may be proved by noticing that

$$p(Y)p(Z)p(Y')p(Z') = p(Y)p(Z)p(P) \cdot p(P)p(Y')p(Z')$$

$$= p(P)p(U)p(V) \cdot p(W')p(U')p(P)$$

$$= p(U)p(V)p(R) \cdot p(R)p(W')p(U')$$

$$= p(U)p(V)p(W')p(U') = p(V)p(W)p(V')p(W').$$

The typical operation $p(Y)p(Z)p(Y')p(Z')$ is easily seen to keep fixed each node
on $p$, $p'$ and to permute cyclically the nodes on each of the other edges of the
tetrahedron. Whence it is easily seen that there are six distinct operations of this type
associated with $\Sigma$, and that these can be transformed into one another by means of
projections from nodes lying in $\Sigma$, so that they are all conjugate. Moreover, no
$\kappa$-line of $\Sigma$ other than the edges of the tetrahedron, and no c-plane lying in $\Sigma$ (which
contains one node lying on each edge of the tetrahedron) is left unaltered by this
operation. Whence different Steiner primes must give rise to distinct operations.
Since $G$ permutes the Steiner primes transitively, all these operations are conjugate
in $G$. Since $T_1$ and $T_2$ are commuting operations of period 3, $T_1T_2$ has period 3.
Hence $G$ contains

XI. A conjugate set of 240 operations of period 3, products of four projections, each
expressible in two ways as a product of two operations of type IV arising from a pair
of polar $\kappa$-lines.

The next type of product to be considered is of the form $T_1 \cdot p(A)$, where $T_1$ is an
operation of type VIII, product of three projections from nodes in a Jacobian plane
in $\Sigma$ leaving a node $B$ of this plane fixed, and $A$ is a node lying on the edge of the
tetrahedron opposite to that which contains $B$. There are 18 operations such as $T_1$
associated with any one of the Jacobian planes, and three positions possible for $A$
corresponding to any operation $T_1$. Thus each Jacobian plane gives rise to 54 products
of this type. It will be proved that the same set of operations arises from each of the
four Jacobian planes in $\Sigma$, and that the 54 products which arise are all distinct.

Consider the typical operation $T = p(X)p(Y)p(U)p(U')$, where $p(X)p(Y)p(U)$
is an operation of type VIII associated with the Jacobian plane $xpu$ leaving the
node $V$ fixed, and $U'$ lies on the polar $\kappa$-line of the edge $u$ of the tetrahedron which
contains $V$. Then

$$p(X)p(Y)p(U)p(U') = p(X)p(Y)p(U')p(U),$$

$$p(X)p(Y)p(U)p(U') = p(U')p(R')p(P')p(U) = p(R')p(P')p(W')p(U) = p(R')p(P')p(U)p(W'),$$

$$p(X)p(Y)p(U)p(U') = p(U)p(R)p(P)p(U') = p(R)p(P)p(W)p(U') = p(R)p(P)p(U')p(W),$$

and $p(X)p(Y)p(U')$, $p(R')p(P')p(U)$, $p(R)p(P)p(U')$
are operations of type VIII associated respectively with the Jacobian planes
$xp'u'$, $x'p'u$, $x'pu'$, leaving unaltered, respectively, the nodes $V'$, $V$, $V'$. Thus the
operation $T$ arises equally from any one of the Jacobian planes. The operation $T$
is easily seen to permute the nodes in $\Sigma$ according to the permutation

$$(X\,X'\,Y\,Z'\,Z\,Y')\,(P\,R'\,Q\,P'\,R\,Q')\,(U\,W)\,(U'\,W'),$$

so that it leaves $V$ and $V'$ unaltered, interchanges the remaining nodes on the edges
$u$, $u'$, and interchanges the other two pairs of opposite edges of the tetrahedron,
permuting the six nodes on the edges of each pair in a cycle of 6. There are two
distinct operations of this type (inverse to each other) keeping fixed any one of the
27 pairs of nodes such as $V$, $V'$ lying on opposite edges of the tetrahedron. Thus the
54 operations so arising are distinct. Further, $T$ interchanges the Jacobian planes
$(xpu, x'p'u)$ and $(xp'u', x'pu')$ and permutes the three cross-planes of $\Sigma$ which
contain $V$ and $V'$, namely, the planes containing

$$VV'XZ'QQ', \quad VV'ZX'PP', \quad VV'YY'RR',$$

in cyclic order. Whence it is easily seen that two distinct Steiner primes cannot
give rise to the same operation.
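
The permutation quoted for $T$ can be recomputed mechanically from the scheme of the four Jacobian planes. The sketch below is only a check under stated assumptions: it takes the $\kappa$-lines to be the rows, columns and transversals of the four arrays, assumes that a projection interchanges the other two nodes on each $\kappa$-line through its node and fixes the remaining nodes of the prime, and assumes that in a product the rightmost projection acts first. The function names are illustrative and not part of the paper.

```python
from itertools import permutations

# Rows of the scheme for the four Jacobian planes, as read from the text.
SCHEME = [
    ["X Y Z", "P R Q", "W V U"],           # plane xpu
    ["X Y Z", "P' R' Q'", "W' V' U'"],     # plane xp'u'
    ["X' Y' Z'", "P Q R", "V' U' W'"],     # plane x'pu'
    ["X' Y' Z'", "P' Q' R'", "V U W"],     # plane x'p'u
]

def kappa_lines():
    """Rows, columns and transversals of each 3x3 array (rule stated in the text)."""
    lines = set()
    for plane in SCHEME:
        grid = [r.split() for r in plane]
        for i in range(3):
            lines.add(frozenset(grid[i]))                        # same row
            lines.add(frozenset(grid[r][i] for r in range(3)))   # same column
        for cols in permutations(range(3)):                      # different row and column
            lines.add(frozenset(grid[r][cols[r]] for r in range(3)))
    return lines

LINES = kappa_lines()
NODES = sorted(set().union(*LINES))

def projection(a):
    # ASSUMPTION: p(a) swaps the other two nodes on every kappa-line through a.
    perm = {n: n for n in NODES}
    for line in LINES:
        if a in line:
            b, c = line - {a}
            perm[b], perm[c] = c, b
    return perm

def product(*names):
    # ASSUMPTION: in p(A)p(B)... the rightmost projection acts first.
    perm = {n: n for n in NODES}
    for name in reversed(names):
        step = projection(name)
        perm = {n: step[perm[n]] for n in NODES}
    return perm

def from_cycles(cycles):
    perm = {n: n for n in NODES}
    for cyc in cycles:
        for i, n in enumerate(cyc):
            perm[n] = cyc[(i + 1) % len(cyc)]
    return perm

def order(perm):
    k, cur = 1, dict(perm)
    while any(cur[n] != n for n in NODES):
        cur = {n: perm[cur[n]] for n in NODES}
        k += 1
    return k

T = product("X", "Y", "U", "U'")
QUOTED = from_cycles([
    "X X' Y Z' Z Y'".split(),
    "P R' Q P' R Q'".split(),
    "U W".split(),
    "U' W'".split(),
])
print(len(LINES), T == QUOTED, order(T))
```

Under these assumptions the scheme yields 42 $\kappa$-lines, which agrees with the 84 operations of type IV (two per $\kappa$-line) counted for the prime later in this section.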

By elementary transformations it is found that

$$T^2 = p(X)p(Y)p(U)p(U') \cdot p(X)p(Y)p(U)p(U')$$

$$= p(X)p(Y)p(U)p(U') \cdot p(U)p(U')p(X')p(Z') = p(X)p(Y)p(X')p(Z'),$$

an operation of period 3 and type XI, and that

$$T^3 = p(X)p(Y)p(X')p(Z') \cdot p(X)p(Y)p(U)p(U')$$

$$= [p(X)p(Y)]^2 p(X')p(Z')p(U)p(U')$$

$$= p(Y)p(X)p(X')p(Z')p(U)p(U') = p(Y)p(X)p(U)p(U')p(X)p(Y)$$

$$= p(V)p(V'),$$

an operation of period 2 and type III. Thus $T$ is of period 6.

It is easy to see that the subgroup of $G$ which keeps $\Sigma$ fixed contains operations
which transform any pair of opposite edges of the tetrahedron into any other pair,
and operations (products of projections from suitably chosen nodes on $u$ and $u'$)
which keep $u$ and $u'$ unchanged and transform $V$, $V'$ into any other assigned pair of
nodes on these edges. Thus every one of the 54 operations $T$ derived from $\Sigma$ is
conjugate to one of the two operations $T$, $T^{-1}$ which keep $V$ and $V'$ fixed. And $T$
and $T^{-1}$ are conjugate, for

$$T = p(X)p(Y)p(U)p(U'), \quad T^{-1} = p(U')p(U)p(Y)p(X) = p(Y)p(X)p(W')p(W),$$

and

$$[p(Y)p(Y')]\,T\,[p(Y)p(Y')]^{-1} = p(Y)p(Y')p(X)p(Y)p(U)p(U')p(Y')p(Y)$$

$$= p(Y)p(X)p(Y)p(Q')p(Q)p(Y)$$

$$= p(Y)p(X)p(W')p(W) = T^{-1}.$$

Thus all the 54 operations derived from $\Sigma$ are conjugate. And since $G$ is transitive
on the Steiner primes the operations of this type corresponding to different primes
are all conjugate in $G$. Thus $G$ contains

XII. A conjugate set of 2160 operations of period 6, products of four projections,
whose squares are of type XI and whose cubes are of type III.

The next type of product that arises can be put in the form $T_1 \cdot p(A)$ where, again,
$T_1$ is an operation of type VIII associated with one of the Jacobian planes in $\Sigma$
leaving a node $B$ in this plane invariant, and $A$ is a node in $\Sigma$, not lying in the
Jacobian plane, such that $A$ and $B$ lie on intersecting edges of the tetrahedron. For
each operation $T_1$ there are thus six possible positions for $A$. Hence there are, in all,
$4 \cdot 18 \cdot 6 = 432$ products of this nature to be considered. I shall prove that these lead
to 144 distinct operations.

A typical operation of this type is $T = p(X)p(Y)p(U)p(X')$. By elementary
transformations it is seen that

$$p(X)p(Y)p(U)p(X') = p(X)p(Y)p(R')p(U) = p(R')p(U')p(V')p(U)$$

$$= p(U')p(V')p(Q')p(U)$$

$$= p(U')p(V')p(Y')p(Q') = p(Y')p(Q)p(R)p(Q')$$

$$= p(Q)p(R)p(X')p(Q').$$

Thus $T$ is expressible in the form $T_1 \cdot p(A)$ in three ways, in which the operation $T_1$
belongs to the respective Jacobian planes $xpu$, $xp'u'$, $x'pu'$. Hence the 432 products
give rise to at most 144 distinct operations. The operation $T$ permutes the 18 nodes
lying in $\Sigma$ in the cycles

$$(X\,Q\,V'\,Z\,P\,U'\,Y\,R\,W')\,(X'\,Q'\,U)\,(Y'\,R'\,W)\,(Z'\,P'\,V).$$

From this it is easily seen (i) that no cross-plane lying in $\Sigma$ is unaltered by $T$, (ii) that
the three Jacobian planes in $\Sigma$ other than $x'p'u$ are permuted cyclically, (iii) that the
Jacobian plane $x'p'u$ is left unchanged, but that the three triads of $\kappa$-lines

$$(X'Q'W,\ Y'R'V,\ Z'P'U), \quad (Q'UY',\ R'WZ',\ P'VX'), \quad (UX'R',\ WY'P',\ VZ'Q'),$$

which are the edges, lying in this plane, of the tetrahedra of Jacobian planes in the
three Steiner primes other than $\Sigma$ which contain the plane $x'p'u$, are permuted
cyclically. From these properties it is easy to see that distinct Steiner primes give
distinct operations.

It will now be proved that there are just 144 distinct operations of this type
derived from $\Sigma$. In the first place it is to be observed that

$$T^3 = p(X)p(Y)p(U)p(X') \cdot p(X)p(Y)p(U)p(X') \cdot p(X)p(Y)p(U)p(X')$$

$$= [p(X)p(Y)]^2 p(V)p(X')p(U)p(X') \cdot p(X)p(Y)p(U)p(X')$$

$$= p(Y)p(X) \cdot p(X)p(Y)p(W)p(P')p(U)p(X')$$

$$= p(W)p(P')p(U)p(X'),$$

which is easily recognized as an operation of type VII and period 3 associated with
the Jacobian plane $x'p'u$. Thus, since $T$ contains a cycle of period 9, $T$ is an operation
of period 9. The three operations $T$, $T^4$, $T^7$ effect the same permutation of the nine
nodes in the plane $x'p'u$, this permutation consisting of three cycles each of three
non-collinear nodes. It is easy to see that the three operations $p(X)p(Y)p(U)p(A)$,
where $A = X'$, $Y'$ or $Z'$, the transforms of these operations by $p(X')$, and the squares
of these six operations (which are all of the same type† as $T$), effect 12 different
permutations of the nine nodes in $x'p'u$. Thus these 12 operations, together with
their fourth and seventh powers (which may be verified to be of the same type‡)
give 36 operations of this type, all distinct, which leave the plane $x'p'u$ unchanged.
Thus, from the four planes $xpu$, $xp'u'$, $x'pu'$, $x'p'u$ we get 144 distinct operations.

† In fact, $T^2 = p(Y)p(X)p(V)p(R')$.

‡ In fact, $T^4 = p(X)p(Y)p(V)p(Z')$, $T^7 = p(X)p(Y)p(W)p(Y')$.

These operations cannot all be conjugate in $G$, for their cubes, which are of type
VII, belong to two distinct conjugate sets. It will be proved, in fact, that they form
two conjugate sets in $G$. In the first place, since

$$p(X)p(Y)p(U) = p(U)p(R)p(P) = p(R)p(P)p(W),$$

the operations $p(X)p(Y)p(U)p(X')$ and $p(R)p(P)p(W)p(X')$ are identical. It
follows that every operation $T_1 \cdot p(A)$ of the type considered can be expressed as a
product $p(X_1)p(Y_1)p(U_1)p(X_1')$ of projections from four nodes $X_1$, $Y_1$, $U_1$, $X_1'$ lying
in a Steiner prime $\Sigma_1$ with the property that $X_1Y_1$ is an edge of the tetrahedron
formed by the Jacobian planes in $\Sigma_1$, $U_1$ lies in one of these planes through $X_1Y_1$,
and $X_1'$ lies on the polar $\kappa$-line of $X_1Y_1$. Now there is a unique automorphism of the
set of nodes in which $X$, $Y$, $U$, $X'$ correspond respectively to $X_1$, $Y_1$, $U_1$, $X_1'$, for
incidence relations determine uniquely (i) the node corresponding to the third node
$Z$ on the $\kappa$-line $XY$, (ii) the nodes corresponding to the remaining nodes in the
Jacobian plane $XYU$, (iii) the nodes corresponding to the remaining nodes in $\Sigma$,
(iv) the nodes corresponding to the remaining 27 nodes, which are the poles of the
Jordan primes containing the various cross-planes in $\Sigma$. Hence any two products
of the type $T_1 \cdot p(A)$ are conjugate in the group $G^*$ of automorphisms of the set of
nodes. Since $G$ is a subgroup of index 2 in $G^*$ these operations form at most two
conjugate sets in $G$. But it has been seen that they cannot all belong to the same
conjugate set. Thus they form exactly two conjugate sets, and hence $G$ contains

XIII. 5760 operations of period 9, products of four projections, forming two conjugate
sets of 2880 operations each, whose cubes are of type VII.

It is not difficult to prove that any irreducible product of four projections from
nodes of $\Sigma$ which do not lie in a plane can be reduced by elementary transformations
to one of the three types XI, XII, XIII. Any other irreducible product thus contains
at least five factors. There is, in fact, just one further type of product which is
irreducible, namely a product of five projections expressible in the form $T_1 \cdot p(A)$,
where $T_1$ is an operation of type VII, the product of projections from four nodes in
a Jacobian plane lying in $\Sigma$, which leaves every node in this plane invariant, and $A$
is a node in $\Sigma$ which does not lie in this plane. Since each Jacobian plane determines
two such operations and there are nine nodes of $\Sigma$ not lying in any given plane,
there are 72 products of this type to be considered. In fact these give rise to 36
distinct operations.

Consider the typical product $T = p(X)p(Y)p(P)p(Q)p(X')$. It is easily seen that
this operation leaves invariant the four nodes $X$, $Y$, $Z$, $Z'$, interchanges $X'$ and $Y'$,
and permutes the remaining nodes in $\Sigma$ in two cycles of six, namely,

$$(P\,W'\,Q\,U'\,R\,V')\,(P'\,V\,R'\,U\,Q'\,W).$$

Thus $T$ leaves invariant the two Jacobian planes $x'pu'$, $x'p'u$ which pass through $x'$
and interchanges the two Jacobian planes $xpu$, $xp'u'$ which pass through the line $x$
containing three invariant nodes. Considerations of a nature similar to those
previously employed show that distinct Steiner primes must give rise to distinct
operations.

Since $$T = p(X)p(Y)p(P)p(Q)p(X') = p(X)p(Y)p(X')p(V')p(W')$$

$$= p(X)p(Y)p(V')p(W')p(Y'),$$

and $p(X)p(Y)p(V')p(W')$ is an operation of type VII associated with the Jacobian
plane $xp'u'$, each operation of this type is expressible in at least two ways in the
form $T_1 \cdot p(A)$. On the other hand, the two operations $T$ and $T^{-1}$ (which are distinct
since they effect distinct permutations of the twelve nodes on the edges $p$, $p'$, $u$, $u'$
of the tetrahedron) keep fixed $X$, $Y$, $Z$, $Z'$, that is, they keep fixed $Z'$ and each node
of the polar $\kappa$-line of the edge of the tetrahedron on which $Z'$ lies, and by symmetry
a similar pair of operations is associated with each of the 18 nodes in $\Sigma$. There are
thus exactly 36 distinct operations of this type derived from each Steiner prime.

Since $$p(X)p(Y)p(P)p(Q) = p(R)p(P)p(X)p(Y),$$

it follows that

$$T^2 = p(R)p(P)p(X)p(Y)p(X') \cdot p(X)p(Y)p(P)p(Q)p(X')$$

$$= p(R)p(P)[p(X)p(Y)]^2 \cdot p(X')p(P)p(Q)p(X')$$

$$= p(R)p(P)p(Y)p(X)p(V')p(W')$$

$$= p(R)p(P)p(Y)p(Q')p(P')p(X)$$

$$= p(R)p(P)p(Q')p(P')p(X)p(X) = p(R)p(P)p(Q')p(P'),$$

an operation of type XI and period 3, and

$$T^3 = p(X)p(Y)p(P)p(Q)p(X') \cdot p(R)p(P)p(Q')p(P')$$

$$= p(X)p(Y)p(P)p(Q)p(R)p(P) \cdot p(Z')p(Q')p(P')$$

$$= p(X)p(Y)p(Q)p(P) \cdot p(V)p(U)p(Z')$$

$$= p(X)p(Q)p(W)p(P)p(W)p(V)p(Z')$$

$$= p(X)p(Q)p(X)p(V)p(Z')$$

$$= p(Z'),$$

so that $T$ is of period 6 (and incidentally does not belong to a type already considered).
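
As a consistency check on the period just obtained, the quoted cycles of $T$ can be raised to powers directly. The sketch below uses only the cycle decomposition stated above (the helper names are illustrative); it does not recompute $T$ from the projections.

```python
from functools import reduce
from math import gcd

NODES = "X Y Z P Q R U V W X' Y' Z' P' Q' R' U' V' W'".split()

# Cycles of T = p(X)p(Y)p(P)p(Q)p(X') as quoted in the text.
CYCLES = [
    "X' Y'".split(),
    "P W' Q U' R V'".split(),
    "P' V R' U Q' W".split(),
]

def from_cycles(cycles):
    perm = {n: n for n in NODES}
    for cyc in cycles:
        for i, n in enumerate(cyc):
            perm[n] = cyc[(i + 1) % len(cyc)]
    return perm

def power(perm, k):
    out = {n: n for n in NODES}
    for _ in range(k):
        out = {n: perm[out[n]] for n in NODES}
    return out

T = from_cycles(CYCLES)
period = reduce(lambda a, b: a * b // gcd(a, b), (len(c) for c in CYCLES))
fixed_by_square = {n for n in NODES if power(T, 2)[n] == n}
moved_by_cube = {n for n in NODES if power(T, 3)[n] != n}
print(period, sorted(fixed_by_square), len(moved_by_cube))
```

The square fixes exactly the six nodes on the edges $x$, $x'$ and permutes the nodes of each remaining edge cyclically, as an operation of type XI should; the cube is an involution moving 14 of the 18 nodes and fixing $X$, $Y$, $Z$, $Z'$, which is what one expects of the projection $p(Z')$.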

It has been seen that $T$ can be expressed in the form $T_1 \cdot p(A)$, where $T_1$ is an
operation of type VII, in just two ways, the two operations $T_1$ being

$$p(X)p(Y)p(P)p(Q) \quad \text{and} \quad p(X)p(Y)p(V')p(W'),$$

and the corresponding projections $p(A)$ being $p(X')$ and $p(Y')$. Since

$$p(Z') \cdot p(X') \cdot p(Z') = p(Y')$$

and

$$p(Z') \cdot p(X)p(Y)p(P)p(Q) \cdot p(Z') = p(X)p(Y)p(U')p(V') = p(X)p(Y)p(V')p(W'),$$

it is seen that the two operations $T_1$ which give rise to $T$, and the two corresponding
projections $p(A)$, are transformed into each other by the same operation of $G$.
Hence if $T_1' \cdot p(A')$ is any other operation of this type which is conjugate to $T$, then
$T_1'$ must be conjugate to $T_1$. Since the operations of type VII form two conjugate
sets in $G$ there must therefore be at least two conjugate sets of operations of the type
under consideration. But, by reasoning similar to that used in discussing the
operations of type XIII, it is easy to see that any two operations of this type are
conjugate in $G^*$, the group of automorphisms of the set of nodes. Hence the number
of conjugate sets is seen to be exactly two, and hence $G$ contains

XIV. 1440 operations of period 6, products of five projections, forming two conjugate
sets of 720 operations each, whose squares are of type XI and whose cubes are
projections.

Every product of projections from a set of nodes lying in $\Sigma$ belongs to a type
already considered. For the projection from any node in $\Sigma$ leaves $\Sigma$ unchanged, so
that the group generated by the projections from the 18 nodes in $\Sigma$ either coincides
with, or is a subgroup of, the group of order 648 consisting of all the operations of
$G$ leaving $\Sigma$ invariant. In fact, it coincides with this group of order 648, and its
operations consist of (i) identity, (ii) 18 projections from nodes in $\Sigma$, (iii) 27 operations
of type III arising from e-lines in $\Sigma$, (iv) 84 operations of type IV arising from $\kappa$-lines,
(v) 36 operations of type V arising from d-planes, (vi) 162 operations of type VI
arising from c-planes, (vii) 8 operations of type VII and 72 of type VIII arising from
y-planes, and (viii) 6 operations of type XI, 54 of type XII, 144 of type XIII, and
36 of type XIV; making 648 operations in all. Thus every product of projections
from a set of nodes in $\Sigma$ has been accounted for.
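
The tally can be checked by simple arithmetic. The sketch below sums the eight sets and, assuming that the number of Steiner primes is 25920/648 = 40 (an inference from the orders quoted, not a figure stated in this passage), recovers the sizes of the conjugate sets XI to XIV from the numbers arising in a single prime.

```python
# Operations of the group of order 648 leaving a Steiner prime invariant,
# by type, as enumerated in the text.
IN_PRIME = {
    "I": 1, "II": 18, "III": 27, "IV": 84, "V": 36, "VI": 162,
    "VII": 8, "VIII": 72, "XI": 6, "XII": 54, "XIII": 144, "XIV": 36,
}
total = sum(IN_PRIME.values())

# ASSUMPTION: number of Steiner primes = order of G / order of the stabiliser.
STEINER_PRIMES = 25920 // 648

# Types XI to XIV: the text shows that distinct primes give distinct operations,
# so the full sets are obtained by multiplying by the number of primes.
full_sets = {t: IN_PRIME[t] * STEINER_PRIMES for t in ("XI", "XII", "XIII", "XIV")}
print(total, STEINER_PRIMES, full_sets)
```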

6. The remaining operations of $G$, and the subgroup leaving a node invariant

Now return to the discussion of the subgroup of $G$, which will be denoted by $G_5$ (in
accordance with the notation of § 7 below), leaving invariant a node $O$ and the
Jordan prime $\Pi$ which is the polar of $O$. This group is of order 576. It was proved,
in § 5·2, that the projections from the nodes in $\Pi$ generated a subgroup of $G_5$, of
order 192, and the operations of this subgroup were enumerated in nine sets (i) to (ix).
The other operations of known types which belong to $G_5$ are easily seen to be: (x) 16
operations of type VII, two associated with each Jacobian plane through $O$; (xi) 16
operations of type VIII, two associated with each Jacobian plane through $O$, which
are the products of $p(O)$ with the operations of the previous set; (xii) 32 operations
of type XI; (xiii) 96 operations of type XII; (xiv) 128 operations of type XIV,
falling into two sets, of 32 and 96 operations respectively, which are the products of
$p(O)$ with the operations of the two preceding sets. In each of the last three cases,
the number of operations in the set is $nm/45$, where $n$ is the total number of operations
of the type considered and $m$ is the number of nodes left unchanged by an operation
of this type. The division of the 128 operations of the set (xiv) into sets of 32 and
96 arises from the fact that of the four nodes left invariant by an operation of type
XIV a particular set of three lies on a $\kappa$-line.

These fourteen sets account for 480 operations of $G_5$. It will be proved that the
remaining 96 operations belong to a new type, and that this type, together with those
obtained previously, exhausts the operations of $G$.
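
The count of 480 and the rule $nm/45$ can be verified numerically. In the sketch below the values of $m$ (6, 2 and 4 fixed nodes for types XI, XII and XIV) are read off from the cycle descriptions given in § 5·3; they are taken here as assumptions about the action on all 45 nodes rather than proved.

```python
NODES_TOTAL = 45

# Sets (i) to (ix): the subgroup of order 192 generated by projections
# from the nodes of the Jordan prime.
SETS_I_TO_IX = [1, 1, 12, 12, 18, 32, 32, 72, 12]

def in_stabiliser(n, m):
    """Rule nm/45: n operations of a type, each fixing m of the 45 nodes."""
    assert (n * m) % NODES_TOTAL == 0
    return n * m // NODES_TOTAL

set_xii = in_stabiliser(240, 6)     # type XI fixes six nodes (assumed m)
set_xiii = in_stabiliser(2160, 2)   # type XII fixes two nodes (assumed m)
set_xiv = in_stabiliser(1440, 4)    # type XIV fixes four nodes (assumed m)

listed = sum(SETS_I_TO_IX) + 16 + 16 + set_xii + set_xiii + set_xiv
remaining = 576 - listed
print(set_xii, set_xiii, set_xiv, listed, remaining)
```

The split of set (xiv) into 32 and 96 corresponds to $1440 \cdot 1/45$ and $1440 \cdot 3/45$, according as the fixed node is the isolated one or one of the three on a $\kappa$-line.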

Since $p(O)$ leaves each node of $\Pi$ invariant (and is the only operation of $G$ other
than identity with this property) the operations of $G_5$ permute the 12 nodes of $\Pi$
according to a group of order 288. The 480 operations of $G_5$ listed in the sets (i) to (xiv)
account for 240 of these permutations, and it will be convenient to list typical
permutations of each type that arises. The notation of § 5·2 is used for the nodes in
$\Pi$, and the types of permutation are referred to as (a), (b), ..., (i); with an indication,
for each type of permutation, of those of the sets (i), (ii), ..., of operations of $G_5$
from which the permutation arises. The details of the calculation (which are
elementary) are left to the reader. The 240 permutations of the nodes in $\Pi$ are:

(a) the identical permutation [(i), (ii)];

(b) 12 permutations such as $(A_2A_3)(B_2B_3)(C_2C_3)(D_2D_3)$ which keep fixed all
the vertices of one tetrahedron and interchange in pairs the vertices of the other
tetrahedra collinear with one vertex of the former (in this case the vertex $A_1$)
[(iii), (iv)];

(c) 9 permutations such as $(A_1B_1)(C_1D_1)(A_2B_2)(C_2D_2)$ which leave invariant the
vertices of one tetrahedron and interchange the vertices of a pair of opposite edges
of each of the other two, the two edges of one of these tetrahedra meeting the two
edges of the other [(v)];

(d) 32 permutations such as $(A_1A_2A_3)(B_1B_2B_3)(C_1C_2C_3)(D_1D_2D_3)$ permuting
the tetrahedra cyclically, one cycle of three nodes (here $A_1A_2A_3$) lying on a
$\kappa$-line and the others in the three cross-planes passing through the $\kappa$-line [(vi),
(vii)];

(e) 36 permutations such as {A^B^^B^A^ (da^Bj) (Ca^z) 'which inter¬ 

change in pairs the vertices on a pair of opposite edges of one of the tetrahedra and 
permute cyclically the other nodes in J7 which he in two cross-planes, one through 
each of the edges, in such a way that AxB^,B^Bx ... are /c-lines [(viii)]; 

(f) 6 permutations such as $(A_1C_1)(B_1D_1)(A_2B_2)(C_2D_2)(A_3D_3)(B_3C_3)$ which
interchange the nodes on three mutually skew pairs of opposite edges of the
tetrahedra [(ix)];

(g) 16 permutations such as $(A_1B_2B_3)(B_1D_2C_3)(C_1C_2A_3)(D_1A_2D_3)$ permuting
the tetrahedra cyclically, the four cycles of three nodes belonging to four mutually
skew $\kappa$-lines, whose polars lie in a Jacobian plane through $O$ [(x), (xi)];

(h) 32 permutations such as $(B_1D_1C_1)(A_2C_2D_2)(A_3C_3D_3)$ keeping fixed the nodes
on a $\kappa$-line (here $A_1B_2B_3$) and permuting cyclically the three other vertices of each
tetrahedron [(xii), (xiv)];

(i) 96 permutations such as $(A_1B_2)(B_1D_2D_1A_2C_1C_2)(A_3D_3C_3)$ which keep fixed
a vertex (here $B_3$) of one tetrahedron, permute cyclically the other vertices of this
tetrahedron, interchange two vertices of the other tetrahedra lying on a $\kappa$-line
through the fixed vertex, and permute the other nodes in a cycle of six in such a way
that consecutive edges of the hexagon pass through

$$C_3,\ A_3,\ D_3,\ C_3,\ A_3,\ D_3,$$

where the cyclic order is the same as that in which $A_3$, $D_3$, $C_3$ are permuted [(xiii),
(xiv)].
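
A quick consistency check on typical permutations of this list is possible under one assumption about the notation, which is inferred here and not stated in this passage: that the 16 $\kappa$-lines of $\Pi$ are the triples $X_1Y_2Z_3$ whose letters compose to $A$ in the four-group on $A, B, C, D$ (with $A$ neutral). The sketch reads the typical permutations of types (b), (c), (d), (f), (g), (h), (i) as written below (a reading of the printed cycles, itself an assumption), and tests that each maps $\kappa$-lines to $\kappa$-lines; it also sums the nine counts.

```python
from functools import reduce
from math import gcd

LETTERS = "ABCD"
NODES = [l + s for s in "123" for l in LETTERS]

# ASSUMPTION: X1 Y2 Z3 is a kappa-line when the letter indices XOR to 0.
LINES = {frozenset({LETTERS[i] + "1", LETTERS[j] + "2", LETTERS[i ^ j] + "3"})
         for i in range(4) for j in range(4)}

# Typical permutations, one per type, as read from the list (assumed reading).
TYPICAL = {
    "b": "A2 A3 | B2 B3 | C2 C3 | D2 D3",
    "c": "A1 B1 | C1 D1 | A2 B2 | C2 D2",
    "d": "A1 A2 A3 | B1 B2 B3 | C1 C2 C3 | D1 D2 D3",
    "f": "A1 C1 | B1 D1 | A2 B2 | C2 D2 | A3 D3 | B3 C3",
    "g": "A1 B2 B3 | B1 D2 C3 | C1 C2 A3 | D1 A2 D3",
    "h": "B1 D1 C1 | A2 C2 D2 | A3 C3 D3",
    "i": "A1 B2 | B1 D2 D1 A2 C1 C2 | A3 D3 C3",
}

def parse(text):
    """Return the permutation as a dict together with its period."""
    perm = {n: n for n in NODES}
    cycles = [c.split() for c in text.split("|")]
    for cyc in cycles:
        for k, n in enumerate(cyc):
            perm[n] = cyc[(k + 1) % len(cyc)]
    period = reduce(lambda a, b: a * b // gcd(a, b), (len(c) for c in cycles))
    return perm, period

def preserves_lines(perm):
    return all(frozenset(perm[n] for n in line) in LINES for line in LINES)

CHECKS = {name: (preserves_lines(parse(t)[0]), parse(t)[1]) for name, t in TYPICAL.items()}
COUNTS = {"a": 1, "b": 12, "c": 9, "d": 32, "e": 36, "f": 6, "g": 16, "h": 32, "i": 96}
print(CHECKS, sum(COUNTS.values()))
```

The nine counts sum to 240, leaving 48 of the 288 permutations to be supplied by the new type.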

It will be proved that the remaining operations of $G_5$ consist of:

(xv) 96 operations of the form $T_1T_2$, where $T_1$ is a product of projections
from three non-collinear nodes in a Jacobian plane through $O$ which leaves $O$
invariant, and $T_2$ is a product of projections from two vertices of one of the three
tetrahedra in $\Pi$, the product of any such operation with $p(O)$ being of the same
type.

Further, it will be shown that the permutations effected by these operations on
the nodes in $\Pi$ consist of:

{j) 48 permutations of the form permuting 

the vertices of the three tetrahedra in two cycles of six, the square of the permutation 
being a permutation of the set (g). 

In the first place, if $T_2$ is the product of two projections from nodes on an edge of
one of the tetrahedra, then $T_2\,p(O)$ is the product of projections from the two nodes
of the opposite edge. Thus if $T_1T_2$ is an operation of the type considered, so also is
$T_1T_2\,p(O)$. Moreover, the permutations effected by $T_1$ and $T_2$ on the nodes in $\Pi$
have the forms (g) and (c) respectively, and the product of the typical permutations
given for these forms is the permutation given for the form (j). An examination
of the nature of the permutations (g) and (c) readily shows that the two permutations
chosen are typical of any such pair. Moreover, it is easily seen that precisely 48
permutations of type (j) exist, three having any given permutation of the form (g)
as square; and this indicates the existence of 96 operations of the set (xv), two
giving rise to each of the 48 permutations of the nodes in $\Pi$, these being of the forms
$T_1T_2$ and $T_1T_2\,p(O)$.

Further study of the operations of this type, which now follows, makes it clear
that the operations are completely symmetrical with respect to the three tetrahedra,
and that each product $T_1T_2$ can be expressed in three equivalent forms, in
which $T_2$ is associated with each of the three tetrahedra in turn. A direct verification
of this, by means of elementary transformations, is left to the reader. The relation
of these operations to the whole set of nodes will now be considered in detail.
Take, as a typical operation of this type, the product

$$L = p(14)\,p(12,56,34)\,p(16,35,24) \cdot p(16)\,p(16,45,23),$$

where the first three factors form an operation of type VIII keeping the node (45)
and a Jacobian plane through it fixed, and the last two factors form an operation
$T_2$, the product of projections from two nodes in the polar prime of (45). By means of
elementary transformations the product can be transformed into a rather more
convenient form. Thus

$$p(14)\,p(12,56,34)\,p(16,35,24)\,p(16)\,p(16,45,23)$$

$$= p(46)\,p(41)\,p(15,34,26)\,p(16,35,24)\,p(16,45,23)$$

$$= p(46)\,p(41)\,p(15,34,26)\,p(34)\,p(16,35,24)$$

$$= p(46)\,p(41)\,p(43)\,p(15,34,26)\,p(16,35,24),$$

and so $L = p(46)\,p(41)\,p(43)\,p(15,34,26)\,p(16,35,24)$,

which will be regarded henceforth as the standard form. Now

= 55(46)55(41) 55 ( 43 ) 55 ( 15 ,34,26) 55 ( 16 ,35,24) 

X 55(46) 55(41) 55 ( 43 ) 55 ( 15 ,34, 26 ) 55 ( 16 ,35,24) 

= [2J(46) 55(41) 55 ( 43 )P . 55 ( 12 ,35,46) 55 ( 13 ,45,26) 55 ( 15 ,34, 26 ) 55 ( 16 ,35,24) 

= 55(16) 55(36) . 55 ( 41 ) 55 ( 43 ) . 55 ( 12 ,35,46) 55 ( 16 ,35,24) p{U, 56,23)55(14,36,25) 

= 55(36) 55(31) 55(31) 55 ( 41 ) . 55 ( 26 ) 55 ( 12 ,35, 46 ) 55 ( 14 , 66 , 23 ) 55 ( 14 ,36,25) 

= 55(36) 55(41) 55(26) 55 ( 14 ,66,23) 55 ( 12 ,35,46) 55 ( 14 ,36,25) 

= 55(26) . 55 ( 23 ) 55(41)55(14,66,23) . 55 ( 12 ,36,46) 55 ( 14 ,36,26) 

= 55(26) 55(56) . 55 ( 14 ,23,56) 55 ( 12 ,35,46) 55 ( 14 ,36,26)t 
= 55(25) 55(26) 55 ( 14 ,23,66) 55(14,36,25) 55 ( 12 ,36,46) 

= 55(26) 55 ( 14 ,23,66) 55 ( 12 ,36,46), 

which is easily recognized as an operation of type VIII leaving (45) invariant, and 

= 55(46) 55(41) 55 ( 43 ) 55 ( 16 ,34,26) 55 ( 16 ,36,24) . 55 ( 26 ) 55 ( 14 ,23, 66 ) 55 ( 12 ,36,46) 

= 55(46) 55(41) 55 ( 43 ) 55 ( 25 ) . 55 ( 12 ,34,' 66 ) 55 ( 16 ,23,46) 55(14,23,56) . 55 ( 12 ,36,46) 

= 55(16) 55(36) 55(26) 55 ( 46 ) 55 ( 16 ,23,46) 55 ( 14 ,23, 66 ) 55 ( 26 ) 55 ( 12 -, 35,46) 

= 55 ( 16 ) 55(36) 55(26) 55(16,23,46) 55 ( 26 ) 55 ( 12 ,36,46) 

= 55(16) 55(36) 55(26) 55 ( 26 ) 55 ( 12 ,36,45) 55 ( 12 ,36,46) 

= 55(16) 55(36) 55(26) 55 ( 26 ) 55 ( 66 ) 55 ( 12 ,36,46) 

= 55(16)55(36)55(26)55(12,36,46), 

which is an operation of type X whose square is the projection $p(45)$. Thus $L$ is an
operation of period 12, and $L^6 = p(45)$.

The operation $L$ keeps the node (45) fixed; it permutes the nodes in the polar prime
of (45) in two cycles of six, and the remaining 32 nodes in two cycles of four and two
cycles of twelve, the actual cycles being

[(12), (16), (13,26,45), (36), (16,45,23), (13)],

[(23), (26), (12,45,36), (16,23,45), (13,45,26), (12,36,45)],

[(24), (12,34,56), (25), (12,35,46)], [(14,23,56), (15,34,26), (15,23,46), (14,35,26)],

[(14), (15,26,34), (13,56,24), (12,56,34), (16,25,34), (14,56,23), (15),
(14,26,35), (13,46,25), (12,46,35), (16,24,35), (15,46,23)],

[(34), (16,34,25), (15,24,36), (15,36,24), (13,25,46), (46), (35), (16,35,24),
(14,25,36), (14,36,25), (13,24,56), (56)].
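
The cycle list can be checked for internal consistency. The sketch below takes the cycles as read above and tests that, together with the fixed node (45), they cover 45 distinct well-formed labels, that the period is 12, and that $L^6$ agrees with the interchange of the symbols 4 and 5 in every label. That last test rests on an assumption: that $p(45)$ acts on the node labels as the transposition (45), in line with the later remark that the projections $p(ij)$ correspond to transpositions.

```python
from functools import reduce
from math import gcd

FIXED = "45"
CYCLES = [
    ["12", "16", "13,26,45", "36", "16,45,23", "13"],
    ["23", "26", "12,45,36", "16,23,45", "13,45,26", "12,36,45"],
    ["24", "12,34,56", "25", "12,35,46"],
    ["14,23,56", "15,34,26", "15,23,46", "14,35,26"],
    ["14", "15,26,34", "13,56,24", "12,56,34", "16,25,34", "14,56,23",
     "15", "14,26,35", "13,46,25", "12,46,35", "16,24,35", "15,46,23"],
    ["34", "16,34,25", "15,24,36", "15,36,24", "13,25,46", "46",
     "35", "16,35,24", "14,25,36", "14,36,25", "13,24,56", "56"],
]

def swap45(label):
    """ASSUMED action of p(45): interchange the symbols 4 and 5, pair by pair."""
    table = str.maketrans("45", "54")
    return ",".join("".join(sorted(pair.translate(table))) for pair in label.split(","))

nodes = [FIXED] + [n for cyc in CYCLES for n in cyc]
L = {FIXED: FIXED}
for cyc in CYCLES:
    for i, n in enumerate(cyc):
        L[n] = cyc[(i + 1) % len(cyc)]

def power(perm, k):
    out = {n: n for n in perm}
    for _ in range(k):
        out = {n: perm[out[n]] for n in perm}
    return out

period = reduce(lambda a, b: a * b // gcd(a, b), (len(c) for c in CYCLES))
L6 = power(L, 6)
well_formed = all(len(n) == 2 or sorted(n.replace(",", "")) == list("123456") for n in nodes)
print(len(set(nodes)), period, well_formed, all(L6[n] == swap45(n) for n in nodes))
```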

† Since the nodes (14), (23), (56), (14,23,56), (14,56,23) form a Jordan pentahedron.



Since $L$ keeps only one node fixed the operations of this type arising from different
nodes are distinct. Since the squares of these operations are of type VIII, the
aggregate of operations of this type forms at least two conjugate sets in $G$. It will be
proved that there are exactly two such sets. For the eight nodes belonging to the two
cycles of four lie in a Jacobian plane through (45). The four nodes in any one of these
cycles lie alternately on two $\kappa$-lines through (45), and the joins of consecutive nodes
pass, in order, through the four nodes of the other cycle. Thus either cycle determines
the other. There are six pairs of cycles of this kind associated with each Jacobian
plane through (45), and it is easy to see that there is a collineation in $G$ which
transforms any one of them into any other. Thus, of the 96 operations such as $L$
which keep (45) fixed, there are two associated with each of the 48 pairs of cycles,
and if one of these is $L$ the other is clearly $L^5$. Then every operation of the same type
as $L$ is conjugate to one of the two operations $L$, $L^5$. But these are not conjugate,
since their squares are non-conjugate operations of type VIII. Thus $G$ contains

XV. 4320 operations of period 12, products of five projections, forming two conjugate
sets of 2160 operations each, whose squares are of type VIII and whose cubes are of type X.

Since the types I to XV include 25920 distinct operations, all the operations of $G$
are accounted for, and it follows that every operation of $G$ is expressible as the product
of five or fewer projections. The distribution of the operations of $G$ in conjugate sets
agrees with that obtained, by very different methods, by Frame (1936, p. 483).
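
The final count can be reproduced by adding the sizes of the fifteen types. Only the sizes of types X to XV are stated in this part of the paper; the sizes used below for types I to IX are assumptions, chosen to agree with the rule $nm/45$ and with the sets (i) to (xi) listed for the subgroup leaving a node invariant, and should be checked against the earlier sections.

```python
# Number of operations of each type.
# Types X to XV: as stated in this section.
# Types I to IX: ASSUMED values, not quoted from this passage.
TYPE_SIZES = {
    "I": 1, "II": 45, "III": 270, "IV": 480, "V": 1440,
    "VI": 3240, "VII": 80, "VIII": 720, "IX": 5184,
    "X": 540, "XI": 240, "XII": 2160, "XIII": 5760, "XIV": 1440, "XV": 4320,
}

# Types said (in this section) to split into two conjugate sets of equal size.
SPLIT_IN_TWO = ["VII", "VIII", "XIII", "XIV", "XV"]

order_of_G = sum(TYPE_SIZES.values())
conjugate_sets = len(TYPE_SIZES) + len(SPLIT_IN_TWO)
print(order_of_G, conjugate_sets)
```

With these figures the fifteen types account for 25920 operations falling into 20 conjugate sets.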

7. Some subgroups of $G$

As an illustration of the usefulness of the classification of operations of $G$
undertaken above it will be shown how the nature of the operations of five important
maximal subgroups of $G$ can be determined. These five subgroups are:

(i) the subgroup $G_1$ of order 960 leaving invariant a Jordan pentahedron,

(ii) the subgroup $G_2$ of order 720 leaving invariant the nodes of a 15-set,

(iii) the subgroup $G_3$ of order 648 leaving invariant a Steiner prime,

(iv) the subgroup $G_4$ of order 648 leaving invariant a Jacobian plane,

(v) the subgroup $G_5$ of order 576 leaving invariant a node.

Of these, $G_3$ and $G_5$ have been described in §§ 5·3 and 6 respectively. The procedure
for determining the operations of the other subgroups will be given in outline,
leaving the details to the reader and referring him to the table at the end of the
section for the results.

For G_1 one observes first that there are 192 operations which transform a Jordan
pentahedron into itself and keep a node of this fixed, these forming a subgroup of
index 3 in the G_5 which keeps the node fixed. These operations are easily identified
in the list of operations of G_5 already given. By examining the permutations effected
on the nodes of the Jordan pentahedron by these operations it is easy to see, first,
that from the five vertices of the pentahedron 576 distinct operations are obtained
in this way, and secondly, that these operations all effect even permutations of the




five nodes of the pentahedron. Hence, as 576 > ½·960, every operation of G_1 permutes
the vertices of the pentahedron evenly. Any of the remaining 384 operations
of G_1 thus effects an even permutation of the nodes of the pentahedron which
keeps no vertex fixed. Such a permutation must therefore consist of a single cycle of
five vertices, and the corresponding operation of G_1 must be of type IX, the only
type whose period is divisible by 5.

For G_2 it is sufficient to observe that, from a consideration of the 15-set consisting
of the nodes (ij), the group is simply isomorphic with the symmetric group of degree
6, and is generated by the projections p(ij) which correspond to transpositions. The
only operation of G_2 whose nature is not at once clear is the cyclic operation of period
6, and it may be seen, either by reducing the product of projections corresponding
to such an operation, or (more simply) by considering the natures of the square and
cube of the operation, that it is actually of type XII.

For G_4 there have been given in § 4.3 54 operations generated by projections from
the nodes lying in a Jacobian plane, and from the list of operations of G_3 given in
§ 6.3 one can easily select a further 108 operations, associated with each Steiner
prime through the plane, which leave the plane invariant. These account in all for
486 operations of G_4, and the remaining operations are easily seen to consist of 108
operations of type XV and 54 of type X which are the cubes of these.

In table 1 are enumerated the operations of each of the fifteen types in G and its
subgroups G_1, G_2, G_3, G_4, G_5, and also, for each type of operation, its period p
and the least number n of projections whose product is of the given type. Each type
of operation corresponds to a single conjugate set in G with the exception of those
marked by an asterisk, which each yield two conjugate sets.


Table 1. List of operations of the simple group G and its
subgroups G_1, G_2, G_3, G_4, G_5

                      number of operations of type in
type     p    n        G    G_1    G_2    G_3    G_4    G_5
I        1    -        1      1      1      1      1      1
II       2    1       45      5     15     18      9     13
III      2    2      270     70     60     27      -     30
IV       3    2      480      -     40     84     48     32
V        6    3     1440      -    120     36     72     32
VI       4    3     3240    120    180    162      -     72
VII*     3    4       80      -      -      8     26     16
VIII*    6    3      720      -      -     72     90     16
IX       5    4     5184    384    144      -      -      -
X        4    4      540     60      -      -     54     12
XI       3    4      240     80     40      6     24     32
XII      6    4     2160     80    120     54      -     96
XIII*    9    4     5760      -      -    144    144      -
XIV*     6    5     1440    160      -     36     72    128
XV*     12    5     4320      -      -      -    108     96

total              25920    960    720    648    648    576
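The entries of table 1 can be checked against the orders of the groups. The following is a minimal sketch in Python, not part of the original paper; it assumes the readings 5760 (type XIII in G), 54 (type XII in G_3), 54 (type X in G_4) and 576 (order of G_5) where the print is doubtful, and the names `TABLE_1`, `ORDERS` and `STARRED` are illustrative only.

```python
# Table 1 as read here: type -> (period p, counts in G, G_1, G_2, G_3, G_4, G_5).
# A count of 0 stands for a dash in the table. Doubtful entries are assumptions.
TABLE_1 = {
    "I":    (1,  [1, 1, 1, 1, 1, 1]),
    "II":   (2,  [45, 5, 15, 18, 9, 13]),
    "III":  (2,  [270, 70, 60, 27, 0, 30]),
    "IV":   (3,  [480, 0, 40, 84, 48, 32]),
    "V":    (6,  [1440, 0, 120, 36, 72, 32]),
    "VI":   (4,  [3240, 120, 180, 162, 0, 72]),
    "VII":  (3,  [80, 0, 0, 8, 26, 16]),
    "VIII": (6,  [720, 0, 0, 72, 90, 16]),
    "IX":   (5,  [5184, 384, 144, 0, 0, 0]),
    "X":    (4,  [540, 60, 0, 0, 54, 12]),
    "XI":   (3,  [240, 80, 40, 6, 24, 32]),
    "XII":  (6,  [2160, 80, 120, 54, 0, 96]),
    "XIII": (9,  [5760, 0, 0, 144, 144, 0]),
    "XIV":  (6,  [1440, 160, 0, 36, 72, 128]),
    "XV":   (12, [4320, 0, 0, 0, 108, 96]),
}
ORDERS = [25920, 960, 720, 648, 648, 576]
STARRED = {"VII", "VIII", "XIII", "XIV", "XV"}  # types yielding two conjugate sets


def column_sums(table):
    """Total number of operations listed for G, G_1, ..., G_5."""
    return [sum(counts[k] for _, counts in table.values()) for k in range(6)]


def conjugate_set_sizes(table, starred):
    """Sizes of the conjugate sets of G implied by the first column."""
    sizes = []
    for name, (_, counts) in table.items():
        if name in starred:
            sizes += [counts[0] // 2, counts[0] // 2]
        else:
            sizes.append(counts[0])
    return sizes


sums = column_sums(TABLE_1)
sizes = conjugate_set_sizes(TABLE_1, STARRED)
```

With these readings every column adds up to the order of the corresponding group, and each conjugate set has a size dividing 25920, which is the main reason for preferring them.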




8. Some sets of generators of G

I shall conclude by showing how the expressions of the operations of G in terms
of projections can be usefully employed in investigating various systems of generators
of the group. It does not seem unreasonable to suppose that a systematic investigation
of such sets could be made along these lines; but no such investigation (which
would of necessity be elaborate) has yet been made, and I content myself with a few
illustrative examples, some of them new and others confirming results already
obtained.

The group G is generated by the 45 projections (or, indeed, as it is simple, by the
operations of any one of its conjugate sets). But these are connected by many
relations, and it is natural to inquire as to the minimum number of projections in
terms of which all the others can be expressed (and hence any operation of G can be
expressed). This number must be five at least, since any four nodes determine a
prime, and it has already been seen that the projections from the nodes which lie in
any prime generate a proper subgroup of G. In fact, five projections suffice. This is
shown by proving that any projection can be expressed in terms of the projections
from the five nodes

(12), (16), (25), (46), (15,26,34),

(which are clearly typical of numerous other similar sets). The proof rests on the
identities

p(ij)p(jk)p(ij) = p(ik),

p(ij)p(ik,jl,mn)p(ij) = p(jk,il,mn),

p(ij)p(kl)p(mn) = p(ij,kl,mn)p(ij,mn,kl),

which are consequences of the fundamental relations (1) between the projections
given in § 2 and the relations of incidence between the nodes. From the first of these
it is easy to see that every p(ij) for which i, j ≠ 3 is expressible in terms of p(12),
p(16), p(25), p(46); in fact these nodes (ij) are just the nodes of an ω-prime. From
the second identity it easily follows that every p(ij,kl,mn) is expressible in terms of
these ten projections and p(15,26,34). Whence, from the third identity, p(i3) can
be found, proving the theorem.
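The first step of this proof can be checked mechanically, on the assumption (made in § 7 for the subgroup of order 720) that the projections p(ij) multiply like the transpositions (ij) of six digits. The sketch below, in Python, is an illustration and not part of the original argument: it closes the four transpositions (12), (16), (25), (46) under the rule p(ij)p(jk)p(ij) = p(ik) and compares the result with the ten transpositions of the digits other than 3. The helper names are hypothetical.

```python
from itertools import combinations

N = 6  # digits 1..6; index 0 of each tuple is unused padding


def transposition(i, j):
    """The permutation of 1..N exchanging i and j, as a tuple."""
    perm = list(range(N + 1))
    perm[i], perm[j] = j, i
    return tuple(perm)


def compose(a, b):
    """The product ab: apply b first, then a."""
    return tuple(a[b[x]] for x in range(N + 1))


def closure_under_conjugation(seeds):
    """Add a*b*a for all a, b already found, until nothing new appears."""
    found = set(seeds)
    grew = True
    while grew:
        grew = False
        for a in list(found):
            for b in list(found):
                c = compose(a, compose(b, a))
                if c not in found:
                    found.add(c)
                    grew = True
    return found


seeds = [transposition(1, 2), transposition(1, 6),
         transposition(2, 5), transposition(4, 6)]
reached = closure_under_conjugation(seeds)
expected = {transposition(i, j) for i, j in combinations([1, 2, 4, 5, 6], 2)}
```

The closure succeeds because the four pairs 12, 16, 25, 46 join the five digits 1, 2, 4, 5, 6 into a connected chain; a set of four pairs containing a closed circuit would not do.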

Coxeter has shown (Coxeter 1940, p. 474) that the relations

173 = F® = F 2 = Z* = 72 = iJJVf = {YWY = (FZ )3 = (Fr )3 

= (C7TF)2 = {UXf = (177)3 = (FZ)3 = {WY)^ = (XY)^ = 1 

constitute an abstract definition for G in terms of five suitably chosen operations
U, V, W, X, Y. These generating relations are satisfied by taking

U = p(23)p(12)p(56)p(45), V = p(25)p(36), W = p(13)p(45),

X = p(23)p(56), Y = p(16,25,34)p(15,36,24).

Here we recognize U as an operation of type XI and period 3, and V, W, X, Y as
operations of type III and period 2. That the other generating relations are satisfied




is most simply shown by reducing the products of pairs of these operations to one of
the standard types by means of elementary transformations. Two examples of the
process will suffice. Thus

XY = p(23)p(56)p(16,25,34)p(15,36,24) = p(15,36,24)p(23)p(56)p(15,36,24)

= p(15,26,34)p(16,35,24),

an operation of type III and period 2, and

UV = p(23)p(12)p(56)p(45)p(25)p(36) = p(13)p(23)p(56)p(25)p(24)p(36)

= p(13)p(23)p(25)p(26)p(36)p(24) = p(13)p(35)p(26)p(24),

an operation of type XI and period 3. The remaining relations may be verified in a
similar way.

That these five operations generate G follows from the stronger theorem that G
is generated by the two operations H = UVW and K = XY. That G possesses a
pair of generating operations seems to have been first pointed out by Brahana
(1930, p. 633), and the identification of Brahana's operations with H and K is stated,
without proof, by Coxeter (1940, p. 476). A simple proof of the fact that H and K
generate G will now be given. Since

UVW = p(13)p(35)p(26)p(24).p(13)p(45) = p(15)p(26)p(24)p(45)

= p(42)p(46)p(15)p(45) = p(42)p(46)p(45)p(41),

an operation of period 5 and type IX, it follows that

H = p(42)p(46)p(45)p(41), K = p(15,26,34)p(16,35,24),

and HK = p(42)p(46)p(45)p(41).p(15,26,34)p(16,35,24)

= p(46)p(41)p(26)p(15)p(15,26,34)p(16,35,24)

= p(46)p(41)p(43)p(15,34,26)p(16,35,24),†

which is the operation L of type XV and period 12 considered in § 6. Thus the group
generated by H and K contains L, and hence contains L^6 = p(45). It was shown
in § 2 that if an operation T of G transforms a node A into a node A', then

T p(A) T^{-1} = p(A').

Hence, since H permutes the nodes in such a way that the digits in their symbols
undergo the permutation (15624), the operations H^r p(45) H^{-r} for r = 1, 2, 3, 4 are
respectively p(16), p(25), p(46), p(12). And hence p(14) = p(16)p(46)p(16) belongs
to the group generated by H, K, and so does Lp(14)L^{-1}, which is p(15,26,34), since L
transforms the node (14) into (15,26,34) (from the expression of L as a permutation
of the nodes given in § 6). Thus the group generated by H and K contains the
projections from the five nodes (12), (16), (25), (46), (15,26,34), and it was seen at the
beginning of the present section that these projections generate the entire group.
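Those steps of the argument which involve only projections p(ij) can be verified by treating them as transpositions of the six digits, as in the subgroup of order 720 of § 7. The Python sketch below is offered as an illustration under that assumption. It further assumes the readings U = p(23)p(12)p(56)p(45), V = p(25)p(36), W = p(13)p(45), since several digits are doubtful in the print; products are taken with the right-hand factor applied first, and all function names are hypothetical.

```python
from functools import reduce

N = 6
IDENTITY = tuple(range(N + 1))  # index 0 is unused padding


def t(i, j):
    """Transposition (ij), standing for the projection p(ij)."""
    perm = list(IDENTITY)
    perm[i], perm[j] = j, i
    return tuple(perm)


def compose(a, b):
    """The product ab: apply b first, then a."""
    return tuple(a[b[x]] for x in range(N + 1))


def product(*perms):
    """Product of several permutations, written in the order of the text."""
    return reduce(compose, perms, IDENTITY)


def inverse(a):
    inv = [0] * (N + 1)
    for x, y in enumerate(a):
        inv[y] = x
    return tuple(inv)


def order(a):
    k, power = 1, a
    while power != IDENTITY:
        power = compose(power, a)
        k += 1
    return k


U = product(t(2, 3), t(1, 2), t(5, 6), t(4, 5))   # assumed reading
V = product(t(2, 5), t(3, 6))                     # assumed reading
W = product(t(1, 3), t(4, 5))
UV = product(U, V)
H = product(U, V, W)

# Conjugates H^r p(45) H^(-r) for r = 1, 2, 3, 4.
conjugates = []
g = IDENTITY
for _ in range(4):
    g = compose(H, g)
    conjugates.append(product(g, t(4, 5), inverse(g)))
```

Under these readings UV has period 3, H is the cycle (15624), and the four conjugates of p(45) are p(16), p(25), p(46), p(12), in agreement with the text. The steps involving nodes of the form (ij,kl,mn) lie outside this model and are not tested here.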

† Since (15), (26), (34), (15,26,34), (15,34,26) form a Jordan pentahedron.




It is possible to show that G can also be generated by the two operations L and
p(A), where A is one of the nodes belonging to a cycle of twelve permuted by L.
It is easy to verify that, in the first of the two cycles listed in § 6, the join of any two
consecutive nodes in the cycle is a κ-line, whose third node is one of those permuted
by L in a cycle of six, and that in the operation L^7 the second cycle has this property.
Since (L^7)^7 = L^49 = L it follows that there is nothing lost in generality by supposing
that A belongs to the first cycle. And since the nodes in any one cycle are permuted
cyclically by L, the projections from these nodes are all of the form L^r T L^{-r}, where
T is any one of them. It is thus sufficient to prove that L and p(14) generate the
group. From what has been said, the group generated by L and p(14) contains the
projections from all 12 nodes of the cycle to which (14) belongs. And from the
projections from two consecutive nodes in the cycle can be deduced the projection
from the third node on the κ-line joining them, which is a node belonging to the
second of the two cycles of six. And the projection L^6 = p(45) also belongs to the
group. Thus, in particular, the group generated by L and p(14) contains the
projections from the nodes (23), (26) and hence the projections from (36) and the
remaining nodes of the first cycle of six to which (36) belongs. Thus the group
contains the five projections p(12), p(14), p(23), p(26), p(15,26,34), and these are easily
seen to form a set of generators of G similar to that given at the beginning of the
section.

It will be noticed that

Lp(14) = p(46)p(41)p(43)p(15,34,26)p(16,35,24)p(14)

= p(46)p(13)p(13,26,45)p(12,46,35),

and it is easy to see that the four nodes concerned in these four projections determine
an ω-prime, so that Lp(14) is an operation of period 5 and type IX. The operation
L^{-1}p(14), whose inverse p(14)L is conjugate to Lp(14), is of the same type. Also,

L^7 p(14) = L^6.Lp(14) = p(45).p(46)p(13)p(13,26,45)p(12,46,35)

= p(56)p(45)p(13)p(13,26,45)p(12,46,35)

= p(56)p(26)p(13,45,26)p(12,46,35)†

= p(56)p(26)p(12,46,35)p(13,45,26).

The product of the first three factors is easily seen to be an operation of type VIII,
associated with a Jacobian plane containing the nodes (56), (26), (12,46,35), which
keeps fixed the node (16,45,23). And since this node is joined by a κ-line to (13,45,26)
it follows that L^7 p(14) is an operation of type XIII and period 9. So, also, L^5 p(14),
which is conjugate to p(14)L^5 = [L^7 p(14)]^{-1}, is of period 9.

Hence Lp(14) and L^7 p(14) have periods 5 and 9. It is easily seen that the same is
true if p(14) be replaced by the projection from any other node in the cycle of twelve

† Since (13), (26), (45), (13,26,45), (13,45,26) form a Jordan pentahedron.




to which (14) belongs, and that for projections from nodes of the other cycle of twelve
the corresponding periods are 9 and 5.

Since L and p(14) generate G, so also do either of the pairs of operations Lp(14), p(14)
or L^7 p(14), p(14); that is, G admits a pair of generators of periods 5, 2 or 9, 2 whose
product is of period 12. Frame has given, as a pair of generators of G, two operations
of periods 9 and 2 whose product has period 5 (Frame 1936, p. 482). It is easy to find
such a pair. In fact, G is generated by the two operations L^7 p(14) and L^6 = p(45),
whose product Lp(14) has period 5. To see this, write

P = L^7 p(14), Q = L^6, R = QP = Lp(14).

Then R belongs to the group generated by P and Q. Moreover,

P^{-1}R = p(14)L^{-7}.Lp(14) = p(14)L^{-6}p(14) = p(14)p(45)p(14) = p(15),

and hence

P^{-1}RQ = p(15)L^6 = p(15)p(45) = p(45)p(14) = L^6 p(14) = Qp(14).

Thus QP^{-1}RQ = p(14), and so p(14) and Rp(14) = L belong to the group generated
by P and Q. Thus, since L and p(14) generate G, so also do P and Q.

A closely related pair of generators has been given by the author (Todd 1936,
p. 173), namely, a pair of operations S and T each of period 5, such that ST is of
period 9 and ST^{-1} is of period 3. Such a pair of operations is given in fact by

S = QP, T = PQ,

where P and Q are the operations just described. For these are of period 5, and
ST = QP^2Q is conjugate to P^2 (and so of period 9), while

ST^{-1} = QPQ^{-1}P^{-1} = Lp(14)p(45)p(14)L^{-7} = L[p(14)p(45)]^2 L^{-1},

which is of period 3. And, since Q^2 = 1, TS = P^2 and (TS)^5 = P. Thus

P = (TS)^5, Q = (TS)^{-5}T,

are expressible in terms of S and T. Whence, as P and Q generate G, so also do S
and T.
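The last step uses only the facts that P has period 9 and Q has period 2, so the identities TS = P^2, (TS)^5 = P and Q = (TS)^{-5}T hold in any group containing such a pair. As a minimal sketch, they can be tried in Python on a stand-in pair of permutations of nine symbols; this pair is an assumption chosen for convenience and does not represent the operations P, Q of G (in particular S and T need not have period 5 here).

```python
M = 9
IDENTITY = tuple(range(M))


def compose(a, b):
    """The product ab: apply b first, then a."""
    return tuple(a[b[x]] for x in range(M))


def power(a, k):
    result = IDENTITY
    for _ in range(k):
        result = compose(result, a)
    return result


def inverse(a):
    inv = [0] * M
    for x, y in enumerate(a):
        inv[y] = x
    return tuple(inv)


# Stand-in operations: P of period 9 (a nine-cycle), Q of period 2 (a swap).
P = tuple((x + 1) % M for x in range(M))
Q = (1, 0) + tuple(range(2, M))

S = compose(Q, P)   # S = QP
T = compose(P, Q)   # T = PQ
TS = compose(T, S)
```

Here `TS` equals `power(P, 2)`, its fifth power is `P`, and `compose(inverse(power(TS, 5)), T)` recovers `Q`, which is all that the argument requires.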

It seems rather striking that the pairs of generators found, in very different ways, 
by various writers, should prove to be related to each other in so simple a manner. 

I should like to express my thanks to Professor Baker, who has very kindly placed
the proof sheets of his tract at my disposal, and with whom I have had many
stimulating discussions on the subject of this paper.





References

Baker 1946 A locus with 25920 linear self-transformations. Camb. Univ. Press.

Brahana 1930 Ann. Math. (2), 31, 629-649.

Burkhardt 1890 Math. Ann. 38, 161-224.

Burnside 1911 Theory of groups of finite order, 2nd ed. Camb. Univ. Press.

Coble 1906 Amer. J. Math. 28, 333-366.

Coxeter 1940 Amer. J. Math. 62, 457-486.

Dickson 1901 Linear groups, with an exposition of the Galois field theory. Leipzig: Teubner.

Frame 1936 Duke Math. J. 2, 477-484.

Jordan 1870 Traité des substitutions. Paris: Gauthier-Villars.

Moore 1897 Proc. Lond. Math. Soc. (1), 28, 357-366.

Todd 1936 Quart. J. Math. (Oxford series), 7, 168-174.




The inversion spectrum of ammonia at
centimetre wave-lengths

By B. Bleaney and R. P. Penrose, Clarendon Laboratory, Oxford

(Communicated by Lord Cherwell, F.R.S. Received 17 July 1946.
Read 27 February 1947)

A technique is described for the investigation of gaseous spectra at high radio-frequencies.
The sensitivity of the method is such that an absorption of 2 × 10⁻⁶ per cm. of path can be
detected at wave-lengths around 1 cm.

The inversion spectrum of ammonia in the ground vibrational state has been examined at
wave-lengths between 1.1 and 1.5 cm. and found to contain an extensive line structure,
arising from centrifugal distortion of the molecule. Twenty-nine lines have been identified,
each corresponding to a different rotational quantum state. The wave numbers of these
lines can be accurately represented by the formula

ν = 0.7935 + [-0.00505(J² + J) + 0.00704K²]

+ 0.63[-0.0050(J² + J) + 0.0070K²]² cm⁻¹ (vacuo)

where J, K are the rotational quantum numbers of the symmetrical top molecule.

Detailed measurements of 18 lines at a pressure of 0.5 mm. Hg show that their half
widths, due to collision broadening, lie between 2 and 6 × 10⁻⁴ cm⁻¹, while their intensities
agree with the theoretical values, within the experimental error of ±5 %. The integrated
intensity at a pressure of 4.5 mm. Hg, over the interval 0.67 to 0.87 cm⁻¹, agrees with the
calculated intensity to better than 1 %.

1. Introduction

The well-known doubling of most of the lines of the infra-red spectrum of gaseous
ammonia is associated with the existence of two equilibrium configurations of the
molecule, with the nitrogen atom on either side of the plane containing the three
hydrogen atoms. The appropriate potential energy curve of the molecule has two
identical minima, separated by a fairly low hump; the associated 'resonance'




causes a splitting of the energy levels ('inversion doubling'). In the ground
vibrational state this splitting amounts to 0.67 cm⁻¹, determined by Wright & Randall
(1933) from measurements on the pure rotation spectrum in the far infra-red.
Transitions between the two sub-levels are allowed, and an absorption line would
be expected at a wave-length of about 1.5 cm. The existence of this absorption was
verified experimentally by Cleeton & Williams (1934) using split-anode magnetrons
as sources of radiation. The wave-length was determined by a diffraction grating,
and the absorption was measured by inserting ammonia (at atmospheric pressure
in a cloth bag) in the path of radiation. The peak of the absorption curve was found
to be at about 0.8 cm⁻¹, and the absorption was spread over a considerable frequency
range (0.6 cm⁻¹) owing to collision broadening.

This absorption has been re-examined using wave-guide technique. Low gas
pressures, down to 0.2 mm. Hg, have been employed in order to reduce the effects
of collision broadening, and an extensive fine structure of the absorption has
thereby been revealed. This fine structure is due to centrifugal distortion of the
molecule, the effect of which is comparatively large because the inversion frequency
varies very rapidly with the distance between the nitrogen atom and the plane of
the hydrogen atoms.

The absorption coefficients to be expected at centimetre wave-lengths are much
smaller than those encountered in the infra-red, and direct measurement would
require considerable path lengths. The difficulties inherent in such a measurement
have been avoided by the use of a resonant cavity, which, effectively, is a short length
of wave-guide traversed by the radiation many times before its final decay. The
efficiency of such a cavity is measured by its magnification or Q value, defined as
2π times the ratio of the stored energy to the energy dissipated per cycle. On
admitting an absorbing gas to the resonator its Q value is reduced, say, from Q_0
to Q_1. The absorption coefficient of the gas is related to Q_0, Q_1 by the expression



The Q values were determined from the resonance curve obtained by detuning
the cavity, and were subject to errors of a few per cent, owing to fluctuations of the
frequency of the oscillator. Since the gaseous absorption is found from the difference
of the two values of 1/Q the inaccuracy in its determination is considerable unless
1/Q_1 is appreciably different from 1/Q_0. This method is thus limited to absorptions
comparable with the 'equivalent absorption' 2π/λQ_0 of the cavity. The value of
Q_0 was about 10,000, from which 2π/λQ_0 ≈ 5 × 10⁻⁴ per cm.

To extend the method to smaller absorptions, use was made of the fact that the
power output from the oscillator was constant within less than 1 %. When the cavity
is tuned to resonance, small fluctuations in the oscillator frequency have only a second
order effect on the amplitude of the energy density in the cavity, instead of the first
order effect obtained when the resonator is detuned for the measurement of Q.
The energy density in the cavity can be determined by a loosely coupled detector,




being proportional to the power registered by this detector, a crystal rectifier.
Simple equivalent circuit considerations show that this power varies as Q², giving



Hence we have 



( 3 ) 


Thus it is possible to determine the absorption by measuring either Q_1 or Q_0 together
with the power ratio. The stability of the oscillator (a reflexion klystron) was
such that a difference of 1 % between the two powers could be detected, corresponding to
an absorption coefficient of about 2 × 10⁻⁶ per cm. At the higher attenuations, the
values of γ given by (1) and (3) were compared, and the agreement was generally
satisfactory. The probable error of the mean values of the absorption coefficients
obtained from this comparison is estimated at about ±3 % or ±2 × 10⁻⁶ per cm.,
whichever is the greater.
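The figures quoted above can be reproduced by a short calculation. The displayed equations (1) to (3) are not legible in this copy, so the Python sketch below rests on assumptions: that the absorption coefficient is (2π/λ)(1/Q_1 - 1/Q_0), which is the form suggested by the 'equivalent absorption' 2π/λQ_0 and by the statement that the absorption is found from the difference of the two values of 1/Q, and that the detector power varies as Q². The wave-length of 1.25 cm. is an illustrative choice, and the function names are hypothetical.

```python
import math


def equivalent_absorption(wavelength_cm, q0):
    """2*pi / (lambda * Q0): the absorption per cm that would halve the empty-cavity Q."""
    return 2.0 * math.pi / (wavelength_cm * q0)


def absorption_from_q(wavelength_cm, q0, q1):
    """Absorption per cm from the Q values without gas (q0) and with gas (q1)."""
    return (2.0 * math.pi / wavelength_cm) * (1.0 / q1 - 1.0 / q0)


def absorption_from_power(wavelength_cm, q0, power_ratio):
    """Absorption per cm from the empty-cavity Q and the detector power ratio.

    power_ratio = (power with gas) / (power without gas), assumed equal to (Q1/Q0)**2.
    """
    return equivalent_absorption(wavelength_cm, q0) * (1.0 / math.sqrt(power_ratio) - 1.0)


WAVELENGTH_CM = 1.25   # illustrative value within the band studied (assumption)
Q0 = 10_000            # empty-cavity Q quoted in the text

floor = equivalent_absorption(WAVELENGTH_CM, Q0)
one_per_cent = absorption_from_power(WAVELENGTH_CM, Q0, 0.99)
```

On these assumptions `floor` comes out close to 5 × 10⁻⁴ per cm. and a 1 % change in detector power corresponds to between 2 and 3 × 10⁻⁶ per cm., consistent with the values stated in the text.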

The wave-length of the radiation was measured by a second resonant cavity. It 
was carefully designed to give an accuracy of 2 parts in 10,000 or better. Small 
changes in wave-length, such as were used in measuring the breadths of lines at 
low pressure, could be measured more accurately, the limit of reading corresponding 
to a change in wave-number of about 0.00002 cm⁻¹. This wave-meter was used
because apparatus for the measurement of frequency by means of the harmonics 
of quartz crystals was not available. The latter method would give greater accuracy, 
and would also be desirable because the frequency of a line is more fundamental 
than the wave-length, through its relation to the quantum of energy. 

The essential differences between the technique described in this paper and that 
normally used in the infra-red may be summarized as follows: 

(a) The radiation from the source is collimated by a wave-guide, which produces 
a beam of radiation less than a wave-length across. This avoids the use of mirrors, 
which would have to be several metres in diameter to collimate the radiation at 
these wave-lengths. 

(b) The effective multiplication of path length obtained by using a resonator 
makes it possible to detect absorptions of the order of 10⁻⁶ per cm. The volume of
the resonator is less than 10 c.c. and the gas can be rapidly admitted or pumped 
away. The energy densities in the cavity with and without gas could therefore 
accurately be compared without requiring long-period stability in the oscillator. 

(c) The reflexion klystron is virtually a monochromatic source, and the limit of 
resolution is determined by the widths of the absorption lines themselves, at the 
gas pressures used in these experiments. In the infra-red the sources provide a 
continuous spectrum of radiation and the resolving power is that of the spectrometer. 

Although the methods described in this paper for the examination of gaseous 
spectra have been applied only over a small wave-length interval between 1
and 1.5 cm., they are capable of extension to decimetre, centimetre and millimetre




wave-lengths. Oscillators and detectors are available at all except millimetre
wave-lengths, and it is likely that they will be developed for this region soon, as work at
5 mm. has already been reported. The strength of the absorptions to be expected,
and the sensitivity of the apparatus, vary with roughly similar powers of the
wave-length. Ceteris paribus, the absorption coefficients vary inversely as the square of
the wave-length (see § 6, equation (4)). If the resonant cavity is scaled in proportion
to the wave-length, its Q varies with λ^{1/2} and the smallest detectable absorption will,
by equation (3), vary with λ^{-3/2}. Thus for measurements at wave-lengths between
100 and 1 mm., the effective merit of the method changes by a factor of 10. At
wave-lengths beyond this interval, the limitations of the method are mainly mechanical;
at decimetre wave-lengths the cavity becomes inconveniently bulky, while its
increasing Q requires still greater frequency stability of the oscillator. At the shorter
wave-lengths the minute dimensions cause machining difficulties, and the use of
a large cavity resonant in many modes, instead of a small cavity resonant in one
particular mode (see Autler, Becker & Kellogg 1946) appears more promising.
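The factor of 10 follows from the two power laws just stated. As a minimal sketch in Python, assuming that the absorption to be measured varies as λ⁻² and the smallest detectable absorption as λ^{-3/2} (that is, Q varying as λ^{1/2} for a scaled cavity), the ratio of the two, here called the merit, varies as λ^{-1/2}. The function name is illustrative.

```python
def relative_merit(wavelength, reference_wavelength):
    """Merit at `wavelength` relative to that at `reference_wavelength`.

    Merit is taken as (absorption to be measured) / (smallest detectable absorption),
    with the assumed scalings lambda**-2 and lambda**-1.5 respectively.
    """
    ratio = wavelength / reference_wavelength
    absorption = ratio ** -2.0
    smallest_detectable = ratio ** -1.5
    return absorption / smallest_detectable


# Going from 100 mm. to 1 mm. wave-length.
gain = relative_merit(1.0, 100.0)
```

The result is a tenfold change over a hundredfold range of wave-length, which is why the method is described as retaining roughly the same usefulness across the whole interval.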


2. Apparatus
Resonator 

A cylindrical cavity was used, shown in figure 1. It was tuned to resonance by
moving the plunger A by means of a micrometer head. The H_0 mode of resonance
was employed because it has the highest Q value for a given size of cavity; a special
system of coupling between cavity and wave-guide avoided the excitation of other
modes as far as possible. This coupling system is described by the authors elsewhere
(1946a). Further discrimination against unwanted modes was provided by a
vacuum-tight glass tube (B), containing water, attached to the back of the plunger. All
modes except the H_0 tend to leak out of the main cavity past the plunger, and are
damped by the strong absorption of the water.

To make the cavity vacuum tight, thin mica sheets were waxed over the coupling
holes on the outside of the resonator. The problem of moving the plunger inside the
vacuum was overcome by sealing the annular space (C) between the micrometer
spindle and its bushing with a viscous oil of low vapour pressure. By this means the
position of the plunger could be controlled with the full accuracy of the micrometer
screw; the thrust of the atmosphere on the spindle was sufficient to eliminate
backlash. Other joints in the resonator were waxed over or sealed with oil; the pressure
inside could be reduced below 10 -® mm. Hg by a mercury diffusion pump. 

Gas was admitted to the chamber through the hole (D) outside the high-frequency 
field. The higher pressures were measured on a mercury manometer; a McLeod 
gauge was used for those below 2 mm. 

The plunger travel was sufficient to give at least the first two H_0 resonances at
all the frequencies employed. The plunger movement needed to tune between the
half-power points was about 0.005 mm.; the corresponding micrometer head rotation




was measured by a lamp and scale method which gave an over-all magnification 
of the plunger movement of 12,000. 



Figure 1. Resonator.


Wave-meter 

The H_0 cavity used for wave-length measurement was coupled in the same way
as the experimental cavity, but it was larger in diameter. This makes the readings 
less sensitive to machining errors in the barrel. Inaccuracies in the barrel diameter 
and in the micrometer head made roughly equal contributions to the over-all error 
in the wave-length value; this was believed accurate to 1 part in 5000. 















Oscillator and detector

The power source was a reflexion klystron designed and constructed in this 
laboratory. The power available on the output side of the resonator was so small 
that a crystal detector had to be used. The accuracy with which attenuation can be 
measured depends on the extent to which the detector output is proportional to the 
incident power. The response characteristics of several crystals were compared 
with that of a thermocouple, whose reading should give a true indication of power. 
These experiments indicated that the error introduced in the attenuation values 
by deviations of the crystals from the square law should be less than 5 %. 

An attenuator in the resonator output feeder served to adjust the power at the 
detector to a convenient value. 


3. Experimental

At each wave-length the Q value of the resonator was measured by determining
the movement of the plunger required to tune the resonator between the half-power
points. With the optical magnification this corresponded to deflexions of between
6 and 20 cm. and the average of a number of readings was taken. The scatter in these
readings was usually a few per cent, caused mainly by fluctuations in the frequency
of the oscillator. A check on the accuracy of this determination was obtained by
admitting to the resonator a quantity of ammonia sufficient to diminish the Q
appreciably. The new value of Q was measured, together with the power fed through
the cavity at resonance before and after admitting the ammonia. The ratio of the
square root of these powers should equal the ratio of the two values of Q (equation
(2) above). In general the difference between the two ratios was within the
experimental error, and an average value of the change in Q was used to calculate the
absorption. Measurements of the absorption coefficients at a number of pressures
were made, the smaller absorptions being determined from the power change
registered by the crystal detector. The absorption coefficients at the various standard
pressures (e.g. 4.5, 1.2, 0.5 mm. Hg) were obtained by interpolation on a graph of
absorption coefficient against pressure.

Measurements at small frequency intervals were made to delineate each absorption
line at pressures of 1.2 and 0.5 mm. Hg. At the latter pressure most of the lines
were resolved well enough to make possible an accurate determination of their
breadths. A few lines were separated only at a still lower pressure (0.2 mm.).

Whenever possible, the exact centre of each line was found by a brief examination
of the absorption at a pressure of less than 1/10 mm. The value thus determined was
in close agreement with that deduced from the absorption curve for 0.5 mm. pressure.

4. The spectrum

The absorption curve obtained at a pressure of 1.2 mm. Hg is shown in figure 2.
The measurements cover the wave-number band from 0.65 to 0.87 cm⁻¹, which
includes all the more intense lines. The coordinates are given both in terms of




absorption coefficient per centimetre path (the usual spectroscopic unit) and of
decibels per kilometre. The latter unit is customary in radio engineering and is
particularly convenient for the small absorptions to be expected at these
wave-lengths. The conversion factor is given by

1 db./km. = 10⁶ (log₁₀ e) × absorption coefficient/cm.
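The conversion can be written out as a short calculation. If the power falls as exp(-αx) with α in cm⁻¹, the loss over 1 km. (10⁵ cm.) expressed in decibels is 10 log₁₀ e × 10⁵ α, which gives the factor 10⁶ log₁₀ e. A minimal sketch in Python follows; the function names are illustrative.

```python
import math

# Decibels per kilometre corresponding to unit absorption coefficient per cm.
DB_PER_KM_PER_UNIT_ABSORPTION = 1.0e6 * math.log10(math.e)


def db_per_km(absorption_per_cm):
    """Attenuation in db./km. for a given absorption coefficient per cm."""
    return DB_PER_KM_PER_UNIT_ABSORPTION * absorption_per_cm


def absorption_per_cm(attenuation_db_per_km):
    """Absorption coefficient per cm for a given attenuation in db./km."""
    return attenuation_db_per_km / DB_PER_KM_PER_UNIT_ABSORPTION


# The smallest detectable absorption quoted earlier, 2e-6 per cm.
smallest = db_per_km(2.0e-6)
```

On this scale the smallest detectable absorption of 2 × 10⁻⁶ per cm. is a little under 1 db./km., which illustrates why the engineering unit is convenient here.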



Figure 2. The spectrum of ammonia near 0.8 cm⁻¹.


The widths of the lines shown in figure 2 are about 10⁻³ cm⁻¹. The widths to be
expected from radiation damping ('natural line breadth') and the Doppler effect
are respectively some and 10 “® cm.~^. Thus, even at a pressure of 1-2 mm. Hg, 
only collision broadening of the line need be considered, and the lines would be 
expected to narrow still further at lower pressures. This has been confirmed using 
pressures down to 1/10 mm. Hg. The actual breadths of the lines indicate a collision 
diameter several times greater than the kinetic theory value. This may be attributed 
to the strong electric field of the ammonia dipole, and will be discussed in another 
paper. 

Identification of the lines

The theory of the inversion splitting of the energy levels of the ammonia molecule
has been considered by Dennison & Uhlenbeck (1932). Using the Wentzel-Kramers-
Brillouin approximation, they showed that the ratio of the splitting Δ to the normal
vibration frequency ν should vary exponentially with the area cut from the potential
hump by the vibrational energy level, being thus a delicate function of the height
of the pyramid. Only in the case of a very flat pyramidal molecule, such as ammonia,
is the splitting in the ground vibrational state large enough to be detected, and in
this case the slight centrifugal distortion of the molecule in the higher rotational
energy levels may cause a considerable change in the splitting, and therefore in the
inversion frequency.



The effect of centrifugal distortion can be seen qualitatively from the following 
considerations. When the molecule is rotating mainly about its symmetry axis, the 
hydrogen atoms tend to fly outwards, thus decreasing the height of the pyramid and 
increasing the inversion frequency. Rotation about an axis perpendicular to the 
symmetry axis will have the opposite effect. The energies associated "with these two 
rotations depend on and + respectively, and, by an extension of 

Dennison & Uhlenbeck's treatment, Hsi-Yin Sheng, Barker & Dennison (1941)
have shown that ν should satisfy the relation

ν = ν_0 - A(J² + J) + BK².

They have examined the fundamental vibrational band at 10 μ and have shown that
their observations can be represented by a formula of this kind. They have also
calculated the constants A and B, using the potential energy curve proposed by
Manning (1935) and obtain values in satisfactory agreement with the experimental
determinations. By a similar computation, they find for the ground level of the
vibration

ν = ν_0 - 0.0011(J² + J) + 0.0010K² cm⁻¹.

In attempting to find a formula of this kind which would adequately represent
the system of lines of figure 2, it was assumed that the series of strong lines at the
high frequency end of the spectrum corresponded to the levels $J = K$ which have
both large statistical weights and large transition probabilities.

The fact that the levels for which $K$ is a multiple of three have twice the a priori
statistical weight (due to nuclear spin) of the others suggested that the lines at
0.797 and 0.836 cm.$^{-1}$ must be identified as $(J, K) = (3, 3)$ and $(6, 6)$. This yielded
the values

$$A = 0.0050 \ \text{cm.}^{-1}, \qquad B = 0.0070 \ \text{cm.}^{-1}, \qquad \nu_0 = 0.794 \ \text{cm.}^{-1}.$$

With these constants the positions of the remaining lines could be predicted, and 
fair agreement was found with the frequencies of the lines found experimentally. 
The discrepancies between the predicted and observed values increased steadily 
as the observations advanced towards the lower frequencies. This was not surprising,
for $(\nu - \nu_0)$ amounted to 20 % of $\nu_0$ at the lower limit of the frequency band.
In the calculation of Hsi-Yin Sheng, Barker & Dennison the formula for the splitting
includes only the first order term in the expansion of an exponential, and other
effects which would yield second-order terms are omitted. In the ground vibrational
state the fractional change in the splitting is so large that the second-order term
from the exponential will be considerable, and it can be included without the
introduction of further unknown constants. The formula*

$$\nu = 0.7935 - 0.00505(J^2 + J) + 0.00704K^2 + 0.63\{-0.0050(J^2 + J) + 0.0070K^2\}^2 \ \text{cm.}^{-1} \ \text{(vacuo)}$$

* The value for $\nu_0$ given here (0.7935 cm.$^{-1}$) differs slightly from that in a preliminary report
(Bleaney & Penrose 1946 b) where, owing to an error in correcting the wave numbers to vacuum,
a value of 0.7940 cm.$^{-1}$ was given.





B. Bleaney and R. P. Penrose 

was adopted as giving the best fit with the experimental results. In table 1 the
observed frequencies of the lines are given, together with the values (in italics) calculated
from this formula.
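The adopted formula can be evaluated directly. The sketch below does so in Python and also illustrates one reading of the coefficient 0.63: if the splitting is taken as $\nu_0 \exp(\delta/\nu_0)$ with $\delta = -A(J^2+J) + BK^2$, the second-order term of the expansion is $\delta^2/(2\nu_0)$, and $1/(2\nu_0) \approx 0.63$ for $\nu_0 = 0.7935$ cm.$^{-1}$. The function name and this exponential reading are assumptions made for illustration; the text states only that the second-order term comes from the exponential.

```python
NU0 = 0.7935  # cm^-1, constant term of the adopted formula


def inversion_wavenumber(J, K):
    """Wave-number (cm^-1, vacuo) from the adopted three-constant formula."""
    first_order = -0.00505 * (J * J + J) + 0.00704 * K * K
    # The squared term is printed with rounded coefficients; they are kept as printed.
    rounded = -0.0050 * (J * J + J) + 0.0070 * K * K
    return NU0 + first_order + 0.63 * rounded ** 2


# Assumption: 0.63 is the factor 1/(2*nu0) of a second-order exponential expansion.
second_order_factor = 1.0 / (2.0 * NU0)

if __name__ == "__main__":
    print("1/(2 nu0) =", round(second_order_factor, 3))
    for J, K in [(1, 1), (3, 3), (4, 4), (6, 6)]:
        print((J, K), round(inversion_wavenumber(J, K), 4))
```

For the lines with $J = K$ this reproduces the calculated entries of table 1 to within about 0.0001 cm.$^{-1}$. For lines far from $\nu_0$ the rounded coefficients inside the square make small differences in the last figure.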

The calculated frequencies for the K = 0 states are included for comparison with 
the value of the inversion splitting found by Wright & Randall in their experiments 
on the rotational spectrum (vide infra). 


Table 1. Wave-numbers of lines in the spectrum

For each J the row marked obs. gives the observed wave-numbers and the row marked
calc. (italics in the original) the values calculated from the formula

$$\nu = 0.7935 - 0.00505(J^2 + J) + 0.00704K^2 + 0.63\{-0.0050(J^2 + J) + 0.0070K^2\}^2 \ \text{cm.}^{-1} \ \text{(vacuo)}$$

 J          K=0       K=1       K=2       K=3       K=4       K=5       K=6       K=7       K=8       K=9
 0  calc.   (0.7935)
 1  obs.              0.7904
    calc.   (0.7834)  0.7904
 2  obs.              0.7706    0.7913
    calc.   (0.7638)  0.7705    0.7913
 3  obs.              0.7418    0.7617    0.7964
    calc.   (0.7352)  0.7417    0.7617    0.7963
 4  obs.              0.7050    0.7240    0.7569    0.8052
    calc.   (0.6989)  0.7050    0.7239    0.7568    0.8052
 5  obs.              0.6616    0.6795    0.7100    0.7556    0.8183
    calc.   (0.6564)  0.6624    0.6797    0.7103    0.7555    0.8184
 6  obs.                                  0.6590    0.7003    0.7583    0.8360
    calc.   (0.6098)  0.6149    0.6308    0.6587    0.7002    0.7582    0.8359
 7  obs.                                                      0.6939    0.7648    0.8581
    calc.                                                     0.6938    0.7640    0.8582
 8  obs.                                                                0.6910    0.7750
    calc.                                                               0.6909    0.7751
 9  obs.                                                                          0.6916    0.7892
    calc.                                                                         0.6925    0.7895
10  obs.                                                                                    0.6954    0.8076
    calc.                                                                                   0.6954    0.8083
11  obs.                                                                                              0.7031
    calc.                                                                                             0.7029


The agreement between the experimental and calculated values is very good.
The greatest discrepancy (0.0008 cm.$^{-1}$) occurs for the line (5,1), for which even the
third order term in the exponential would be appreciable. The agreement leaves
little doubt as to the correctness of the identification of the lines, which is confirmed
by comparison of the observed and theoretical intensities (see below).

Since the work described in this paper was completed another investigation of 
the ammonia spectrum has been briefly reported by Good (1946 a). The narrow lines 
obtained at a pressure of 1/10 mm. Hg or less were displayed on a cathode ray tube 
screen. The frequencies of the individual lines are not given, except in one case,
23,878 Mcyc./sec. This is presumably the line (3,3) and is in excellent agreement with
our value of 0.7964 ± 0.0001 cm.$^{-1}$, which corresponds to 23,876 ± 3 Mcyc./sec. The
formula suggested to represent the wave-numbers of the lines is

$$\nu = 0.7932 - 0.0048(J^2 + J - K^2) + 0.0020K^2.*$$

The difference between the coefficients in this expression and the corresponding
coefficients in ours may be due to adjustment of the values to give the best fit on all
28 lines observed without the use of a second-order term. This would lead to the
choice of a smaller numerical value for the negative coefficient of $(J^2 + J - K^2)$,
since the second-order term is essentially positive. There is no doubt that the
identification of the lines is the same as made by us.
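The two numerical comparisons in this paragraph can be checked with a few lines of Python. The sketch converts a vacuum wave-number to Mcyc./sec. and rewrites a formula of the type $-A(J^2+J) + BK^2$ in the form $-a(J^2+J-K^2) + bK^2$, which is the form of the expression quoted from Good. The function names are hypothetical, and the modern value of the velocity of light is an assumption (the value used in the paper is not stated).

```python
C_CM_PER_SEC = 2.99792458e10  # velocity of light in cm./sec. (modern value, assumed)


def wavenumber_to_mcyc(nu_cm):
    """Convert a vacuum wave-number in cm^-1 to a frequency in Mcyc./sec."""
    return nu_cm * C_CM_PER_SEC / 1.0e6


def to_good_form(A, B):
    """Rewrite -A(J^2+J) + B K^2 as -a(J^2+J-K^2) + b K^2 and return (a, b)."""
    return A, B - A


if __name__ == "__main__":
    print(round(wavenumber_to_mcyc(0.7964)), "Mcyc./sec. for the line (3,3)")
    print(round(wavenumber_to_mcyc(0.0001), 1), "Mcyc./sec. per 0.0001 cm^-1")
    a, b = to_good_form(0.00505, 0.00704)
    print("a =", round(a, 5), " b =", round(b, 5))
```

On this rewriting the first-order coefficients of the three-constant formula become 0.00505 and 0.00199, to be set against the 0.0048 and 0.0020 of the quoted expression.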

The experimental values of the constants $A$, $B$ of the splitting are considerably
greater than those calculated by Sheng, Barker & Dennison. Their calculation rests
on the potential energy curve for the ammonia molecule proposed by Manning
(1935). This curve has no firm foundation, and was adopted to obtain an exact
solution of the wave-equation, the constants being adjusted to fit the energy levels
with the observed vibrational frequencies. It gives satisfactory values for the
inversion splitting of the vibrational levels, and for the fine structure of the 10 $\mu$
vibrational band. Modification of this potential curve, to give better correlation
with the results reported in this paper, would presumably be confined to the region 
where the potential energy is not much greater than the energy of the ground 
vibrational level. This would not affect the agreement with the previous observations. 

5. Magnitude of the absorption

Since the experimental errors in the measurement of the absorption coefficients 
are not expected to amount to more than a few per cent, an accurate comparison 
with the values predicted by the quantum mechanical intensity formula is possible. 
This formula is (cf. Van Vleck & Weisskopf 1945)

since $h\nu \ll kT$ in this region. Here $N_{JK}$ is the number of molecules per c.c. occupying
the level with rotational quantum numbers $(J, K)$ and $|\mu_{JK}|^2$ is the transition
probability for the inversion. Dennison (1931) has shown that

$$|\mu_{JK}|^2 = \frac{\mu^2 K^2}{J^2 + J},$$

* Note added in proof. In a later paper Good (1946 b) gives the formula

$$\nu = 0.79347 - 0.005048(J^2 + J) + 0.007040K^2 + 0.00001546(J^2 + J)^2 - 0.00004260(J^2 + J)K^2 + 0.00002920K^4,$$

where all the constants are empirical. The close agreement between this formula and that
proposed by us, which involves only three empirical constants, has been pointed out elsewhere
(Bleaney & Penrose 1946 c). The accuracy of determination of the wave-numbers of the lines
claimed by Good (± 0.02 %) is the same as in our experiments.
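The agreement mentioned in this note can be made explicit by expanding the squared term of the three-constant formula and comparing its coefficients with the three empirical second-order constants. The following Python sketch does this; the function name is hypothetical, and the coefficients 0.0050, 0.0070 and 0.63 are those of the squared term as printed.

```python
def expand_second_order(A, B, factor):
    """Coefficients of (J^2+J)^2, (J^2+J)K^2 and K^4 in factor*(-A(J^2+J) + B K^2)^2."""
    return factor * A * A, -2.0 * factor * A * B, factor * B * B


# Second-order constants of the six-constant empirical formula quoted in the note.
GOOD_SECOND_ORDER = (0.00001546, -0.00004260, 0.00002920)

if __name__ == "__main__":
    ours = expand_second_order(0.0050, 0.0070, 0.63)
    for mine, empirical in zip(ours, GOOD_SECOND_ORDER):
        ratio = mine / empirical
        print("%.8f  %.8f  ratio %.3f" % (mine, empirical, ratio))
```

The three expanded coefficients come out within a few per cent of the empirical ones, with the same signs.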



where $\mu$ is the permanent dipole moment of ammonia, for which the value
$1.44 \times 10^{-18}$ e.s.u. is taken in these calculations.
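As a small illustration of the quantities entering the intensity calculation, the Python sketch below evaluates $|\mu_{JK}|^2 = \mu^2 K^2/(J^2+J)$, the form usually quoted for the inversion transition of a symmetric top and assumed here, together with the doubling of the a priori weight when $K$ is a multiple of three. The function names are hypothetical, and the Boltzmann factors needed for absolute intensities are not included.

```python
MU_ESU = 1.44e-18  # permanent dipole moment in e.s.u., the value taken in the text


def inversion_dipole_squared(J, K, mu=MU_ESU):
    """|mu_JK|^2 = mu^2 K^2 / (J^2 + J); it vanishes for K = 0."""
    if J < 1 or K < 0 or K > J:
        raise ValueError("need J >= 1 and 0 <= K <= J")
    return mu * mu * K * K / (J * J + J)


def nuclear_spin_weight(K):
    """Relative a priori weight for K >= 1: doubled when K is a multiple of three."""
    if K < 1:
        raise ValueError("defined here only for K >= 1")
    return 2 if K % 3 == 0 else 1


if __name__ == "__main__":
    for J, K in [(3, 3), (3, 2), (6, 6), (5, 1)]:
        strength = inversion_dipole_squared(J, K) / MU_ESU ** 2
        print((J, K), round(strength, 3), nuclear_spin_weight(K))
```

For $J = K$ the ratio $|\mu_{JK}|^2/\mu^2$ is $J/(J+1)$, which is why the lines with $J = K$ carry the largest transition probabilities.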

For comparison with the experimental results two methods have been used. The 
first of these involves a determination of the total absorption due to a number of 
lines; from equation (4) it is obvious that this requires evaluation of the area
contained under a curve of $(\gamma/\nu^2)$ against $\nu$. This area is then to be compared with the
theoretical value obtained by summation of the appropriate values of $N_{JK}|\mu_{JK}|^2$
for the lines whose wave-numbers lie in the range of integration. In computing the
area under the experimental curve greater accuracy is obtained by using a curve
determined at a pressure where the individual lines are smoothed over by the
pressure broadening. The pressure should not be too high, however, for it is desirable
to keep most of the absorption within the band of frequencies covered by the
experiments. 

The absorption curve measured at 4.5 mm. pressure was chosen as the most suitable
and the area between $\nu$ = 0.67 and 0.87 cm.$^{-1}$ was found to be

$$\int \frac{\gamma}{\nu^2}\, d\nu = 44.5 \ \text{db./km.cm.}$$

A simple calculation indicated that this figure should be increased by 4 % for the
tails at the high and low ends of the frequency band, giving 46.3 db./km.cm. This
figure includes, of course, the tails of those lines whose centres lie outside the band,
but their contribution is less than 1 % and can be neglected. The value calculated
from the statistical weights and transition probabilities is 46.5 db./km.cm.

The exact agreement of this and the experimental value is somewhat fortuitous, 
as a difference of several per cent would not lie outside the experimental errors. The 
latter are however mainly random in nature and the process of computing the 
integral has a similar effect to taking the mean of a large number of observations. 

A second comparison between the theoretical and experimental absorption 
coefficients can be obtained from the intensities of individual lines. This requires 
that the lines be sufficiently resolved to make possible a measurement of the total 
area under each line separately, or, what is equivalent, measurement of the breadths 
of the lines at half intensity. From the latter and the absorption at the centre of the 
line the total intensity can be found, assuming the shape of the absorption curve to 
be given by the usual simple formula 

In this formula $\Delta\nu$ is half the breadth of the line at half intensity, i.e. half of what
is usually termed the 'breadth' in spectroscopy. We shall refer to it as the
'line-breadth constant'.
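The connexion between the peak absorption, the line-breadth constant and the total area can be illustrated numerically. The Python sketch below assumes that the 'usual simple formula' is the Lorentz shape $\gamma(\nu) = \gamma_0\,\Delta\nu^2/\{(\nu-\nu_c)^2 + \Delta\nu^2\}$, for which the area under the line is $\pi\gamma_0\Delta\nu$. The shape, the function names and the sample values are assumptions made for illustration.

```python
import math


def lorentz_line(nu, centre, gamma0, half_breadth):
    """Assumed line shape: gamma0 at the centre, gamma0/2 at centre +/- half_breadth."""
    return gamma0 * half_breadth ** 2 / ((nu - centre) ** 2 + half_breadth ** 2)


def area_by_trapezium(centre, gamma0, half_breadth, span=2000.0, steps=200000):
    """Trapezium-rule area over centre +/- span*half_breadth."""
    lo = centre - span * half_breadth
    hi = centre + span * half_breadth
    h = (hi - lo) / steps
    total = 0.5 * (lorentz_line(lo, centre, gamma0, half_breadth)
                   + lorentz_line(hi, centre, gamma0, half_breadth))
    for i in range(1, steps):
        total += lorentz_line(lo + i * h, centre, gamma0, half_breadth)
    return total * h


if __name__ == "__main__":
    centre, gamma0, half_breadth = 0.7964, 360.0, 4.6e-4  # illustrative values
    numeric = area_by_trapezium(centre, gamma0, half_breadth)
    print(numeric, math.pi * gamma0 * half_breadth)
```

The numerical area falls slightly short of $\pi\gamma_0\Delta\nu$ only because the slowly decaying tails are cut off at a finite distance from the centre.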

To obtain rather better resolution of the lines than at 1.2 mm. pressure (figure 2),
a pressure of 0.5 mm. was chosen at which to make a detailed examination of each
line. At this pressure all but a few lines were well resolved; these few were examined
at lower pressures (e.g. 0.2 mm.) to determine their exact frequencies, though not
their breadths. A typical absorption curve is shown in figure 3, the readings of the
micrometer head being plotted as abscissae. The limit of reading of the micrometer
head corresponds to a frequency change of about 0.00002 cm.$^{-1}$; thus small changes
of wave-number can be measured with a probable error of this order. The micrometer
reading is linearly related to the length $l$ of the cavity, and changes in the reading
can be reduced to changes in wave-number by the relation

$$\frac{\Delta\nu}{\Delta\lambda_g} = \frac{n\,\Delta\nu}{\Delta l} = \frac{1}{\nu\lambda_g^3},$$

where $\lambda_g$ is the wave-length in the cavity, determined from the positions of successive
resonances, and $n$ is the number of full wave-lengths in the cavity at the particular
resonance used for the examination of the line. Both $\nu$ and $\lambda_g$ and, therefore, the
conversion factor are known to a few parts in $10^4$.
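A conversion factor of the form $1/(\nu\lambda_g^3)$ follows if the wave-number and the wave-length in the cavity are connected by the ordinary wave-guide relation $\nu^2 = 1/\lambda_g^2 + 1/\lambda_c^2$, where $\lambda_c$ is the cut-off wave-length. The Python sketch below checks this by numerical differentiation. The wave-guide relation, the function names and the cut-off value of 1.75 cm. are assumptions made for illustration and are not taken from the paper. The derivative itself is negative (the wave-number falls as $\lambda_g$ increases), so the factor is compared in magnitude.

```python
import math


def wavenumber(lambda_g, lambda_c):
    """Free-space wave-number (cm^-1) for guide wave-length and cut-off in cm."""
    return math.sqrt(1.0 / lambda_g ** 2 + 1.0 / lambda_c ** 2)


def conversion_factor(lambda_g, lambda_c):
    """Magnitude of d(nu)/d(lambda_g), equal to 1/(nu * lambda_g^3)."""
    nu = wavenumber(lambda_g, lambda_c)
    return 1.0 / (nu * lambda_g ** 3)


def numerical_derivative(lambda_g, lambda_c, h=1.0e-6):
    """Central-difference estimate of d(nu)/d(lambda_g)."""
    return (wavenumber(lambda_g + h, lambda_c)
            - wavenumber(lambda_g - h, lambda_c)) / (2.0 * h)


if __name__ == "__main__":
    lambda_g, lambda_c = 2.0, 1.75  # cm.; the cut-off value is hypothetical
    print(wavenumber(lambda_g, lambda_c))
    print(conversion_factor(lambda_g, lambda_c), numerical_derivative(lambda_g, lambda_c))
```

A change $\Delta l$ in the cavity length corresponds to $\Delta\lambda_g = \Delta l/n$, so the same factor divided by $n$ converts micrometer readings to wave-numbers.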

Figure 3. Detail of spectrum at lower pressure (0.5 mm.) showing improved resolution.
(Abscissa: wave-length in cavity, mm.)


In table 2 the experimental values of the absorption coefficient ($\gamma_0$) at the centre
of the line, and the line-breadth constant ($\Delta\nu$), are given for some seventeen lines
(columns 2 and 3). In the fourth column is shown the intensity $I_{JK}$ calculated
from the formula

$$I_{JK} = \pi\,\frac{\gamma_0}{\nu^2}\,\Delta\nu\,(10^{-5}V_0) \qquad (V_0 = \text{molar volume at 0.50 mm. pressure}),$$

where the constant $10^{-5}V_0$ represents a reduction from a kilometre path to a path
length such that a mole of gas would be contained per sq.cm. of cross-section. The
units of $I_{JK}$ are therefore decibels per mole. The corresponding values of $I_{JK}$
calculated from the theoretical formula (4) are given in column 5, and do not deviate
from the experimental values by more than ± 6 %. The measurements of the
line-breadth constants, which range from $2 \times 10^{-4}$ to $5 \times 10^{-4}$ cm.$^{-1}$, are liable to errors
of the order ± 5 %. To compare the averages, the values of columns 4 and 5 have
been summed separately; they agree very closely, showing that there is no consistent
deviation.
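The reduction of a measured peak absorption and line-breadth constant to an intensity per mole can be sketched as follows. The Python example assumes the intensity has the form $\pi(\gamma_0/\nu^2)\Delta\nu\,(10^{-5}V_0)$ and takes $V_0$ from the ideal-gas law at 0.5 mm. Hg. The temperature of 290 K and the function names are assumptions, since the temperature of the measurements is not given in this section.

```python
import math


def molar_volume_cc(pressure_mm_hg, temperature_k):
    """Ideal-gas molar volume in c.c. (22,414 c.c. at 760 mm. Hg and 273.15 K)."""
    return 22414.0 * (760.0 / pressure_mm_hg) * (temperature_k / 273.15)


def line_intensity(gamma0_db_per_km, half_breadth_cm, nu_cm, v0_cc):
    """pi * (gamma0 / nu^2) * delta_nu * 1e-5 * V0, in decibels per mole."""
    return (math.pi * gamma0_db_per_km / nu_cm ** 2
            * half_breadth_cm * 1.0e-5 * v0_cc)


if __name__ == "__main__":
    v0 = molar_volume_cc(0.5, 290.0)  # assumed temperature
    # Line (3,3): gamma0 = 360 db./km., delta_nu = 4.6e-4 cm^-1, nu = 0.7964 cm^-1.
    print(line_intensity(360.0, 4.6e-4, 0.7964, v0))
    # Line (4,4): gamma0 = 190 db./km., delta_nu = 4.6e-4 cm^-1, nu = 0.8052 cm^-1.
    print(line_intensity(190.0, 4.6e-4, 0.8052, v0))
```

With the assumed temperature the two results lie within a few per cent of the tabulated experimental intensities 289 and 149 db./mol.; a slightly lower temperature would bring them closer.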


Table 2. Comparison of measured and calculated intensities

(J, K)    $\Delta\nu$ at p = 0.5 mm.    $\gamma_0$         $I_{JK}$ (exp.)    $I_{JK}$ (calc.)
          ($\times 10^{-4}$ cm.$^{-1}$)      (db./km.)    (db./mol.)     (db./mol.)

 2, 1     2.6                   51           25             27
 3, 1     2.4                   29           14.5           14
 3, 2     3.2                   100          62.5           50
 3, 3     4.6                   360          289            289
 4, 4     4.6                   190          149            151
 5, 1     1.8                   7            3.3            3.5
 5, 2     2.6                   20           13             15
 5, 3     3.3                   96           71             73.6
 5, 5     4.7                   186          148            137
 6, 3     3.1                   44           35.5           34
 6, 4     3.6                   42           35             34.5
 6, 6     4.7                   276          198            210
 7, 5     4.1                   28           27             27
 7, 6     3.9                   132          100            95.5
 8, 7     4.1                   44           34             32.5
10, 9     4.2                   35           25.5           23
11, 9     2.9                   8.6          5.7            6.4

Total                                        1237           1236


The good agreement between the theoretical and experimental values of the
absorption coefficients by both methods of comparison is in marked contrast with
results obtained in the infra-red, where the experimental difficulties generally
preclude accurate measurement of intensity. In the case of ammonia these difficulties
were largely overcome by Foley & Randall (1941); from the intensity of the line at
270 cm.$^{-1}$ they deduced a value of 1.30 Debye units for the permanent dipole
moment. If the measurements reported in this paper are regarded as yielding a
dipole moment, the value 1.44 ± 0.01 Debye units is obtained. The values obtained
by the usual methods are 1.44 (Zahn 1926), 1.48 (Watson 1927), 1.44 (Keyes &
Kirkwood 1930) and 1.437 (Van Itterbeek & De Clippeleier 1946).

6. Comparison with Wright and Randall 

The most intense lines in the spectrum described in this paper lie in the
neighbourhood of 0.8 cm.$^{-1}$. On the other hand, Wright & Randall (1933) found that the
splitting of the 4th, 5th and 6th rotational lines in the far infra-red (which should be
twice the inversion frequency) was 1.33 cm.$^{-1}$. There is no real discrepancy between
these two values, however, because the transition probabilities are different for
rotational lines ($\Delta J = \pm 1$) and the lines in the long-wave spectrum for which
$\Delta J = 0$. In fact the lines corresponding to $K = 0$, which are forbidden in the
inversion spectrum, are those with the greatest intensity in the rotation spectrum.
The inversion frequencies (table 1), observed or predicted by the splitting formula,
have been used to calculate the mean splitting of the rotational transition $J = 5$ to
$J = 6$, the lines corresponding to different values of $K$ being weighted according to
their statistical weights and transition probabilities. The value thus obtained was
1.35 cm.$^{-1}$, in good agreement with that observed by Wright & Randall. Only the
mean values can be compared, since there is an additional splitting of the $K$
sub-levels due to centrifugal distortion which must also be considered in reconstructing
the shape of the unresolved lines observed by Wright & Randall.

We wish to make acknowledgement to Lord Cherwell for extending to us the 
facilities of his laboratory; and to Mr K. W. H. Stevens for his kind help with the 
experiments. The work was conducted under the auspices of the Board of Admiralty, 
to whom we are indebted for permission to publish this paper. 


References

Autler, Becker & Kellogg 1946 Phys. Rev. 69, 694.
Bleaney & Penrose 1946 a Proc. Phys. Soc. (awaiting publication).
Bleaney & Penrose 1946 b Nature, 157, 339.
Bleaney & Penrose 1946 c Phys. Rev. 70, 775.
Cleeton & Williams 1934 Phys. Rev. 45, 234.
Dennison 1931 Rev. Mod. Phys. 3, 280.
Dennison & Uhlenbeck 1932 Phys. Rev. 41, 313.
Foley & Randall 1941 Phys. Rev. 59, 171.
Good 1946 a Phys. Rev. 69, 539.
Good 1946 b Phys. Rev. 70, 213.
Hsi-Yin Sheng, Barker & Dennison 1941 Phys. Rev. 60, 786.
Keyes & Kirkwood 1930 Phys. Rev. 36, 1570.
Manning 1935 J. Chem. Phys. 3, 136.
Van Itterbeek & De Clippeleier 1946 Physica, 12, 97.
Van Vleck & Weisskopf 1945 Rev. Mod. Phys. 17, 227.
Watson 1927 Proc. Roy. Soc. A, 117, 43.
Wright & Randall 1933 Phys. Rev. 44, 391.
Zahn 1926 Phys. Rev. 27, 455.



Infinite irreducible representations of the Lorentz group 

By Harish-Chandra, Gonville and Caius College, Cambridge
(Communicated by P. A. M. Dirac, F.R.S. Received 8 August 1946)

It is shown that corresponding to every pair of complex numbers $\kappa$, $\kappa^*$ for which $2(\kappa - \kappa^*)$ is
real and integral, there exists, in general, one irreducible representation $\mathfrak{D}_{\kappa,\kappa^*}$ of the Lorentz
group. However, if $4\kappa$, $4\kappa^*$ are both real and integral there are two representations $\mathfrak{D}^{+}_{\kappa,\kappa^*}$
and $\mathfrak{D}^{-}_{\kappa,\kappa^*}$ associated to the pair $(\kappa, \kappa^*)$. All these representations are infinite except
$\mathfrak{D}^{-}_{\kappa,\kappa^*}$, which is finite if $2\kappa$, $2\kappa^*$ are both integral. For suitable values of $(\kappa, \kappa^*)$, $\mathfrak{D}_{\kappa,\kappa^*}$ is
unitary. $U$ and $V$ matrices similar to those given by Dirac (1936) and Fierz (1939) are
introduced for these infinite representations. The extension of Dirac's expansor formalism to
cover half-integral spins is given. These new quantities, which are called expinors, bear the
same relation to spinors as Dirac's expansors to tensors. It is shown that they can be used to
describe the spin properties of a particle in accordance with the principles of quantum
mechanics.


1. Introduction

All the finite irreducible representations of the Lorentz group are well known. Every
such representation is characterized by an ordered pair of numbers $p$, $q$ such that
$2p$ and $2q$ are integral and $\geq 0$. None of these representations, however, is unitary.
Dirac (1945) has recently drawn attention to the existence of some unitary, though
infinite, representations with a view to their possible physical applications. The
present paper is concerned with the investigation of the general irreducible
representations of the proper Lorentz group. It is found that such a representation can,
in general, be characterized by an ordered pair $(\kappa, \kappa^*)$ of complex numbers such that
$2(\kappa - \kappa^*)$ is an integer† and may therefore be denoted by $\mathfrak{D}_{\kappa,\kappa^*}$. However, in case
both $\kappa$ and $\kappa^*$ are of the form $\tfrac14 n$ $(n \neq -2)$ there exist two irreducible representations
$\mathfrak{D}^{+}_{\kappa,\kappa^*}$ and $\mathfrak{D}^{-}_{\kappa,\kappa^*}$ corresponding to the pair $(\kappa, \kappa^*)$. The finite representations
correspond to $\mathfrak{D}^{-}_{\kappa,\kappa^*}$ with $\kappa$, $\kappa^*$ both of the form $\tfrac12 n$ $(n \neq -1)$, the connexion between $(p, q)$
and $(\kappa, \kappa^*)$ being given by $p = |\kappa + \tfrac12| - \tfrac12$, $q = |\kappa^* + \tfrac12| - \tfrac12$. In general $\mathfrak{D}_{\kappa,\kappa^*}$ is
unitary only if either 

/c + iv+ \n, /c* = + \n, (1 a) 

or K = = \^\<h ( 16 ) 

Here v is an arbitrary real number. However, in the special case when /c, are both 
of the form ln{n^ - 2 ), is unitary only if | /c + || = | /c* + ^ |, while 
is unitary whenever | /c4* J ] = 1 /c*4-||. 

It is possible to construct a series in the complex variables $u_1$, $u_2$ and their complex
conjugates $\bar{u}_1$, $\bar{u}_2$ such that the coefficients of this series transform according to the
irreducible representation $\mathfrak{D}_{\kappa,\kappa^*}$ when a spinor transformation corresponding to a
given Lorentz transformation is applied to the variables $u_1$, $u_2$ and $\bar{u}_1$, $\bar{u}_2$. These
coefficients therefore form the components of a vector of the mathematical vector
space $\mathfrak{R}_{\kappa,\kappa^*}$ which transforms according to the representation $\mathfrak{D}_{\kappa,\kappa^*}$. It is clear that
these vectors of $\mathfrak{R}_{\kappa,\kappa^*}$ bear the same relation to ordinary spinors as the expansors of
Dirac to ordinary tensors. They may therefore be called expinors. Thus to every series
in $u_1$, $u_2$, $\bar{u}_1$, $\bar{u}_2$ which is convergent in a certain sense, there corresponds an expinor.
This expinor transforms according to the representation $\mathfrak{D}_{\kappa,\kappa^*}$ if the series is
homogeneous in each of the two sets of variables $(u_1, u_2)$ and $(\bar{u}_1, \bar{u}_2)$ and has the degrees
$2\kappa$ and $2\kappa^*$ respectively. Such an expinor may therefore be called a homogeneous
expinor. In case both $2\kappa$ and $2\kappa^*$ are integral and $\geq 0$, the corresponding homogeneous
expinor is the same as the symmetric spinor with $2\kappa$ undotted and $2\kappa^*$ dotted
indices. Homogeneous expinors are thus an extension of symmetric spinors.

† The term 'integer' is used throughout to mean a real integer whether positive, negative
or zero. The symbol $n$ always denotes an integer, while $s$ denotes an integer or half odd
integer $\geq 0$.

Further, as pointed out by Dirac ( 1945 ), it is possible to use expinors to describe 
the transformation properties of the wave function of a spinning particle. In a 
theory based on these expinors it is possible to make the charge density positive 
definite for particles of integral spin or the energy density positive definite for 
particles of half-integral spin, in contradistinction with the results of the existing 
theory (see Pauli 1940 ). This is made possible by the circumstance that infinite 
unitary representations of the Lorentz group exist for both integral and
half-integral spins. By imposing subsidiary conditions on the wave function, it can be
arranged that in the rest system the particle has a 'pure' spin $s$, i.e. all the
non-vanishing components of the wave function in the rest system transform according
to the irreducible representation of the rotation group.

2. The general representation

The finite representations of the Lorentz group are in fact obtained from those of
the four-dimensional real orthogonal group by a device which is called the 'unitarian
trick' by Weyl (1939). It consists essentially in replacing in a representation three
of the real parameters of the orthogonal group by pure imaginary ones. However,
this device does not yield all the possible representations of the Lorentz group. This
arises from the fact that the topological structure of the orthogonal group is not the
same as that of the Lorentz group, the difference consisting in the aperiodic nature of
the 'temporal rotations'. In consequence, unlike the orthogonal group, the Lorentz
group is not compact.

We begin with the consideration of the infinitesimal transformations, which, as is 
well known, play a very important part in the theory of Lie groups. Such a Lorentz 
transformation is given by 

$$x'_k = x_k + \epsilon_{kl}\,x^l, \qquad \epsilon_{kl} = -\epsilon_{lk}, \qquad (2)$$

the $\epsilon_{kl}$ being infinitesimal quantities. (The usual metric tensor $g_{kl}$ is used for raising
and lowering the tensor indices, $g_{00} = -g_{11} = -g_{22} = -g_{33} = 1$, the other components
being zero.) In any representation this transformation will be represented by

1 + 


(3) 



where $I^{kl}$ are operators of the representation which may always be chosen to be
antisymmetric in $k$, $l$. The commutation relations of $I^{kl}$ are completely fixed by the
structure of the Lorentz group (see Van der Waerden 1932) and are

[= ^ghmJlnji^^mJ^knj^gknpm^glnJ^krifh^ 

where $[A, B] = AB - BA$. Put

K^ = iP^ K^ = iP\ L^ = il^, L^ = iP^. (5) 

From the definition (2) it is clear that $I^{kl}$ transform as components of an
antisymmetrical tensor, under Lorentz transformations. Therefore it follows from (5) that
for spatial rotations both $(K_1, K_2, K_3)$ and $(L_1, L_2, L_3)$ transform as components of
a three-dimensional vector. These two vectors will be denoted by $\mathbf{K}$ and $\mathbf{L}$
respectively. In terms of $\mathbf{K}$ and $\mathbf{L}$ (4) can be written as

$$[K_1, K_2] = iK_3, \qquad (6a)$$

$$[K_1, L_2] = iL_3 = [L_1, K_2], \qquad (6b)$$

$$[L_1, L_2] = -iK_3, \qquad (6c)$$

the remaining relations being obtained by cyclic permutations of (1, 2, 3).
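Commutation relations of this type, $[K_1, K_2] = iK_3$, $[K_1, L_2] = iL_3 = [L_1, K_2]$ and $[L_1, L_2] = -iK_3$ with cyclic permutations, can be checked on the simplest example. The Python sketch below uses the two-dimensional spinor representation with $\mathbf{K} = \tfrac12\boldsymbol{\sigma}$ and $\mathbf{L} = \tfrac12 i\boldsymbol{\sigma}$, where $\boldsymbol{\sigma}$ are the Pauli matrices. This choice, including the sign of $\mathbf{L}$, is an assumption made for illustration and is not taken from the paper.

```python
def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][n] * b[n][j] for n in range(2)) for j in range(2)]
            for i in range(2)]


def scale(c, a):
    return [[c * x for x in row] for row in a]


def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(2)] for i in range(2)]


def close(a, b, tol=1e-12):
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))


SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
K = [scale(0.5, s) for s in SIGMA]    # spatial rotations (Hermitian)
L = [scale(0.5j, s) for s in SIGMA]   # assumed form for the remaining generators


def relations_hold():
    """Check the three types of relation for all cyclic permutations of (1, 2, 3)."""
    for a in range(3):
        b, c = (a + 1) % 3, (a + 2) % 3
        if not (close(commutator(K[a], K[b]), scale(1j, K[c]))
                and close(commutator(K[a], L[b]), scale(1j, L[c]))
                and close(commutator(L[a], K[b]), scale(1j, L[c]))
                and close(commutator(L[a], L[b]), scale(-1j, K[c]))):
            return False
    return True


if __name__ == "__main__":
    print(relations_hold())
```

In this finite representation $\mathbf{L}$ is anti-Hermitian, in keeping with the statement that none of the finite representations is unitary.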

Consider now an irreducible representation of the proper Lorentz group. The
matrices $I^{kl}$ then form an irreducible set. Denote the representation space by $\mathfrak{R}$.
Reduce this space with respect to the subgroup $\mathfrak{d}_3$ consisting of spatial rotations
only. Now $\mathfrak{d}_3$ is a compact group, and it is well known that every representation of
a compact group is completely reducible into a direct sum of irreducible
representations each of which is unitary and of a finite degree (see Pontrjagin 1939, p. 206).
Now every irreducible representation of $\mathfrak{d}_3$ is characterized by a number $k \geq 0$ which
is integral or half-integral. Thus $\mathfrak{R}$ is decomposed into a direct sum of subspaces $\mathfrak{R}_k$
which are all irreducible with respect to $\mathfrak{d}_3$. It will be assumed† that in this
decomposition there occurs at most only one subspace $\mathfrak{R}_k$ for any particular value of $k$.
By choosing a suitable coordinate system in $\mathfrak{R}_k$, $K_3$ can be brought to the diagonal
form with its eigenvalues $m$ running through the numbers $k, k-1, \ldots, -k+1, -k$.
Further, this coordinate system in $\mathfrak{R}_k$ can always be so chosen that the matrix
elements of $K_+$ and $K_-$ are given by

$$(m\,|\,K_+\,|\,m-1) = (m-1\,|\,K_-\,|\,m) = \sqrt{\{(k+m)(k-m+1)\}}, \qquad (7)$$

where $$K_+ = K_1 + iK_2, \qquad K_- = K_1 - iK_2. \qquad (8)$$

Thus $K_1$, $K_2$, $K_3$ are represented by Hermitian matrices and

$$\mathbf{K}^2 = k(k+1). \qquad (9)$$
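The matrices of $\mathbf{K}$ inside one subspace can be built explicitly from matrix elements of the familiar form $(m\,|\,K_+\,|\,m-1) = \sqrt{\{(k+m)(k-m+1)\}}$. The Python sketch below constructs them for a given $k$ and checks that $K_1^2 + K_2^2 + K_3^2 = k(k+1)$ and $[K_1, K_2] = iK_3$. The function names and the ordering of the basis ($m = k, k-1, \ldots, -k$) are choices made here for illustration.

```python
import math


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][r] * b[r][j] for r in range(n)) for j in range(n)]
            for i in range(n)]


def k_matrices(two_k):
    """Return (K1, K2, K3) for k = two_k/2 in the basis m = k, k-1, ..., -k."""
    k = two_k / 2.0
    dim = two_k + 1
    ms = [k - i for i in range(dim)]
    k_plus = [[0.0] * dim for _ in range(dim)]
    for i in range(dim - 1):
        m = ms[i]  # row m, column m - 1
        k_plus[i][i + 1] = math.sqrt((k + m) * (k - m + 1))
    k_minus = [[k_plus[j][i] for j in range(dim)] for i in range(dim)]
    k1 = [[0.5 * (k_plus[i][j] + k_minus[i][j]) for j in range(dim)]
          for i in range(dim)]
    k2 = [[-0.5j * (k_plus[i][j] - k_minus[i][j]) for j in range(dim)]
          for i in range(dim)]
    k3 = [[ms[i] if i == j else 0.0 for j in range(dim)] for i in range(dim)]
    return k1, k2, k3


def max_deviation(two_k):
    """Largest error in K^2 = k(k+1) and [K1, K2] = i K3 for the given k."""
    k = two_k / 2.0
    dim = two_k + 1
    k1, k2, k3 = k_matrices(two_k)
    s1, s2, s3 = matmul(k1, k1), matmul(k2, k2), matmul(k3, k3)
    c12, c21 = matmul(k1, k2), matmul(k2, k1)
    worst = 0.0
    for i in range(dim):
        for j in range(dim):
            target = k * (k + 1) if i == j else 0.0
            worst = max(worst, abs(s1[i][j] + s2[i][j] + s3[i][j] - target))
            worst = max(worst, abs(c12[i][j] - c21[i][j] - 1j * k3[i][j]))
    return worst


if __name__ == "__main__":
    for two_k in range(0, 6):
        print(two_k / 2.0, max_deviation(two_k))
```

For $k = \tfrac12$ the construction gives $\tfrac12$ times the Pauli matrices.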

† Note added in proof. It is possible to avoid this assumption. The proof is then
somewhat more complicated but the final result is the same.

Now since $\mathbf{L}$ transforms as a vector under spatial rotations it follows (see, for
example, Weyl 1931, p. 200) that $(k\,|\,\mathbf{L}\,|\,k') = 0$ unless $k' = k$ or $k \pm 1$. In fact, the
matrix elements of $\mathbf{L}$ are completely fixed by its vector character, apart from a
factor depending on $k$ (but not on $m$). They are given by Weyl (1931),

$$\begin{aligned}
(k, m\,|\,L_+\,|\,k-1, m-1) &= -p^-(k)\sqrt{\{(k+m)(k+m-1)\}},\\
(k, m-1\,|\,L_-\,|\,k-1, m) &= p^-(k)\sqrt{\{(k-m+1)(k-m)\}},\\
(k, m\,|\,L_3\,|\,k-1, m) &= p^-(k)\sqrt{\{(k+m)(k-m)\}},
\end{aligned} \qquad (10a)$$

$$\begin{aligned}
(k, m\,|\,L_+\,|\,k, m-1) &= p(k)\sqrt{\{(k+m)(k-m+1)\}},\\
(k, m-1\,|\,L_-\,|\,k, m) &= p(k)\sqrt{\{(k-m+1)(k+m)\}},\\
(k, m\,|\,L_3\,|\,k, m) &= p(k)\,m,
\end{aligned} \qquad (10b)$$

$$\begin{aligned}
(k, m\,|\,L_+\,|\,k+1, m-1) &= p^+(k)\sqrt{\{(k-m+1)(k-m+2)\}},\\
(k, m-1\,|\,L_-\,|\,k+1, m) &= -p^+(k)\sqrt{\{(k+m)(k+m+1)\}},\\
(k, m\,|\,L_3\,|\,k+1, m) &= p^+(k)\sqrt{\{(k+m+1)(k-m+1)\}},
\end{aligned} \qquad (10c)$$
the remaining elements vanishing. Here $p^-(k)$, $p(k)$ and $p^+(k)$ are numerical
coefficients depending on $k$, and $L_+ = L_1 + iL_2$, $L_- = L_1 - iL_2$. Now notice that it is
possible to construct two invariants of the proper Lorentz group from J**, viz. 

— and I = (e*'*’"” is a tensor antis 3 rmmetric in all the 

indices and $\epsilon^{0123} = 1$). Since they are left unaltered by a proper Lorentz
transformation, $J^2$ and $I$ commute with $I^{kl}$ and therefore in an irreducible representation they
are represented by a numerical constant. Now

$$I = K_1L_1 + K_2L_2 + K_3L_3 = (\mathbf{K}\mathbf{L}), \qquad (11a)$$

$$J^2 = \mathbf{K}^2 - \mathbf{L}^2 = k(k+1) - (L_+L_- + L_3^2 + K_3). \qquad (11b)$$

Substituting from (7) and (10) in (11) one gets

$$k(k+1)\,p(k) = I, \qquad (12a)$$

$$J^2 = k(k+1)\{1 - p^2(k)\} - m\{1 + p^2(k)\} - p^+(k)\,p^-(k+1)\,(k-m+1)(2k+3) - p^-(k)\,p^+(k-1)\,(k+m)(2k-1), \qquad (12b)$$

it being understood that $p^-(k) \equiv 0$ if $k = 0, \tfrac12$. For $k \neq 0$ there are at least two distinct
values of $m$ for which (12b) holds. Hence the coefficient of $m$ must vanish for $k \neq 0$,
i.e.

$$\{1 + p^2(k)\}\,k = p^+(k)\,p^-(k+1)\,k(2k+3) - p^-(k)\,p^+(k-1)\,k(2k-1), \qquad (13a)$$

$$J^2 = k(k+1)\{1 - p^2(k)\} - p^+(k)\,p^-(k+1)\,(k+1)(2k+3) - p^-(k)\,p^+(k-1)\,k(2k-1). \qquad (13b)$$

The factor $k$ has been introduced deliberately in (13a) so as to include the exceptional
case $k = 0$. From (13) it follows that

$$p^+(k)\,p^-(k+1) = \frac{k(k+2) - k^2p^2(k) - J^2}{(2k+1)(2k+3)}. \qquad (14)$$

Now let $j$ be the lowest value of $k$ appearing in the decomposition of $\mathfrak{R}$ into $\mathfrak{R}_k$.
First consider the case when $j \neq 0$. Bearing in mind that $p^-(j) = 0$ one obtains from
(13) and (12a) on putting $k = j$

$$J^2 = j^2 - 1 - \frac{I^2}{j^2}. \qquad (15)$$

Hence from (14)

$$p^+(k)\,p^-(k+1) = \frac{(k-j+1)(k+j+1)}{(2k+1)(2k+3)}\left\{1 + \frac{I^2}{(k+1)^2 j^2}\right\}. \qquad (16)$$

On the other hand, if $j = 0$ then from (12a) $I = 0$ and so $p(k) = 0$, since one can
always put $p(0) = 0$ without affecting the representation at all. Therefore from (14)

$$p^+(k)\,p^-(k+1) = \frac{(k+1)^2}{(2k+1)(2k+3)}\left\{1 - \frac{1 + J^2}{(k+1)^2}\right\}. \qquad (17)$$
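The algebra of this step can be checked with exact rational arithmetic. The Python sketch below takes the relations in the form read here, namely $p(k) = I/\{k(k+1)\}$, $(2k+1)(2k+3)\,p^+(k)p^-(k+1) = k(k+2) - k^2p^2(k) - J^2$ and $J^2 = j^2 - 1 - I^2/j^2$, and verifies that they give the factorized product $(k-j+1)(k+j+1)\{1 + I^2/((k+1)^2j^2)\}/\{(2k+1)(2k+3)\}$. The function names are hypothetical, and $I^2$ is passed as a single rational number so that imaginary $I$ can be included.

```python
from fractions import Fraction


def casimir_j2(j, i_squared):
    """J^2 = j^2 - 1 - I^2/j^2, for j != 0."""
    return j * j - 1 - i_squared / (j * j)


def product_from_general_relation(k, j, i_squared):
    """p+(k) p-(k+1) from k(k+2) - k^2 p(k)^2 - J^2 with p(k) = I/(k(k+1))."""
    p_squared = i_squared / (k * (k + 1)) ** 2
    numerator = k * (k + 2) - k * k * p_squared - casimir_j2(j, i_squared)
    return numerator / ((2 * k + 1) * (2 * k + 3))


def product_factorized(k, j, i_squared):
    """The factorized form of the same product."""
    return ((k - j + 1) * (k + j + 1) / ((2 * k + 1) * (2 * k + 3))
            * (1 + i_squared / ((k + 1) ** 2 * j * j)))


if __name__ == "__main__":
    j = Fraction(1, 2)
    i_squared = Fraction(-9, 16)  # two-dimensional spinor representation: I = 3i/4
    print(casimir_j2(j, i_squared))
    print(product_factorized(j, j, i_squared))
```

For $j = \tfrac12$ and $I^2 = -\tfrac{9}{16}$ the product vanishes at $k = \tfrac12$, so that the chain of subspaces stops there, as it must for the finite two-dimensional representation. The value $J^2 = \tfrac32$ agrees with $\mathbf{K}^2 - \mathbf{L}^2$ evaluated with $\mathbf{K} = \tfrac12\boldsymbol{\sigma}$, $\mathbf{L} = \tfrac12 i\boldsymbol{\sigma}$.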


(17) may be regarded as a special case of (15) and (16) for $j = 0$ provided one agrees
to look upon $I$ as $j\mu$ with $\mu = I/j$ as a definitely given number. This makes $I$ vanish
automatically when $j = 0$. Also from (12a)

$$p(k) = \frac{j\mu}{k(k+1)}, \qquad (18)$$

with the understanding that $p(0) \equiv 0$. By multiplying all the eigenvectors in each of
the subspaces $\mathfrak{R}_k$ by a suitable normalization factor $q(k)$ it can always be arranged
that

$$p^+(k) = p^-(k+1) = \sqrt{\frac{(k-j+1)(k+j+1)}{(2k+1)(2k+3)}}\;\sqrt{1 + \frac{\mu^2}{(k+1)^2}}. \qquad (19)$$

For convenience the first square root is supposed to be always positive. The second
one may then be chosen at will for each $k$. In particular, if the representation of the
Lorentz group is unitary, $\mathbf{K}$ and $\mathbf{L}$ are Hermitian so that $p(k)$ is real and

$$p^+(k) = \overline{p^-(k+1)},$$

where the bar denotes complex conjugate. The normalization factors $q(k)$ are then
unimodular, and the corresponding transformation of $\mathfrak{R}$ is unitary. Therefore every
unitary representation of the Lorentz group can be brought into a form for which
(19) holds, by means of a unitary transformation. Since unitary transformations
preserve the unitary character it follows that the transformed representation is
also unitary. Now (16) and (18) and (19) show that, apart from equivalence, an
irreducible representation is completely fixed by the two numbers $j$ and $\mu$. Also as
shown above a unitary representation can be transformed to the form (19) without
destroying its unitary character. Hence all irreducible representations which are
equivalent to a unitary representation are those for which the right-hand sides of
(18) and (19) are real. Thus for a representation to be unitary it is necessary and
sufficient that $I = \mu j$ be real and

The condition (20) is actually effective only when j = 0. 


(20) 





3. The spinor matrices 

To proceed further we must now introduce spinors. For the moment consider only
the rotation group $\mathfrak{d}_3$. Then, it is well known that with every vector of the
three-dimensional space there can be associated a symmetrical spinor with two indices.
Thus to the vector $\mathbf{K} = (K_1, K_2, K_3)$ corresponds a spinor $K_{\alpha\beta}$ defined† by

= K^t = K- ( 21 ) 

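As usual the antisymmetrical spinors $\epsilon^{\alpha\beta}$ and $\epsilon_{\alpha\beta}$ ($\epsilon^{12} = \epsilon_{12} = 1$) will be used for
raising and lowering the indices according to the rule

$$a_\beta = a^\alpha \epsilon_{\alpha\beta}. \qquad (22)$$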

Now it is well known that $K_{\alpha\beta}(k)$ (which is the matrix representing $K_{\alpha\beta}$ in $\mathfrak{R}_k$)
can be expressed in terms of the spinor matrices $u_\alpha(k)$, $v_\alpha(k)$ introduced by Dirac
(1936) and Fierz (1939). $u_1(k)$, $u_2(k)$ have $2k+1$ rows and $2k$ columns and $v_1(k)$, $v_2(k)$
have $2k$ rows and $2k+1$ columns. Their properties may briefly be summarized as


u^(k)v“{k) = 2k, t^{k)u^{k) = 2k+l, (23a) 

+ i) «“(*) = *’“(^) + i) = 0, (236) 

v“{k) = u^(k), ujjo) = v^{k), (23 c) 

vjfi + i) u^{k +■!) - Ufiik) vjk) = (23 A) 

K^p{k) = ujjc) Vf{k) -ke^p = vjjc + 1) Up{k -l- ^) - 1 - (i -l-1) e„^. (23e) 


The bar denotes Hermitian conjugate. It is easy to show that on account of the
Hermitian character of $\mathbf{K}$, $u_\alpha(k)$ and $v_\beta(k)$ can always be chosen to satisfy (23c).
Notice that (23c) is a proper spinor equation, since for spatial rotations a lower
spinor index transforms the same way as the complex conjugate of an upper spinor
index.

Now define two matrices $U^\alpha$ and $V^\alpha$ as follows:

$$(k\,|\,U^\alpha\,|\,k-\tfrac12) = u^\alpha(k), \qquad (k-\tfrac12\,|\,V^\alpha\,|\,k) = v^\alpha(k), \qquad (24)$$

all the other elements being zero. Then clearly (23) can be written as

C4F» = 2Jfc, F“i7„ = 2(i:+1), (26a) 

£417“ = F“^ = 0 , (266) 

F“=C4. ■ (25c) 

= (25d) 

= UJfl-ke,p = F.17^+ {k+l)e,p. ' (26e) 

Here $k$ is the diagonal matrix whose component inside $\mathfrak{R}_{k'}$ is simply $k'$ times the
unit matrix. Notice that from (24)

$$V^\alpha k = (k + \tfrac12)V^\alpha, \qquad (26a)$$

$$U^\alpha k = (k - \tfrac12)U^\alpha. \qquad (26b)$$


† Spinor indices are always denoted by Greek letters and tensor indices by Latin letters.





Now construct similarly a spinor corresponding to the vector L, 


■i'la — •i'21 “ 


= L 


22 


-L+. 


Then (6) can be written as 


[■K«y}, ==jsS S €xaK^fii 

(afi) (.Xu) 

~ ^X/tl ~ k ^ 

iap) (A/i) 

[L„^,Lx,]==-i 8 8ex„K^p 

{afi) {\p) 


(27) 

(28o) 

(286) 

(28c) 


where 8 denotes that terms arising from the interchange of a and ^ are to be added. 

= -JL“^4^ = L^ (29a) 


= (KL) = L 

Now it follows from (10) and (19) that 


(296) 


jfc(i! + l) 


•^ayj + 


i IJtMttlL l(iA.^\uu 


As before, the matrix square root 


1)(2A+1) 

{h-j){k->r3) 


{k-3) (*+j) 




(30) 


IJtz 

V (2fc- 


1)(2A+1)/ 

is always taken to be positive 


•1)(2A + 1) 

while defined arbitrarily. To verify (30) observe that (286) follows from 

the spinor character of while (28 c) is easily obtained on using (30) and (24). 
Similarly (296) and (15) are also easily verified. This fixes the j and /t value of the 
representation uniquely, and therefore as shown in § 2, (29) must lead to the same 
representation, apart from equivalence, as (18) and (19), i.e. if ujjc) and vj.k) are 
suitably chosen the two representations must coincide. 

Now put I^fi = \{K^p-iLap), '(31 o) 

and notice that = i 8 8 (316) 


= 8exJ,^, 

(«/?)(A>.) 








(31c) 


Consider the product space $\mathfrak{R}\times\mathfrak{R}_{1/2}$, where $\mathfrak{R}_{1/2}$ is a two-dimensional space
transforming according to the fundamental spinor representation of the Lorentz
group. The components of a vector in this space may be written as $\psi_\alpha$ ($\alpha = 1, 2$),
where $\psi_1$ and $\psi_2$ are one-column matrices enumerating the components in $\mathfrak{R}$. With
respect to the index $\alpha$, $\psi_\alpha$ transforms as a spinor. Now define



(32a) 


(326) 





Infinite irreducible representations of the Lorentz group 

Then&om(26) ■ (32o) 

Now put = (33) 

From (31 a) and (30) = a{k) + b{h) UJJ^ + VJj,b{k), (34a) 

wh«™ Ki,).y-|r2i(|g^y(n.g). (846) 

On using (25), (26), (32) and (34) one finds that

= «(A + i)(* + |)f+&(i' + i)V{2i!(2* + 2)}^^^ (3Sa) 

and similarly 

f*'=?7^^P^li^A = -«(*-i)(*-i)?^^*-6(*+i)V{2*(2^+2)}^- (366) 

(35) may be looked upon as a linear transformation in the two 'variables' $\psi$ and $\psi^*$.
The eigenvalues of this transformation are easily obtained on using (34b). They are
$\tfrac12(1\pm\gamma)$, where $\gamma$ is a complex number defined, except for an ambiguity in sign, by
the equation

$\gamma^2 = (j - iI/j)^2.$ (36)

Write $\gamma_{k+\frac12} = k+\tfrac12 - \dfrac{iI}{k+\frac12}$ and put

= fiyk+i + 7)t+^iyk+i-7)t*> (37a) 

V2 fy+i = V(rfc+i -y)f+^{yk+i+y)t*> (37 ft) 

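the square roots being so chosen that

$\sqrt{(\gamma_{k+\frac12}+\gamma)}\,\sqrt{(\gamma_{k+\frac12}-\gamma)} = \sqrt{\{(k+\tfrac12-j)(k+\tfrac12+j)\}}\,\sqrt{\left(1+\dfrac{I^2}{j^2(k+\frac12)^2}\right)}.$ (37c)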


From (36) and (37) f2^y_^sf{yj^^+y)if''+f{yi^i-y)ir*' 

= U^+y)f^fy-v (33<*) 

Similarly f2 =f{y]c+i -y)t'+fiyk+i+y)f*' 

= i(l-r)V2V^r+i- (33^) 

Thus the transformation (37) brings (35) to the diagonal form. Also it follows from
(37) that

f2y^ = V(rft+i+r) fy-x-fiyk+i-y) ty+v (39a) 

^J2yi^* = -^{yt.+i-y)fy.^+^J(yk+i + y)fy+v' ■ ( 396 ) 

Now corresponding to (36) define $\gamma^*$ by the equation

$\gamma^{*2} = (j + iI/j)^2,$ (40)





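and write $\gamma = 2\kappa+1$, $\gamma^* = 2\kappa^*+1$. Then

$iI = \tfrac14(\gamma^{*2}-\gamma^2) = \kappa^*(\kappa^*+1)-\kappa(\kappa+1) = (\kappa^*-\kappa)(\kappa^*+\kappa+1),$ (41)

$\gamma_{k+\frac12} = k+\tfrac12-\dfrac{iI}{k+\frac12} = \dfrac{(k+\frac12)^2-(\kappa^*+\frac12)^2+(\kappa+\frac12)^2}{k+\frac12}.$

Hence

$\gamma_{k+\frac12}+\gamma = \dfrac{(k+\kappa+1)^2-(\kappa^*+\frac12)^2}{k+\frac12} = \dfrac{(k+\kappa+\kappa^*+\frac32)(k+\kappa-\kappa^*+\frac12)}{k+\frac12},$ (42a)

$\gamma_{k+\frac12}-\gamma = \dfrac{(k-\kappa)^2-(\kappa^*+\frac12)^2}{k+\frac12} = \dfrac{(k-\frac12-\kappa-\kappa^*)(k+\frac12-\kappa+\kappa^*)}{k+\frac12}.$ (42b)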
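Relation (41) between $I$ and the pair $(\kappa, \kappa^*)$ can be checked numerically. The following is a minimal sketch, assuming $j \neq 0$ and one particular resolution of the sign ambiguities in (36) and (40), namely $\gamma = j - iI/j$ and $\gamma^* = -(j + iI/j)$, chosen so that $\kappa - \kappa^* = j$. The function names are hypothetical.

```python
def kappa_pair(j, I):
    """Return (kappa, kappa_star) for given j != 0 and I.

    Assumed sign choice: gamma = j - iI/j (one root of (36)) and
    gamma_star = -(j + iI/j) (one root of (40)), so that
    kappa - kappa_star = j.
    """
    gamma = j - 1j * I / j
    gamma_star = -(j + 1j * I / j)
    return (gamma - 1) / 2, (gamma_star - 1) / 2


def iI_from_kappas(kappa, kappa_star):
    """Right-hand side of (41): (kappa* - kappa)(kappa* + kappa + 1)."""
    return (kappa_star - kappa) * (kappa_star + kappa + 1)


kappa, kappa_star = kappa_pair(1.0, 2.0)
# (41) should give back i * I for the values used above
assert abs(iI_from_kappas(kappa, kappa_star) - 2j) < 1e-12
```

With the opposite sign for $\gamma^*$ the same value of $iI$ results, but then it is $\kappa + \kappa^* + 1$ rather than $\kappa - \kappa^*$ that equals $j$.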

Put Then it follows from (37) and (32) that 

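$\psi_{\kappa-\frac12} = \mathfrak{V}^\alpha(\kappa,\kappa^*)\,\psi_\alpha,$ (43a)

$\psi_{\kappa+\frac12} = \mathfrak{U}^\alpha(\kappa+\tfrac12,\kappa^*)\,\psi_\alpha,$ (43b)

where $\mathfrak{V}^\alpha$ and $\mathfrak{U}^\alpha$ are defined by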


k{k+i) 


S8«^(jc,;c*) = iF“ y[ 


{k + ^--K + K*)(Jc—\~-K — K*) 


k{k + i) 

1 ll{k+l + K+K*){k + K — K*) 

■‘‘iVi k{k' 


k{k + i) 


I U«. (446) 


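Further from (32c), (39) and (44) it is found that

$(2\kappa+1)\,\psi_\alpha = \mathfrak{U}_\alpha(\kappa,\kappa^*)\,\psi_{\kappa-\frac12} - \mathfrak{V}_\alpha(\kappa+\tfrac12,\kappa^*)\,\psi_{\kappa+\frac12}.$ (45)

When $\gamma = 2\kappa+1 \neq 0$ the two eigenvalues $\tfrac12(1\pm\gamma)$ of (35) are distinct and hence the
two 'eigenvectors' $\psi_{\kappa-\frac12}$ and $\psi_{\kappa+\frac12}$ are linearly independent. Now from (43) and (45)

$(2\kappa+1)\,\psi_\alpha = \mathfrak{U}_\alpha(\kappa,\kappa^*)\,\mathfrak{V}^\beta(\kappa,\kappa^*)\,\psi_\beta - \mathfrak{V}_\alpha(\kappa+\tfrac12,\kappa^*)\,\mathfrak{U}^\beta(\kappa+\tfrac12,\kappa^*)\,\psi_\beta,$ (46a)

$(2\kappa+1)\,\psi_{\kappa-\frac12} = \mathfrak{V}^\alpha(\kappa,\kappa^*)\,\mathfrak{U}_\alpha(\kappa,\kappa^*)\,\psi_{\kappa-\frac12} - \mathfrak{V}^\alpha(\kappa,\kappa^*)\,\mathfrak{V}_\alpha(\kappa+\tfrac12,\kappa^*)\,\psi_{\kappa+\frac12},$ (46b)

$(2\kappa+1)\,\psi_{\kappa+\frac12} = \mathfrak{U}^\alpha(\kappa+\tfrac12,\kappa^*)\,\mathfrak{U}_\alpha(\kappa,\kappa^*)\,\psi_{\kappa-\frac12} - \mathfrak{U}^\alpha(\kappa+\tfrac12,\kappa^*)\,\mathfrak{V}_\alpha(\kappa+\tfrac12,\kappa^*)\,\psi_{\kappa+\frac12}.$ (46c)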

Also from (33), (38) and (43) 


so that from (46) 


(k+ 1) = SS«(k, k*) IJfft, 

-'qi^.+i = U“(/c+i/c*)I/^^, 


(47a) 

(476) 


(2/C+ \)IJfp = (/c+l)UjA:,/c*)f,_i+^58^(«+i,/c*)^,+j 

= (A:+l)U„(A:,/c*)25'’(A:,/c*)^^^+ic3SJ/c+J,A:*)tl^(/c + ^,/c*)f^. (48) 




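Now $\psi_\beta$ is completely arbitrary. Therefore from (46a) and (48)

$(2\kappa+1)\,\epsilon_\alpha{}^\beta = \mathfrak{U}_\alpha(\kappa,\kappa^*)\,\mathfrak{V}^\beta(\kappa,\kappa^*) - \mathfrak{V}_\alpha(\kappa+\tfrac12,\kappa^*)\,\mathfrak{U}^\beta(\kappa+\tfrac12,\kappa^*),$ (49a)

$(2\kappa+1)\,I_\alpha{}^\beta = (\kappa+1)\,\mathfrak{U}_\alpha(\kappa,\kappa^*)\,\mathfrak{V}^\beta(\kappa,\kappa^*) + \kappa\,\mathfrak{V}_\alpha(\kappa+\tfrac12,\kappa^*)\,\mathfrak{U}^\beta(\kappa+\tfrac12,\kappa^*).$ (49b)

Now when $\gamma = 2\kappa+1 \neq 0$ the transformation (37) is non-singular since it has an
inverse, namely, (39). Hence the correspondence between $\psi_\alpha$ and the pair
$(\psi_{\kappa-\frac12}, \psi_{\kappa+\frac12})$ is a one-to-one correspondence. Thus every arbitrarily chosen pair $(\psi_{\kappa-\frac12}, \psi_{\kappa+\frac12})$
corresponds to a unique $\psi_\alpha$, and therefore (46) and (47) hold for arbitrary $\psi_{\kappa-\frac12}$ and
$\psi_{\kappa+\frac12}$ if $2\kappa+1 \neq 0$. This implies that

$\mathfrak{V}^\alpha(\kappa,\kappa^*)\,\mathfrak{U}_\alpha(\kappa,\kappa^*) = 2\kappa+1, \qquad \mathfrak{V}^\alpha(\kappa,\kappa^*)\,\mathfrak{V}_\alpha(\kappa+\tfrac12,\kappa^*) = 0,$
$\mathfrak{U}^\alpha(\kappa+\tfrac12,\kappa^*)\,\mathfrak{U}_\alpha(\kappa,\kappa^*) = 0, \qquad \mathfrak{U}^\alpha(\kappa+\tfrac12,\kappa^*)\,\mathfrak{V}_\alpha(\kappa+\tfrac12,\kappa^*) = -2\kappa-1.$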

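After a little algebraic manipulation all these relations can be expressed as follows:

$\mathfrak{U}_\alpha(\kappa,\kappa^*)\,\mathfrak{V}^\alpha(\kappa,\kappa^*) = 2\kappa, \qquad \mathfrak{V}^\alpha(\kappa,\kappa^*)\,\mathfrak{U}_\alpha(\kappa,\kappa^*) = 2\kappa+1,$ (50a)

$\mathfrak{U}_\alpha(\kappa+\tfrac12,\kappa^*)\,\mathfrak{U}^\alpha(\kappa,\kappa^*) = \mathfrak{V}^\alpha(\kappa,\kappa^*)\,\mathfrak{V}_\alpha(\kappa+\tfrac12,\kappa^*) = 0,$ (50b)

$\mathfrak{V}_\alpha(\kappa+\tfrac12,\kappa^*)\,\mathfrak{U}_\beta(\kappa+\tfrac12,\kappa^*) - \mathfrak{U}_\beta(\kappa,\kappa^*)\,\mathfrak{V}_\alpha(\kappa,\kappa^*) = \epsilon_{\beta\alpha},$ (50c)

$I_{\alpha\beta} = \mathfrak{U}_\alpha(\kappa,\kappa^*)\,\mathfrak{V}_\beta(\kappa,\kappa^*) - \kappa\,\epsilon_{\alpha\beta} = \mathfrak{V}_\alpha(\kappa+\tfrac12,\kappa^*)\,\mathfrak{U}_\beta(\kappa+\tfrac12,\kappa^*) + (\kappa+1)\,\epsilon_{\alpha\beta}.$ (50d)

For the exceptional case $\kappa = -\tfrac12$ (50) can be verified directly from (44) and (34).
Thus the analogy between (50) and (23) is complete. The following relations are also
worth noting:

$\mathfrak{U}^\alpha(\kappa,\kappa^*) = \mathfrak{U}^\alpha(\kappa,-\kappa^*-1), \qquad \mathfrak{V}^\alpha(\kappa,\kappa^*) = \mathfrak{V}^\alpha(\kappa,-\kappa^*-1);$ (51a)

$\mathfrak{V}^\alpha(-\kappa-\tfrac12,\kappa^*) = \mathfrak{U}^\alpha(\kappa,\kappa^*), \qquad \mathfrak{U}^\alpha(-\kappa-\tfrac12,\kappa^*) = \mathfrak{V}^\alpha(\kappa,\kappa^*).$ (51b)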

Now put C7, = F“ = V«=U„ = iVi. (62a) 

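The equations (25a) to (25d) then hold unchanged with dotted indices provided these
are raised and lowered by means of the spinors $\epsilon^{\dot\alpha\dot\beta}$, $\epsilon_{\dot\alpha\dot\beta}$ ($\epsilon^{\dot1\dot2} = \epsilon_{\dot1\dot2} = 1$).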

Also = (626) 

= L^fi, (62 c) 

Put = (63a) 

Then [VA,a-». (“*) 


/y*2 — J 

= i(J’“+2iJ) = l{5+iiiY-\ = 

= x*(x*-|-l). 


(63d) 





Now consider the product space x Sftj where Oijis a two-dimensional space whose 
vectors transform as dotted spinors under transformations of the Lorentz group. 
As before, the components of a vector of the product space may be written as 


X uo '1 

(64a) 


(646) 


(64c) 

As before, consider and construct ^', ijr*'. Write 7^4,4 == 

put 

* + H^^|and 


V2v!^y.+i = V(r*+i-r*)i/^+V(rfc+i+r*)i^*. 


the square roots being so chosen that 

= V{(*+i-i) iJ‘+h+j)}J{^ +|i) • 

From (56) one finds that 

V2 =11(1 +y*) ^2 ^2 = ^(1 - y*) ^2 

Put and write 


fS“(K,K*) 



(k+K*-~K) 


) 


1 //(A + i-/f*-l-«:)(A-i-/c*-Ar)\ „ 

+W(- mw - r 


(66a) 


(«^iir-£L-^.)) 

V \ fe(AJ-h’l) / 


-iy{ 


(k-j-l+K + K*){k + K* — K)'\ 

k{k + ^) I 


then 


= B“(k, K*) 

= U“(/c,/c*+J)^if^, 

(2k* + l)f^ = n^(K, K*) - S8i(/C, i) 

Correspondir^ to (50), one gets 


U^ (666) 

(67 a) 
(676) 
(67c) 


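$\mathfrak{U}_{\dot\alpha}(\kappa,\kappa^*)\,\mathfrak{V}^{\dot\alpha}(\kappa,\kappa^*) = 2\kappa^*, \qquad \mathfrak{V}^{\dot\alpha}(\kappa,\kappa^*)\,\mathfrak{U}_{\dot\alpha}(\kappa,\kappa^*) = 2\kappa^*+1,$ (58a)

$\mathfrak{U}_{\dot\alpha}(\kappa,\kappa^*+\tfrac12)\,\mathfrak{U}^{\dot\alpha}(\kappa,\kappa^*) = \mathfrak{V}^{\dot\alpha}(\kappa,\kappa^*)\,\mathfrak{V}_{\dot\alpha}(\kappa,\kappa^*+\tfrac12) = 0,$ (58b)

$\mathfrak{V}_{\dot\alpha}(\kappa,\kappa^*+\tfrac12)\,\mathfrak{U}_{\dot\beta}(\kappa,\kappa^*+\tfrac12) - \mathfrak{U}_{\dot\beta}(\kappa,\kappa^*)\,\mathfrak{V}_{\dot\alpha}(\kappa,\kappa^*) = \epsilon_{\dot\beta\dot\alpha},$ (58c)

$I_{\dot\alpha\dot\beta} = \mathfrak{U}_{\dot\alpha}(\kappa,\kappa^*)\,\mathfrak{V}_{\dot\beta}(\kappa,\kappa^*) - \kappa^*\epsilon_{\dot\alpha\dot\beta} = \mathfrak{V}_{\dot\alpha}(\kappa,\kappa^*+\tfrac12)\,\mathfrak{U}_{\dot\beta}(\kappa,\kappa^*+\tfrac12) + (\kappa^*+1)\,\epsilon_{\dot\alpha\dot\beta},$ (58d)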





and U“(/c,/c*) = U“(-K-l,/c*), S*(/c,/c*) = ■ (69a) 

U“(/c, - AT* - J) = /c*), W{k, -k*-\) = n^{K, K*). (59 b) 

Now let the numbers y and y* be given for a particiilar representation. Then from 


(36) and (40) 


0 = 


ey+e*y* 




ey 




^*2- y 2 


(60) 


where $\epsilon = \pm1$, $\epsilon^* = \pm1$, the two ambiguities being independent. Since $2j$ is integral
and $\geq 0$ it follows that at least one of the two quantities $\gamma\pm\gamma^*$ must be an integer.
In case only one of these quantities is integral the value of $j$ is completely determined
from those of y and y*. This determines e and e* and therefore /t except in the case 
wheny = 0. But even in this case 1 and are completely determined. Since only 
j, I and /t* enter in the representation it follows that the representation is completely 
determined by the pair $(\gamma,\gamma^*)$. However, when both $\gamma+\gamma^*$ and $\gamma-\gamma^*$ are integral
and distinct there are two possible values of $j$, say $j^+$ and $j^-$, such that $j^+ > j^-$.
Therefore in this case there are two distinct representations corresponding to the
pair (y, y*). They may be denoted by S respectively, corresponding^to 

they valuey+.andy~. Thus a given pair (k, k*) completely determines the irreducible 
representation unless k, k* are both of the form («,+ — 2). It is easy to see that 

is always an infinite representation. finite only ify“ andy+ differ by 

an integer which is the case when both k and k* are of the form (»H= — !)■ Since 

reversing the sign of $\gamma$ or $\gamma^*$ does not alter the representation the four pairs $(\kappa,\kappa^*)$,
$(-\kappa-1,\kappa^*)$, $(\kappa,-\kappa^*-1)$, $(-\kappa-1,-\kappa^*-1)$ determine the same representation.
From (60) it follows that at least one of the two quantities $\kappa-\kappa^*$ and $\kappa+\kappa^*+1$ is of
the form $\tfrac12 n$ with $n$ an integer. It may therefore be fixed for convenience that $\kappa$, $\kappa^*$ are to be so chosen
that $\kappa-\kappa^*$ is an integer or half-odd integer. This is the only condition to be imposed
on the choice of $\kappa$, $\kappa^*$.
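That the four pairs carry the same invariants can be seen from $\gamma = 2\kappa+1$, $\gamma^* = 2\kappa^*+1$ together with $\gamma^2 = J^2+1-2iI$ and $\gamma^{*2} = J^2+1+2iI$, which give $\kappa(\kappa+1) = \tfrac14(J^2-2iI)$ and $\kappa^*(\kappa^*+1) = \tfrac14(J^2+2iI)$. The sketch below, with hypothetical function names, recovers $J^2$ and $I$ from a pair and compares the four equivalent pairs. It checks the invariants only, not the full equivalence of the representations.

```python
def invariants(kappa, kappa_star):
    """Return (J2, I) from kappa(kappa+1) = (J2 - 2iI)/4 and
    kappa*(kappa*+1) = (J2 + 2iI)/4."""
    a = kappa * (kappa + 1)
    b = kappa_star * (kappa_star + 1)
    J2 = 2 * (a + b)
    I = (b - a) / 1j
    return J2, I


def equivalent_pairs(kappa, kappa_star):
    """The four pairs obtained by reversing the sign of gamma or gamma*."""
    return [
        (kappa, kappa_star),
        (-kappa - 1, kappa_star),
        (kappa, -kappa_star - 1),
        (-kappa - 1, -kappa_star - 1),
    ]


reference = invariants(0.5 + 1.5j, -1.0 + 0.25j)
for pair in equivalent_pairs(0.5 + 1.5j, -1.0 + 0.25j):
    J2, I = invariants(*pair)
    assert abs(J2 - reference[0]) < 1e-12
    assert abs(I - reference[1]) < 1e-12
```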

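It is useful to examine the procedure which led to the matrices $\mathfrak{U}^\alpha$, $\mathfrak{V}^\alpha$ and $\mathfrak{U}^{\dot\alpha}$, $\mathfrak{V}^{\dot\alpha}$
a little more closely. From (16), (36) and (40)

$\gamma^2 = J^2+1-2iI, \qquad \gamma^{*2} = J^2+1+2iI.$ (61)

(61) may be used to define $\gamma$ and $\gamma^*$ as operators in terms of K and L even for those
reducible representations in which $J^2$ and $I$ are no longer represented by multiples
of the unit matrix. In particular in the product space $\mathfrak{R}\times\mathfrak{R}_{1/2}$ the infinitesimal
transformations $\mathfrak{K}$ and $\mathfrak{L}$ corresponding to K and L are given by

$\mathfrak{K} = \mathbf{K}+\tfrac12\boldsymbol{\sigma}, \qquad \mathfrak{L} = \mathbf{L}+\tfrac12 i\boldsymbol{\sigma},$

where $\boldsymbol{\sigma} = (\sigma_1, \sigma_2, \sigma_3)$ is the vector constructed from the three Pauli matrices
($\sigma_1\sigma_2 = i\sigma_3$). Denote by $I'$ and $J'^2$ the operators in the product space corresponding
to $I$ and $J^2$ so that

$I' = (\mathfrak{K}\mathfrak{L}) = (\mathbf{K}+\tfrac12\boldsymbol{\sigma})(\mathbf{L}+\tfrac12 i\boldsymbol{\sigma}) = I+\tfrac34 i+iA,$ (62a)

$J'^2 = \mathfrak{K}^2-\mathfrak{L}^2 = (\mathbf{K}+\tfrac12\boldsymbol{\sigma})^2-(\mathbf{L}+\tfrac12 i\boldsymbol{\sigma})^2 = J^2+\tfrac32+2A,$ (62b)

where $A = \tfrac12(\mathbf{K}\cdot\boldsymbol{\sigma}) - \tfrac12 i(\mathbf{L}\cdot\boldsymbol{\sigma}).$ (62c)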




Now notice that J. is an invariant since 


A = 


where and are the infinitesimal transformations of the spaces 91 and SKj 
respectively. Hence $A$ commutes with $\mathfrak{K}$ and $\mathfrak{L}$. Therefore in an irreducible subspace
of $\mathfrak{R}\times\mathfrak{R}_{1/2}$ it must be represented by a multiple of the unit matrix. From (62c) one
finds that

$A^2 = \tfrac14(J^2-2iI)-A,$

i.e.

$(A+\tfrac12)^2 = \tfrac14(J^2-2iI+1) = \tfrac14\gamma^2.$

So that the eigenvalues of $A$ are $-\tfrac12(1\pm\gamma)$. If $\Gamma$ and $\Gamma^*$ denote the operators of the
product space corresponding to $\gamma$ and $\gamma^*$ respectively, then

$\Gamma^{*2} = J'^2+2iI'+1 = J^2+1+2iI = \gamma^{*2},$ (63a)

$\Gamma^2 = J'^2-2iI'+1 = J^2+1-2iI+4A+3 = \gamma^2+4A+3.$ (63b)

Hence for the two irreducible subspaces corresponding to the two eigenvalues of $A$
one gets

$\Gamma^2 = (\gamma\mp1)^2.$


Hence one may choose $\Gamma^* = \gamma^*$, $\Gamma = \gamma\mp1$, the choice of sign being dependent on the
corresponding choice in the eigenvalues $-\tfrac12(1\pm\gamma)$ of $A$. $-A$ is precisely the
transformation (35) (the negative sign coming from the fact that the spinor

component = —-S^b)- Hence it is clear that and transform according . 
to the representations $(\kappa\pm\tfrac12,\kappa^*)$ and $(\kappa,\kappa^*\pm\tfrac12)$ respectively. In view of this it is more
logical to write $\psi(\kappa\pm\tfrac12,\kappa^*)$ and $\psi(\kappa,\kappa^*\pm\tfrac12)$ respectively. The $j$ values of these
representations are given by $|\kappa\pm\tfrac12-\kappa^*|$ and $|\kappa-\kappa^*\mp\tfrac12|$ respectively, if that of the
original representation was $|\kappa-\kappa^*|$.
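The algebra leading from $A^2 = \tfrac14(J^2-2iI)-A$ to $\Gamma^2 = (\gamma\mp1)^2$ can be verified numerically for scalar values of the invariants. The sketch below is only such a check. It takes the principal square root for $\gamma$, which is an arbitrary resolution of the sign ambiguity, and the function name is hypothetical.

```python
import cmath


def check_A_eigenvalues(J2, I):
    """For scalar invariants J2 and I return, for each eigenvalue of A,
    the tuple (A, residual, Gamma2, expected), where residual should
    vanish and Gamma2 should equal expected = (gamma -/+ 1)^2.
    """
    gamma = cmath.sqrt(J2 + 1 - 2j * I)  # gamma^2 = J2 + 1 - 2iI
    results = []
    for sign in (+1, -1):
        A = -(1 + sign * gamma) / 2                  # eigenvalue -(1 +/- gamma)/2
        residual = A * A - ((J2 - 2j * I) / 4 - A)   # A^2 = (J2 - 2iI)/4 - A
        Gamma2 = gamma ** 2 + 4 * A + 3              # gamma^2 + 4A + 3
        expected = (gamma - sign) ** 2               # (gamma -/+ 1)^2
        results.append((A, residual, Gamma2, expected))
    return results


for A, residual, Gamma2, expected in check_A_eigenvalues(-4.0, 2.0):
    assert abs(residual) < 1e-12
    assert abs(Gamma2 - expected) < 1e-12
```

The upper sign in the eigenvalue of $A$ goes with $\Gamma = \gamma - 1$ and the lower sign with $\Gamma = \gamma + 1$, i.e. with $\kappa$ lowered or raised by one half.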

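Let $\kappa_0$ and $\kappa_0^*$ be any two complex numbers such that $2(\kappa_0-\kappa_0^*)$ is an integer.
Consider all numbers of the forms $\kappa = \kappa_0+\tfrac12 n$, $\kappa^* = \kappa_0^*+\tfrac12 n^*$, where $n$, $n^*$ are
integers, and construct the matrices $\mathfrak{U}^\alpha$, $\mathfrak{V}^\alpha$, $\mathfrak{U}^{\dot\alpha}$, $\mathfrak{V}^{\dot\alpha}$ as follows

$(\kappa,\kappa^* \mid \mathfrak{U}^\alpha \mid \kappa-\tfrac12,\kappa^*) = \mathfrak{U}^\alpha(\kappa,\kappa^*),$

$(\kappa-\tfrac12,\kappa^* \mid \mathfrak{V}^\alpha \mid \kappa,\kappa^*) = \mathfrak{V}^\alpha(\kappa,\kappa^*),$

$(\kappa,\kappa^* \mid \mathfrak{U}^{\dot\alpha} \mid \kappa,\kappa^*-\tfrac12) = \mathfrak{U}^{\dot\alpha}(\kappa,\kappa^*),$

$(\kappa,\kappa^*-\tfrac12 \mid \mathfrak{V}^{\dot\alpha} \mid \kappa,\kappa^*) = \mathfrak{V}^{\dot\alpha}(\kappa,\kappa^*),$

all the other matrix elements being zero. Then it is easy to verify directly from (44),
(52a) and (56) that $\mathfrak{U}^\alpha$, $\mathfrak{V}^\alpha$ commute with $\mathfrak{U}^{\dot\beta}$, $\mathfrak{V}^{\dot\beta}$, so that, for example,

$\mathfrak{U}^\alpha(\kappa,\kappa^*)\,\mathfrak{V}^{\dot\beta}(\kappa-\tfrac12,\kappa^*+\tfrac12) = \mathfrak{V}^{\dot\beta}(\kappa,\kappa^*+\tfrac12)\,\mathfrak{U}^\alpha(\kappa,\kappa^*+\tfrac12).$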

Similarly define and by 


Jccfi ^ + 

JA^ = tl“3S^-hlt>^3S*. 


( бб а) 

( ббб ) 




{k, k* I I /f, /f *) and [k, k* \ | /c, /c*) are equal to the and of the representa¬ 

tion K*. Now, in order that an operator may transform as an undotted spinor 
under Lorentz transformation, it is necessary and sufficient that 

( 66 «) 

[J^^,C!,] = 0. . (666) 

A similar condition holds with the interchange of dotted and undotted indices for
a dotted spinor $C_{\dot\gamma}$. The commutation relations of $\mathfrak{U}^\alpha$, $\mathfrak{V}^\alpha$ and $\mathfrak{U}^{\dot\alpha}$, $\mathfrak{V}^{\dot\alpha}$ thus ensure that
the former transform as undotted spinors and the latter as dotted spinors.


4. Unitary representations 


Since $I$ is real for unitary representations it follows from (41) that either $\kappa^*+\kappa+1$
is pure imaginary or else $\kappa^*-\kappa = \pm j = 0$. In the first case (20) is satisfied
automatically and $\kappa$, $\kappa^*$ are given by (1a). In the second case it follows from (20) that

‘““1 1 >(,+;.•+ 1 )> . ( 2 ; t + 1 )>, 


so that 1 |, i.e. 0 >Ar = /c*>—1 in accordance with (16). In particular, if 

/c,/c* are both of the form |%(n+— 2 ) thenj+= + |/c*-l-^|and 

±r = k+il - k*+il- 

Therefore is unitary only if |^+i| = + while is unitary 

whenever |/c-l-ii = + since in the latter case 

±il = (|/c+il - |/c*-f-i|)(|/c+il -t- |/f* + il) = 0 
and j — + 0 . By refl.exion I goes over into — / so that a representation is reflexion 

invariant only if $I$ is zero, i.e. $\kappa+\tfrac12 = \pm(\kappa^*+\tfrac12)$. Hence the representations
corresponding to (1b) are always reflexion invariant while those corresponding to (1a)
are so only if $\nu = 0$.


5. Expinors

Consider two complex variables $u_1$ and $u_2$ which transform as components of a
spinor under Lorentz transformations. Put

«{ = %, = . (67 a) 

% = -%. «2 = %,. (676) 

where the bar denotes complex conjugate. Then ui, transform as components of 
a dotted spinor and %, % transform the same way as %, «2 under spatial rotations, 
i.e. under unitary spinor transformations (see Van der Waerden 1932). Now
consider terms of the type

•wf 

where A = u-^u^+u^u-^ = = | % |*+1 « 2 1® 


( 68 ) 

(69) 




and $p, q, r, s, n$ are integers $\geq 0$ while $\lambda$ is a fixed complex number. Notice that on
account of (69) the various terms for different values of $p, q, r, s, n$ are in general not
linearly independent. To remedy this defect one proceeds as follows. Put




m 


2k'\ 


jjf i—m! A+m! F—m'! &'+! 

j = k'-k. 


(70a) 


(706) 

{70c) 


where $2k$, $2k'$ are integers $\geq 0$. $M$ can take the values $K, K-1, \ldots, -K$ and similarly
$m$ and $m'$ the values $k, k-1, \ldots, -k$ and $k', k'-1, \ldots, -k'$ respectively. The
summation in (70a) is over all values of $m$ and $m'$ such that $m+m' = M$. It is well known
(see Van der Waerden 1932) that every polynomial which is homogeneous in
each of the two sets of variables $u$ and $\dot u$ and has the degrees $2k$ and $2k'$ respectively
can be expressed uniquely in terms of these functions as follows:

K-v 

s S (71) 

0<»^2fc,2fc' M =-K+v 


$\nu$ runs through all integral values in the range $2k, 2k' \geq \nu \geq 0$ and the coefficients are constants.
Since every polynomial in $u$ and $\dot u$ can be split up into a sum of homogeneous
polynomials the expansion (71) is always possible and is unique. One can therefore
express the numerator of (68) in the form (71) and arrange the result according to
descending powers of $\Delta$. Now consider a series of terms of the type (68) which is
homogeneous and of degree $\lambda-j$ and $\lambda+j$ in $u$ and $\dot u$ respectively, where

$2j = r+s-p-q$

is an integer. Then it follows from (71) that it can be put in the form


00 K 

S S 

E>|3l M-—K 




(72) 


where $a_{KM}$ are constants. $K$ runs through the values $|j|, |j|+1, \ldots, \infty$. Now apply
a spinor transformation to $u_\alpha$ and $u_{\dot\alpha}$


u^ = a/<, 


Up, = 


and as before put 




(73) 

(74) 


Then [u, u) goes over into a polynomial in u' and u' of degree K -j and 

K+j respectively, which from (71) can again be expressed linearly in terms of 
d'-crfwhere d' = | < | < and U'^’^ = U^^(u',u'). Further 


d = 1 Ml |2-t-1= I ai^wi-l-01*24 |*-l-1 a^W^+a^^u^ |* 

= II a II (1 < P +1 Ms 1*) - (1 ai^Mi-f-ai*Mi |* -t-1 j*), 


( 76 ) 




where ||a Ijs [ 01^1®+1 i <5^2^!®+1 “a? p> 0. Thus 

where D' = | (x^v ^+1 a^u'^ +a 2 ®« 2 1® 


and 


0 <: 


D' 


:a|M 


-,< 1 . 


(76) 

(77) 

(78) 


since J,A'>0, if one excludes the case = 0. Therefore one can expand 

jjj ^72) by the binomial theorem in ascending powers of D'/H a H A'. Now D' 
is a polynomial in u' and Hence the product of its powers with can 

again be expressed linearly in the form (71). Ignoring for the moment the question 
of convergence, which will be examined presently one may collect the coefficients 
of U'§'^IA'^-^ together and write 


y 


S 

s:,M 






'M 


(79) 


where are the new coefficients. The numbers A—j and A+j being the degrees 
of the series in the variables u and u respectively, are clearly left unaltered by the 
homogeneous linear transformation (73). Hence only terms of the tj^e I 
can occur on the right-hand side of (79). The linear transformation 
the coefficients thus constitutes a representation of the Lorentz transformation 
associated with (73). 
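The binomial expansion above rests on the fact that $\Delta/\Delta'$ stays strictly between fixed bounds for a given spinor transformation. A minimal numerical sketch follows, assuming $u = a u'$ with a 2 x 2 complex matrix $a$ of determinant 1 and $\|a\|$ the sum of the squared moduli of its four entries. The index placement of $a$ and all function names are assumptions made for illustration.

```python
def frob_sq(a):
    """||a||: sum of |a_ij|^2 over the four entries."""
    return sum(abs(x) ** 2 for row in a for x in row)


def delta(u):
    """Delta = |u_1|^2 + |u_2|^2."""
    return abs(u[0]) ** 2 + abs(u[1]) ** 2


def apply(a, u):
    """Matrix-vector product a u for a 2x2 matrix and a 2-vector."""
    return (a[0][0] * u[0] + a[0][1] * u[1],
            a[1][0] * u[0] + a[1][1] * u[1])


def inverse_unimodular(a):
    """Inverse of a 2x2 matrix whose determinant is 1."""
    return ((a[1][1], -a[0][1]), (-a[1][0], a[0][0]))


def ratio_bounds(a, u_prime):
    """Return (lower, ratio, upper) with ratio = Delta / Delta'."""
    u = apply(a, u_prime)  # assumed convention: u = a u'
    ratio = delta(u) / delta(u_prime)
    return 1 / frob_sq(inverse_unimodular(a)), ratio, frob_sq(a)


# determinant (2+i)(1+i) - 3i = 1
a = ((2 + 1j, 1j), (3, 1 + 1j))
lower, ratio, upper = ratio_bounds(a, (1 - 2j, 0.5 + 1j))
assert lower < ratio < upper
```

Since $D' = \|a\|\Delta' - \Delta$, the upper bound is the statement $D' > 0$ and positivity of $\Delta$ gives $D'/(\|a\|\Delta') < 1$, which is what the expansion in ascending powers requires.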

To examine rigorously the validity of our considerations make the substitution 


= p cos ^ 6 ui — p cos J d (80 a) 

U 2 = p sin p sin \d (806) 

where p, X, $5 are real andp>0,7 r>^>0 , 27r>^>0, 4o7t>x>^- Puto; = cos^and 


Then (72) becomes 


where (x) = 


f{2n) ^ ~Aj\K-MlK+M\2K + ir^ ' 

(81a) 

CO K 1 


K>\f\M - K 

(816) 


(- 1 )^+/ f/K^Ml (X+i) \ 

2 ^ ^\K+MlE+jlK-j\) 


The functions = 


ak+m 

X (1 (81c) 

1 


P^^ (a;) are orthogonal and normalized, i.e. 

1 if K^K^M^M'y 
otherwise. 


V( 2 ^) 

r^TT __ li 

•/^=0 J a5=*—l lO 


( 82 ) 




For convenience arrange these orthogonal functions in a sequence

$Q_\nu$ ($\nu = 1, 2, \ldots, \infty$)

and denote the interval $-1 \leq x \leq 1$, $0 \leq \phi \leq 2\pi$ by $G$. Then (82) can be written as

$\int_G \bar Q_\mu Q_\nu\,dx\,d\phi = \delta_{\mu\nu}.$ (83)

Denote a series $\sum_{\nu=1}^{\infty} A_\nu Q_\nu$ by $A$. $A_\nu$ will be called the coefficients of $A$. It is important
to note that $A$ is not the sum of the series since this sum may not exist in general.
$A$ is merely an abbreviation for the infinite sequence of complex numbers $A_\nu$
($\nu = 1, 2, \ldots, \infty$) which completely characterizes the series. Two series $A$ and $B$ are
to be regarded as distinct unless $A_\nu = B_\nu$ for all $\nu$. If $c$ is a constant $cA$ denotes
a series with the coefficients $cA_\nu$. Similarly $A+B$ stands for the series with the
coefficients $A_\nu+B_\nu$. $A = 0$ is the series such that $A_\nu = 0$ for all $\nu$. The norm $\|A\|$ of
a series $A$ is a non-negative number defined by

$\|A\|^2 = \sum_{\nu=1}^{\infty} |A_\nu|^2.$ (84)

Clearly $\|A\| > 0$ unless $A = 0$ when $\|A\| = 0$. If the series on the right-hand side
of (84) is divergent we say the norm is infinite.
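The operations on series defined above act on coefficient sequences only. As an illustration, here is a minimal sketch for truncated sequences stored as finite lists. The truncation and the function names are assumptions made for the example, since the series in the text are infinite.

```python
import math


def scale(c, A):
    """cA: the series with coefficients c * A_nu."""
    return [c * a for a in A]


def add(A, B):
    """A + B: the series with coefficients A_nu + B_nu.

    Both truncations are assumed to have the same length.
    """
    if len(A) != len(B):
        raise ValueError("truncations must have equal length")
    return [a + b for a, b in zip(A, B)]


def norm(A):
    """||A|| from (84), evaluated on a truncated coefficient list."""
    return math.sqrt(sum(abs(a) ** 2 for a in A))


A = [3, 4j]
B = [1, -1]
# the norm is homogeneous and satisfies the triangle inequality
assert abs(norm(scale(2j, A)) - 2 * norm(A)) < 1e-12
assert norm(add(A, B)) <= norm(A) + norm(B)
```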

Let $f(x,\phi)$ be a continuous function of $x$ and $\phi$ defined in $G$. Then by the norm of
$f$ is meant the number $\|f\| \geq 0$ given by

$\|f\|^2 = \int_G |f|^2\,dx\,d\phi.$ (85)

Now denote by $A^N$ the partial sum $\sum_{\nu=1}^{N} A_\nu Q_\nu$ of the series $A$. $A^N$ is a continuous
function of $x$ and $\phi$. Hence from (85) and (83)

$\|A^N\|^2 = \sum_{\nu=1}^{N} |A_\nu|^2,$ (86a)

so that $\|A\|^2 = \lim_{N\to\infty}\|A^N\|^2.$ (86b)

This rather natural connexion justifies the use of the same term norm for both $A$
and $f$. Now it is well known (see Courant & Hilbert 1931) that the functions $Q_\nu$
form a complete set in the sense that for every continuous function $f$ defined in $G$
there exists a series $A$ such that

$\lim_{N\to\infty}\|f-A^N\| = 0.$ (87)

Clearly this series is unique, since if $A$ and $B$ be two such series then by Schwarz's
inequality which clearly holds for norms

$\|A-B\| = \lim_{N\to\infty}\|A^N-B^N\| \leq \lim_{N\to\infty}\{\|f-A^N\|+\|f-B^N\|\} = 0.$




Hence $A = B$. It can be proved from (87) and (83) (see Courant & Hilbert 1931)
that

$A_\nu = \int_G \bar Q_\nu f\,dx\,d\phi = (Q_\nu, f)$ (88a)

and $\|f\| = \|A\|,$ (88b)

so that $A$ has a finite norm. Now consider (81b). Write it as

p^e-^^xA. (89) 

Apply the spinor transformation to the $u$'s and introduce the four real variables
$\rho'$, $\theta'$, $\phi'$, $\chi'$ corresponding to $u'$ by equations similar to (80). Put $x' = \cos\theta'$.
Then $\rho$, $x$, $\chi$, $\phi$ are functions of $\rho'$, $x'$, $\chi'$, $\phi'$. Notice that



*+ % 



< 

1 ®+ < 

^ ^ f t 



and so p^jp'^ and are functions of x' and (j>' alone. Obviously the same is true 
of X and <f). Therefore 

piXg^-iixA^ = 2 A,Q,(a:,^)j 

=p'^^e-Wf^{x',^'). (91) 

Clearlyis a continuous function of Now put 

= f (92) 

JG 

and denote by the series whose coeflBlcients are Ay^j^, Prom the completeness of 
the set Qy it follows that 

li“a 11/jv-A^ 11* = lim f \fN(x',^')-A'^(x',^') \^dx'd^' = 0, (93) 

71-^00 U’^oaJ G 

■where A'^(z'= h Therefore 

P=1 

II -^N+p “■ II = J™ II ^N+p ^'n II 

n->oo 

< lim {ll/iV+p~-4i?+3J 11+ ll/v“-^i?1l+ li/iV+3J—/v ID 

, n,“>oo 

= ll/v+I>“/^? II 

from (93). But from (91) 

/•l/^ 2 \A 2W+p 2 

ll/iv^^-/7.1P = J^|(^2) ^SA,(2,(*,«5) dx'd<l>'. (95) 

Now since the determinant of any spinor transformation is unity it follows that 

d{p',x\f,e') - d{p\x:,<l>',0')'^^ 


(96) 




(Here the usual notation for the Jacobian has been used.) Therefore 

o{p,X>9>^) 

nr Hp,X>«4) _P^ 

d(p',x\^';i>') P^' 

But since pjp', x and <f> are functions of x' and <j>' alone 

d(p',x',^',f) p'w,<i>'y 

«,th»tfrom(97) 

Therefore, taking x,^ as the independent variables in (96), one gets 
s r 12 

ll/w.+^-/vlP = I S AQ.(«,?5) dxd^. 


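But from (76)

$\Delta/\Delta' < \|a\|.$

Similarly by considering the inverse transformation $a^{-1}$ one finds that $\Delta'/\Delta < \|a^{-1}\|$.
Hence

$\dfrac{1}{\|a^{-1}\|} < \dfrac{\Delta}{\Delta'} < \|a\|.$

Therefore $|(\Delta/\Delta')^{\lambda+1}|$ is bounded for every fixed value of $\lambda$ and $a$, and so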

/• |JV+3? 2 

ll/iv+a,-/vf<<^(a,A) dxd4>, (99) 

where C{a, A) is a real positive constant depending on a and A. Hence from (94) and 

00 

||.4^v,-^J^l|<0'(a,A)SMr|“ {N,N'>No). 

N, 

00 

So that from the convergence of 21 |® it follows that 

liml!^^^.-4^^jl = 0. (100) 

" N-^oo 

Therefore tends to a limit -4^ as ->cx) in such a way that if -4' be the corre¬ 
sponding series then T „ „ 

^ ^ hm\\A'^A'^\\^0. (101) 

« N—^co 

Also clearly 

^ ** /* T / /f V A.+1 iSi 

J/jy(a:',95')|®da:'d^'= \A^{x,^)ydxd(^ 

< C{x, A) 2 M, 1" < C(a, A) 11 A 1|*. 

V=sl 



Therefore || A' || is finite. A' has the property that 

lim I \p^^e-^^r.A^{x,<j>)-p'^e-^'^^'A'^’[x',<l>')\^dx'd4'= (). (102) 

N-^CO J Q 
N'^00 

This follows from the inequality 

11 /^-^'” MIno-^X \\+\\A'--A'l 11 +11 

< \\fN,-^X 11 + \\A'-A'^, 11 + 11 /^-/^, 11 . (103) 

By choosing «.„ and Nq sufficiently large all the three terms on the right can be made 
arbitrarily small for n'^n^,N'^N^. On going over to the variables x,^m ( 102 ) and 
using (98) one finds that 

lim I* \p'^&-^^r.A^{x,<j>)-p'^e-i^>i:A'^'{x',^')\^dxd6=^0. (104) 

iV-^oo J Q 

N'->co 

Thus ( 102 ) implies (104) and vice versa. Also it is clear that A' is the only series 
satisfying ( 102 ), since if were another such series then 

11 < l|/^-^'» ||+Jl/iv--B'” II, 

50 that on making N and n tend to infinity 

\\A'-B'\\ = lim II11 = 0. 

w ,-+-00 

Hence $A' = B'$. The transformation $A \to A'$ is thus uniquely determined by the spinor
transformation $a$. Clearly it is a linear transformation in the sense that if $A \to A'$
and $B \to B'$ then $A+B \to A'+B'$ and $cA \to cA'$, where $c$ is a complex number. In fact
it is a linear transformation of the Hilbert space consisting of all sequences for which
$\sum_{\nu=1}^{\infty} |A_\nu|^2$ is convergent. These transformations thus form a representation of the
Lorentz group. Also notice that

/^\A.+l|2 

I \A^M)\^dxd<^, 

so that if A +1 is pure imaginary or zero then 

\\A'^r=\\A^\\K 

Therefore \\A' 1| = 1 | A 1 |, 

since hm | HA' H- \\A'if\\ 1 < fim i|A'-A';^ll = 0 

JV~>oo iV'-^-oo 

from (101). Thus the norm of the 'vector' $A$ of the Hilbert space is unchanged by
the transformation. The representation is therefore unitary. Besides these unitary
representations there are others corresponding to the case $j = 0$, $\lambda$ real and $|\lambda+1| < 1$
(cf. (1b)), but they do not leave the norm $\|A\|$ invariant.






To obtain the same matrix representation as that given in § 2 one has to choose 

/Z-A-1!\^e.^ '(106a) 




as the basic vectors. Here $z! = \Gamma(z+1)$ for any complex number $z$. In the exceptional
case where $\lambda$ is an integer $K-\lambda-1!$ or $K+\lambda+1!$ may become infinite. Then one
can make use of the relation

$\Gamma(z)\,\Gamma(1-z) = \dfrac{\pi}{\sin\pi z}$

and ignore constant factors (including $1/\sin\pi = \infty$). Thus one gets the alternative


and 


V(A+1+Z!A-^:!)’ 
^{K-X-V. -A-2-Z!)(2§^e-^V^. 


(105c) 


(105a,6, c) respectively are applicable to the three cases \ A+1 [, A> \j \ and 

— A> IJ1 + 2, A being an integer. Then (1056) and (105c) are finite representations. 
(105c) corresponds to an expansion in ascending powers of 1/J and ignoring all 
powers higher than l//d“<®^+^), i.e. calculating the expansion mod l//l-<*^+*\ 
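The reflexion relation $\Gamma(z)\Gamma(1-z) = \pi/\sin\pi z$ used above, together with the convention $z! = \Gamma(z+1)$, can be checked numerically for real non-integral $z$ with the standard library. This is a sketch for real arguments only; the function names are hypothetical.

```python
import math


def factorial_z(z):
    """z! = Gamma(z + 1), as in the text (real z only in this sketch)."""
    return math.gamma(z + 1)


def reflection_residual(z):
    """Gamma(z) * Gamma(1 - z) - pi / sin(pi z) for real non-integral z."""
    return math.gamma(z) * math.gamma(1 - z) - math.pi / math.sin(math.pi * z)


for z in (0.3, 0.5, -1.7, 2.25):
    assert abs(reflection_residual(z)) < 1e-9
```

As $z$ approaches an integer both sides grow without bound, which is the source of the infinite constant factor that the text proposes to ignore.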

The theory developed so far for functions of the two variables $x$, $\phi$ can easily be
extended to those of the three variables $x$, $\phi$, $\chi$ defined in the interval $H$ given by

$-1 \leq x \leq 1$, $0 \leq \phi \leq 2\pi$, $0 \leq \chi \leq 4\pi$. Put

= (106) 
and denote a series 2 2 

v=l j=-oaK>\j\ 


by $\mathfrak{A}$. The summation for $j$ is over all integral and half-integral values. Put


Let/(a;,^,X) be a continuous function of x,<f>,x defined in the interval H. Put 

11/11"= f \f\^dxd4>dx. 

ja 

Now it is well known that the set is complete and therefore for every continuous 

^ -there exis-ts a series S[ such that 

lim II/-1^11 = 0, (107 a) 

^ iV -^00 

where 31^ = 2 Also 

I1/11"=I121||2= limllSl^lP, 

N->-m 

=(«,,/). 


(1076) 
(107 c) 




where the symbol $(g,f)$ is defined for any two continuous functions $g$ and $f$ by the
equation

$(g,f) = \overline{(f,g)} = \int_H \bar g f\,dx\,d\phi\,d\chi.$ (108)

Let $\mathfrak{A}$ be a series with finite norm and let $f$ be a continuous function defined on
$H$. Then $f$ is bounded, i.e. there exists a constant $C$ such that $|f| < C$. Clearly $f\mathfrak{A}^N$
is a continuous function. Hence there exists a series $\mathfrak{B}_N$ satisfying the equation

lim 11/21^-93^11 = 0. 

n-^co 

Now IIII< 11 +11/21^-585^ II + ||/2l^-/2l^“II, 

and so■ II58^-1| < ||/St^-/Sl^« ||. 

i.e. l|58iv-58^.ll<C2S|2t,P 

No 

Therefore 93jv tends to a hmit 93 as N^co and 1| 93 H is finite, since 
1193112= lim f \f^^\^dxd<^>dx^C^m\\K 

N-^ooJ H 

Hence we conclude that 93^ = lim (/Sy,/2t^). (109) 

2V->oo ^ 

Now let $D$ be any one of the three operators $\partial/\partial x$, $\partial/\partial\phi$ or $\partial/\partial\chi$ and let $f(x,\phi,\chi)$ be a
continuous function whose three partial differential coefficients exist and are
continuous everywhere on $H$. Also assume that

$f(x,\phi,0) = f(x,\phi,4\pi)$ and $f(x,0,\chi) = f(x,2\pi,\chi\pm2\pi),$

where the sign in $\chi\pm2\pi$ is determined from the fact that the resulting value must lie
between 0 and $4\pi$. Then it is clear by partial integration that

{8,,Df) = -{P*8J), (110a) 

where D*8y is defined by D*8^ = 08^. (1106) , 

But it is weU known that D*8y can be expressed? as a finite linear combination of the 
8;s, i.e. 

D*8, = 'E8^i8^,D*8,), (111) 

<r 

where (8^, D*8y) is zero for all except a finite number of values of cr. Hence from 

(110a) and (111) • _ 

{8,,Df) = --E{8^,D*8,){8^,f) 

cr 

= S(HCT)(« 

cr 

=^'L{8,,D8,){8„f). 


( 112 ) 




Since $Df$ is a continuous function the series $\mathfrak{B}$ with $\mathfrak{B}_\nu = (S_\nu, Df)$ 'represents' $Df$, i.e.

$\lim_{N\to\infty}\|Df-\mathfrak{B}^N\| = 0,$

and therefore

$\|\mathfrak{B}\|^2 = \int_H |Df|^2\,dx\,d\phi\,d\chi$

is finite. Now (112) shows that the coefficients $\mathfrak{B}_\nu$ can be obtained by differentiating
the series $\mathfrak{A}$ representing $f$ ($\mathfrak{A}_\nu = (S_\nu,f)$) term by term, expressing $DS_\nu$ as a linear
combination of $S_\sigma$ and collecting coefficients of $S_\sigma$. Thus if $\mathfrak{A}$ 'represents' a
continuous function with continuous partial derivatives then the series $D\mathfrak{A}$ obtained
by term-by-term differentiation represents $Df$ and therefore $\|D\mathfrak{A}\|$ is finite.

Now from a series 21 one can always go back to the series of the type (72) by making 
use of the relation. 


formally put 


00 00 

f{u,u)= S S 

j - ooK>\nM ^ 


EM 

■ 


(U3a) 




<1136) 


.«d write 

= iistr. 

For brevity we say that $\psi$ is a continuous function of $u$ and $\dot u$ if there exists a
continuous function $f(x,\phi,\chi)$ defined on $H$ with

$f(x,\phi,0) = f(x,\phi,4\pi)$ and $f(x,0,\chi) = f(x,2\pi,\chi\pm2\pi)$

such that $\mathfrak{A}_\nu = (S_\nu,f)$.

Such a function, if it exists, is uniquely determined by $\mathfrak{A}$, since if $f$ and $f'$ were two
such functions then

$\|f-f'\| \leq \lim_{N\to\infty}\|f-\mathfrak{A}^N\| + \lim_{N\to\infty}\|f'-\mathfrak{A}^N\| = 0,$

so that

$\|f-f'\|^2 = \int_H |f-f'|^2\,dx\,d\phi\,d\chi = 0.$

But as $f$ and $f'$ are continuous this implies that $f = f'$ everywhere on $H$. Similarly,
if $f$ has got continuous partial derivatives everywhere on $H$ we say $\psi$ is continuously
differentiable. Hence the above result can be expressed by saying that if $\psi$ is
continuously differentiable it can be differentiated term by term.

Notice that if $\psi$ is continuous in one Lorentz frame, then it is continuous in every
other. For if $\mathfrak{A}$ is the corresponding series and $\mathfrak{A}'$ its transform in any other
Lorentz frame then


and 


lim f \p^Wix,<f,x)-p'^W^'ix',f)',x;)\^dx'64'dx' (114o) 

V->oo JH 

lim r \p^W-p:^f\^dxd<f)dx = (i. (1146) 


JV" 




where / is continuous. Define 

/'(«'.9^'.a:') = f{^.4>>X), (114:c) 

then clearly/'(a;, $5^ ;t) is continuous, and as is easily proved 

lim l!/'~9l'^|| = 0, (114^) 

N-^oo 


Therefore the transform $\psi'(u',\dot u')$ is continuous. The same is true for the property of
continuous differentiability. In particular, if $\psi(u,\dot u)$ is such that in some particular
Lorentz frame it has only a finite number of terms in the series, i.e. $\mathfrak{A}_\nu = 0$ for all
$\nu > N$ then clearly $\psi(u,\dot u)$ is continuously differentiable an arbitrary number of
times.

One can now introduce the operators djdu^, Uj^ and djdu^. djdu^ is defined by 

d ^ dp d dx ^ dx d d^ d 
du^ ~ dUf, dp dUf, dx dx du^ d^ ’ 


where is ^a) | minor of dujd^ in the Jacobian 

I d{p,x,x,<p) I 


S(p,X>x,^) 


(i’=P,X>^OT^). 


djdu^ is defined similarly, Up d^jdu^ and are clearly continuous functions 

of x,^,x (Q'Part from a possible factor p or 1/p) and therefore from (109) and (112) 
the operators Up djdu^ and 9/9z6^ can be applied to every continuously differenti¬ 
able series i/r. In this term by term application 9/9% corresponds formally to partial 
differentiation with respect to % keeping %,%,% constant. Similarly for 9/9%, 9/9% 
and 9/9%. 

Consider an infinitesimal spinor transformation

(115a) 

( 1156 ) 

= 7 !^) applied to the expinor space. From the condition that the determinant of 
the transformation should be 1 so that the fundamental spinor remains invariant 
it follows that 

Let ^{u,u) be a series which is homogeneous and continuously differentiable so 
that ^ = p^^2l. Under (115) ijr goes over to = p^^%* where 31' is defined by (114a), 
Since ^ is continuously differentiable there exist continuously differentiable functions 
f(x,<p,x) and /'(a;',^',X') satisfying (1146,c,d). Write p^f{x,^,x) = ^{u,u) and 
p'^nx',i>',x') = Then 

W'(u’,u') = ^{u,u). 


(116) 





From (116), (115) and the definition of and d^sdldu^, it follows that 

W'(u,u) = ¥^'(«', m ') + 

= W{u, u) +0^!P+1// di^w 
= (117) 

up to quantities of the first order. Here 

IP' ^ (llSa) 

= (1186) 

Since ^ is continuously (Merentiable the functions If^^W{u,u) and I^^WiUyU) are 
represented by the series and I^ifr respectively. 

Now the Lorentz transformation 


^kl~^’^^lk ‘ (119a) 


associated to (115) is given by 

= -ivJ (1196) 

where the (r-symbols have their usual meaning (see Van der Waerden 1932 ). (1196) 
is obtained immediately from the condition that a spinor transformation applied 
together with the associated Lorentz transformation leaves unaltered. From (3) 
the transformation operator corresponding to (119 a) is 1 —Hence the 
connexion between and is given by 


V = ■ (120) 

On using the usual representation for cr\x fii is easily proved that and Jxp. 
obtained are the same as those introduced in § 3. 

Notice that [9^?'^*'] = d% ( 121 a) 


= \uA, d^l. 


( 1216 ) 


and the dotted operators commute with the undotted ones. Also if $\psi$ is homogeneous
in the dotted and undotted pair of variables separately and of degree $2\kappa^* = \lambda+j$
and $2\kappa = \lambda-j$ respectively, then it follows immediately on term-by-term
differentiation that

$u_\alpha\,\partial^\alpha\psi = 2\kappa\,\psi,$ (122a)

$u_{\dot\alpha}\,\partial^{\dot\alpha}\psi = 2\kappa^*\psi.$ (122b)

From (118), ( 121 ) and ( 122 )iit is clear that the properties of 0^ are the same as 
those of S 8 ^ respectively as defined in (64). In fact the matrix representation of 
0^ is identical with that given for in § 3 if one chooses either of the three 

functions (105a), (1056) or (105 c) for all values of A and J as basic vectors. Similarly 
= 11 ^ and 0 ^ = 




The meaning of (43) and (46) is now obvious in terms of and Let ^ be a 
vector of the space x fftj such that the corresponding pair of series «) are 
continuously differentiable and are of degree 2k and 2k* in u and u respectively. 
Then clearly u‘^'ijrgfu,,u) and d‘‘i/r^{u,u) are two series of degrees (2k + 1,2k*) and 
(2k— \,2k*) respectively and therefore transform according to the representation 

Also notice that the commutation relation ( 121 a) remains unchanged under the 
formal substitution 

ft 

By the same substitution 2 /c = goes over into = — ( 2 /c+ 2 ), i.e. k goes over 
into $-k-1$. This corresponds to (59).

6. Physical applications 

The irreducible representations of the Lorentz group discussed above may be 
used to describe the transformation properties of the wave function of a particle. 
Let ^ and be two vectors of the product spaces 9 ft x and 9ft x 9 ft 4 respectively. 
The corresponding series will be denoted by and their coefficients 

being functions of the space-time coordinates —0,1,2, 3). Assume that the 
functions and }P^ associated to these series (cf. § 5) are continuous and dijfferenti- 
able a sufficient number of times with respect to u^, and Then in analogy with 
the case of finite representations (Fierz 1939 ) one can set up an equation of motion 
of the particle as follows: 

i (123 a) 

= ( 1236 ) 

g 

■w^liere 9«X = <^«a9a:. = ^ 

and $\chi$ is a constant $\neq 0$. The second-order equation

follows immediately. (123) can be put in a more elegant form by making use of the 
formalism of a recent paper (Harish-Chandra 1947 ). Introduce the 4x1 matrices 
7^ and 7^ (see Harish-Chandra 1946 ) and put 

(125) 

Then there exist 1 × 4 matrices and having the following properties:

r^ 7 ^ = 0 = r,%, (126a) 

= (1266) 

= 1 , (126c) 

A^^ = 2(7^r^+7^If), (126(i) 




where A^p, = a* being the usual 4x4 Dirac matrices satisfying the commuta¬ 
tion rules aj,aj-l-ajaj. = (127) 

Therefore ir^ = ir*W, (128a) 

and (123) is equivalent to = 0. (1286) 

IP is a 4 X1 matrix, each element of which is a series in a, u of degree (2k, 2k*) whose 
coefficients are functions of Xp.. 

Now choose a representation with $\lambda + 1 = 0$. Then it is unitary. For two given
series ^ and <j) — define the scalar product 


= lim \A^B^dxd4> = S 

iV-»ooJ v=*l 


(129) 


From Schwarz’s inequality 

v=2V'o p=iyro 

and therefore {^, exists and is finite. Also it is easy to show from (98) that (^, 
is invariant under Lorentz transformations. Notice that 


S AB, 

y=^N^ 


and = \\ir\\\ 

The charge and current density corresponding to (128b) can be given by

= eW^oc^W, (130) 

where IP**'is defined by = iSd, (131a) 

A being a non-singular Hermitian matrix such that 

ocl=Aocj,A-\ (1316) 

(t denotes Hermitian conjugate.) ^is the transposed of W and the right side of 
(130) stands for ^ 

e E 

P»(r=‘l 

The charge density is therefore positive definite as in Dirac’s theory of the electron.
In fact (128b) can be derived from the Lagrangian

i = i (0^ W*a^W- 0* W) + xW*^ 

in the usual way. The analogy with the case of the electron is complete. Neither the 
energy density nor the total energy is positive, and therefore the theory can be 
quantized only in accordance with the exclusion principle. Write = TJ 

CO 

and u) - Yi^pv^v ^se a representation for for which A an. Then in 

F =1 

the quantized theory the coefficients appear as operators satisfying the
commutation rules

s^yx,f)iPp,,(x',t)+F^,(x',«)??,/>(3f>*) = KpKA^-^')- 


(132) 



Here x and t are the space and time coordinates and Wf ^ is the Hermitian conjugate 

of 

Notice that no subsidiary conditions are necessary to make the charge-density 
positive definite (cf. Fierz 1939 ). Further, this scheme describes particles of integral 
or half-integral spin according as j is half-integral or integral. This follows
immediately on reducing the total representation space with respect to the rotation group.
Hence particles with integral spin must be quantized here according to Fermi
statistics. Such a state of affairs, however, is not possible for finite representations
(Pauli 1940 ). 

Now instead of (128b) one can equally well take the equation of motion of the
particle to be 

(133a) 

where $\beta_k$ are the Duffin-Kemmer matrices satisfying the commutation rules

$$\beta_k\beta_l\beta_m + \beta_m\beta_l\beta_k = g_{kl}\beta_m + g_{ml}\beta_k. \tag{133b}$$

In this case the energy density as calculated from the energy momentum tensor 

y* = - (134) 

A^ = A . 

is positive definite. The equation describes particles of integral or half-integral spin
according as j is integral or half-integral. As usual it can be quantized according to
Bose statistics through the commutation rules 

* (135) 

where = 23^%(y?o%<r and 

P cr 

$\Psi'$ stands for $\Psi(x', t)$. Electromagnetic interaction can be introduced in (128) and
(133) in the usual way. Also it is clear that (128) and (133) are both reflexion
invariant since the unitary representations with $\lambda + 1 = 0$ are reflexion invariant.

So far no subsidiary conditions have been imposed on $\Psi$. As a result the particle
thus described does not possess a definite spin in the sense that in the rest system
the non-vanishing components of the wave function do not all necessarily transform
according to one irreducible representation $\mathfrak{D}_s$ of the rotation group. The subsidiary
conditions imposed by Dirac ( 1936 ) and Fierz ( 1939 ) on ^ and are as follows: 

9“^ = = 0. 

From (126) and (128b) they may be written as

0/*jn* ^ ^ 0^p* ?P = 0, (ISea) 

or otherwise as — 0. (1366) 




Then as shown by Fierz (1939) all non-vanishing components of $\psi$ in the rest system
transform according to $\mathfrak{D}_s$ with $s = k + k^{*}$ (where $k$, $k^{*}$ are both real, $\geq 0$ and of the
form $\tfrac{1}{2}n$). One can, however, consider this question in a more general way at least in
the unquantized force-free case. Consider the quantity where 

refer to the total representation space so that, for example, in the particular case of 

In the rest system of the particle $\partial_k = 0$ ($k \neq 0$) and $i\partial_0 = \pm\chi$, so that from (53a), (52),
(31a) and (29) 

= ~ +-^22-^22} ^ 

= JX"(K2+L2)!P 
=;t2(K2-^J2)!P 

since in the rest system = s{s + 1 ) if the particle has the ‘ pure ’ spin s. There¬ 
fore + + (137) 

Now (137) is a spinor equation and therefore, since it holds in the rest system, it must
hold in every system. (137) is therefore the condition which must be satisfied in
order that the particle may possess the ‘pure’ spin s. However, there is no real
reason for preferring (137) to the simpler condition

[a«A 3 ;j;i/^/«j_^ 2 (s(s + 1 ) += 0, (138) 

where 4 ”^ = and . = i(«A5/i+«;i^A)- 

(138) implies that in the rest system each element of $\Psi$ is a series all of whose coefficients
lying outside the subspace vanish. In the case (128) the elements themselves 
transform according to and with respect to the matrix index and therefore 

the entire set of components of the wave function in the rest system can be reduced 
into two subsets transforming according to under the rotation group. 

Clearly a similar argument is applicable to (133). Now (138) can be written as 

d^d^h[.g^Uxdpd^W-\-x%{f^+K*'-8){K'\-K*-jrl + $)}W = 0. 

Since in our case $k + k^{*} + 1 = \lambda + 1 = 0$ it follows that (138) is equivalent to
^ 1) IP = 0, (139a) 

The following equations are easily obtained from (139a):

d‘‘^dfi^d^dxV,^UjiW-xMs + l)^= 0, (1396) 

d‘‘^dfi^u^dxd^d^W-x^{s-j)(s+j+l)W = 0, (139c) 

d‘*^d^/^d^uxu^d^ W-+j) {s-j+l)W= 0. (139d) 




These four equations are completely equivalent. It is easily seen that they can be
satisfied by imposing any one of the following four subsidiary conditions on $\Psi$:



o' 

1 ! 

II 

(140a) 

d^i^UfUfiW = 0 

11 

11 

0 

(1406) 


(s 

(140c) 

= 0 

(s = -j> 0 ). 

(140d) 


(140a) and (140b) are equivalent since one goes over into the other by the substitution
$k \to -k-1$, $k^{*} \to -k^{*}-1$. The same is true of (140c) and (140d). The equation
proposed by Dirac (1945) for a spinless particle corresponds to (133) and (140b), using
the five-row representation for $\beta_k$.

Notice that (137) or (138) ensures that $\Psi_\rho$ reduces to a finite series in the rest system.
Therefore as proved in § 5 it is continuously differentiable an arbitrary number of
times. In an arbitrary system $\Psi_\rho$ is in general an infinite series having components
in every subspace. This, however, in my opinion, does not justify the interpretation
given by Dirac (1945), that the particle develops new spins during motion.
Even in the case of finite representations $\Psi$ has components lying outside the
subspace in an arbitrary frame of reference. This is usually not interpreted to
mean that the spin of the particle is altered by motion. 

Finally, I should like to thank Professor Dirac for suggesting the investigation of 
these infinite representations. 


References

Courant & Hilbert 1931 Methoden der mathematischen Physik, 1, chapter II. Berlin:
Springer.

Dirac 1936 Proc. Roy. Soc. A, 155, 447-459.

Dirac 1945 Proc. Roy. Soc. A, 183, 284-296.

Fierz 1939 Helv. Phys. Acta, 12, 3-37.

Harish-Chandra 1947 Proc. Camb. Phil. Soc. (in the Press).

Pauli 1940 Phys. Rev. 58, 716-722.

Pontrjagin 1939 Topological groups. Princeton: Princeton University Press.

Van der Waerden 1932 Die gruppentheoretische Methode in der Quantenmechanik. Berlin:
Springer.

Weyl 1931 The theory of groups and quantum mechanics. London: Methuen.

Weyl 1939 The classical groups. Princeton: Princeton University Press.





Thermal diffusion in mixtures of the inert gases 
By K. E. Grew, Heriot-Watt College, Edinburgh
(Communicated by M. L. Oliphant, F.R.S. Received 15 August 1946)


Measurements have been made of the separation produced by thermal diffusion in all the
binary mixtures of the inert gases (except Kr-X and Rn mixtures) at temperatures ranging
from −180 to 400° C. The temperature dependence of the thermal diffusion ratio, $k_T$, or of
$R_T$ ($R_T = k_T/[k_T(\infty)]_1$, where $[k_T(\infty)]_1$ is the theoretical value for molecules behaving like
rigid elastic spheres) has been deduced. The results indicate that, in general, $R_T$ increases
with the temperature in the lower range and becomes constant at high temperatures. The
constant value is first attained at a temperature which varies from mixture to mixture, but
its magnitude is nearly the same for all mixtures. It corresponds with a repulsive force
index of 11, if the molecules are treated as centres of repulsive force only.

The results have been compared with those calculated by R. Clark Jones (1940, 1941) for
a gas mixture in which the two molecular species have identical force fields of the
Lennard-Jones type, with repulsive force index $\nu = 9$ and attractive force index $\nu' = 5$.
The theoretical results give $R_T$ as a function of $T/T_c$, where $T_c$ is the critical temperature
of the gas. This relation with the critical temperature is supported by the experimental
results, though there are difficulties in the comparison due to the non-identical nature of
the force fields in the experimental mixtures. It appears, however, that the observed
behaviour could be explained by a Lennard-Jones model if the force indices were suitably
chosen, and that a study of the temperature dependence of $R_T$ in isotopic mixtures would
enable the indices to be determined with some precision.


Introduction

The work to be described is an experimental investigation of the dependence on
temperature of the thermal diffusion ratio $k_T$ for the various combinations of the
inert gases, and a comparison of the results with the theoretical ones of R. Clark
Jones (1940, 1941) for molecules which have an attractive field superimposed on a
repulsive field.

The thermal diffusion ratio appears in the Chapman-Enskog equation relating
the concentrations $c_1$, $c_2$ of the molecular species in a binary gas mixture in which
a temperature gradient exists to the temperature:

$$\nabla c_1 = -\nabla c_2 = k_T\,\nabla \log T.$$

$k_T$ is a complicated function of the masses, concentrations and force fields of the
molecules. The effect of the fields enters in the form of integrals $\Omega$ which may be
considered as dimensionless collision cross-sections, and unless the two molecular
species have identical fields of force, distinction must be made between the
interactions of like and unlike molecules, that is, between the integrals for each kind
of encounter. When the molecules have identical fields, as in mixtures of isotopes, only
one type of interaction occurs. Chapman describes such molecules as ‘isotopic’, whether
or not they are isotopes of the same element. Prior to Clark Jones’s work $k_T$ had been
evaluated only in the following special cases:

(i) for molecules interacting like rigid elastic spheres of different diameters
$\sigma_1$, $\sigma_2$;



(ii) for mixtures of isotopic molecules which are point centres of repulsive force $F$
varying inversely as the $\nu$th power of the distance $r$, i.e. $F = \kappa r^{-\nu}$.

In both these cases $k_T$ is independent of the temperature.

Experimental determinations of $k_T$ have been made for various gas mixtures
(Ibbs & Grew 1931; Atkins, Bastick & Ibbs 1939, etc.; Grew 1941). A comparison of
the experimental and theoretical values enables an estimate to be made of the force
index $\nu$, assuming the interaction to be an inverse power repulsion. In many cases,
however, the value of $k_T$ changes with temperature. This is an indication that an
inverse power repulsion does not adequately represent the molecular fields, since
for this type of interaction $k_T$ is theoretically independent of the temperature if the
molecules are isotopic, and only slightly dependent otherwise (Chapman 1940).

As a result of recent experimental work on thermal diffusion in mixtures of
isotopes (Nier 1940), in which also a dependence of $k_T$ on temperature was observed,
Clark Jones (1940, 1941) has treated theoretically thermal diffusion in isotopic
mixtures in which the interaction includes an inverse power attraction superimposed
on an inverse power repulsion, thus $F = \kappa r^{-\nu} - \kappa' r^{-\nu'}$ (Lennard-Jones’s model).
This model is on first consideration unpromising as a means of explaining the
observed variation of $k_T$ with temperature, which is a decrease with decreasing
temperature; it suggests, indeed, that the variation should be in the opposite
direction. For, considering a repulsive field only, $F = \kappa r^{-\nu}$, it is seen that

$$\nu = -d\log F/d\log r.$$

This quantity Clark Jones calls the hardness function $h(\nu)$. For the Lennard-Jones
model it is

$$h(\nu) = \nu + Q(\nu - \nu') + O(Q^2), \quad \text{where } Q = \frac{\kappa'}{\kappa}\,r^{\nu - \nu'}.$$

For values of $r$ less than $r_0$ (the value of $r$ at which the potential energy is a minimum),
$Q$ is positive and less than unity, so that $h(\nu) > \nu$, and by an amount which decreases
as $r$ decreases, that is, as the temperature increases. It would be expected then that
$k_T$ would decrease with increasing temperature, contrary to what is observed.
Clark Jones has shown that this is the case only at high temperatures; at lower
temperatures the effect of the attractive field is to cause $k_T$ to decrease as the
temperature falls, and even to change sign.
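The behaviour of the hardness function can be checked numerically. The sketch below is an illustration only: it assumes a force $F = \kappa r^{-\nu} - \kappa' r^{-\nu'}$ with $\nu = 9$, $\nu' = 5$ and, purely for convenience, $\kappa = \kappa' = 1$, so that the force vanishes at $r = 1$. It compares the closed form $(\nu - \nu' Q)/(1 - Q)$ of $-d\log F/d\log r$ with the first-order expansion $\nu + Q(\nu - \nu')$ and with a finite-difference estimate. The function names are hypothetical.

```python
import math

def force(r, nu=9.0, nu_p=5.0, kappa=1.0, kappa_p=1.0):
    """Lennard-Jones type force: inverse power repulsion minus attraction."""
    return kappa * r ** (-nu) - kappa_p * r ** (-nu_p)

def q_ratio(r, nu=9.0, nu_p=5.0, kappa=1.0, kappa_p=1.0):
    """Ratio Q of the attractive term to the repulsive term of the force."""
    return (kappa_p / kappa) * r ** (nu - nu_p)

def hardness_exact(r, nu=9.0, nu_p=5.0, kappa=1.0, kappa_p=1.0):
    """-d log F / d log r in closed form (valid where Q < 1)."""
    q = q_ratio(r, nu, nu_p, kappa, kappa_p)
    return (nu - nu_p * q) / (1.0 - q)

def hardness_first_order(r, nu=9.0, nu_p=5.0, kappa=1.0, kappa_p=1.0):
    """Expansion nu + Q (nu - nu'), accurate when Q is small."""
    q = q_ratio(r, nu, nu_p, kappa, kappa_p)
    return nu + q * (nu - nu_p)

def hardness_numeric(r, h=1e-6):
    """Central-difference estimate of -d log F / d log r (default indices)."""
    up = math.log(force(r * math.exp(h)))
    down = math.log(force(r * math.exp(-h)))
    return -(up - down) / (2.0 * h)

for r in (0.3, 0.5, 0.7):
    print(r, hardness_exact(r), hardness_first_order(r), hardness_numeric(r))
```

In this sketch the hardness exceeds $\nu$ at every separation inside the force zero and falls towards $\nu$ as $r$ decreases, which is the trend described in the text.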

His fullest treatment is for an interaction in which the repulsive index $\nu = 9$, and
the attractive index $\nu' = 5$, a model which was chosen because much of the numerical
work had already been done. His results are shown graphically in figure 1. Here
the ordinate $R_T$ is the ratio of $k_T$ for the particular model to the value for rigid elastic
spheres $k_T(\infty)$. ($R_T$ is more conveniently discussed than $k_T$, since it is roughly
independent of the concentrations of the molecular species and only slightly dependent
on their masses, so that its value is determined mainly by the molecular fields.)
The abscissa is the quantity $\epsilon/kT$, $\epsilon$ being the depth of the potential energy minimum,
and $k$ is the Boltzmann constant. Clark Jones has compared these theoretical values
with some experimental values of $R_T$ obtained by Nier (1940) for mixtures of the
isotopes $^{20}$Ne-$^{22}$Ne and of a second isotopic pair. The observed values are mean values over
a wide temperature range, and the maximum value is considerably greater than the
maximum theoretical value, presumably because the actual repulsive index is
greater than 9. Despite these difficulties the comparison suggests that the observed
variation of $R_T$ could reasonably be explained with the Lennard-Jones model,
though the indices (9, 5) are not suitable, at least for neon.



Figure 1. The variation of $R_T$ with temperature for the (9, 5) model (Clark Jones). The
abscissa is $\epsilon/kT$, which is fixed approximately by the ratio of $T$ to the critical
temperature $T_c$ of the gas.

As a result of this extension of the theory of the thermal diffusion effect further
measurements of the variation of $R_T$ with temperature are desirable. These should
preferably apply to isotopic mixtures, since in them only one type of interaction
has to be considered. But, although for such mixtures the theoretical treatment is
relatively simple, the experimental difficulties are large, and the work which is now
described is a measurement of the thermal diffusion effect over a wide range of
temperature in mixtures of the inert gases, which presumably form the next simplest
case to that of isotopic mixtures.

Some measurements on mixtures of the inert gases have already been reported.
Atkins et al. (1939) examined all the binary combinations of He, Ne, A, Kr and
X, but the effect was measured only for a temperature gradient from 16 to 100° C.
Measurements over a range of temperature from 0 to −180° C were made by Ibbs &
Grew (1931) for He-Xe, He-A and Ne-A mixtures. These indicated clearly that $R_T$
diminishes as the temperature falls. Since this work was done the inert gases have
come to be commercially obtainable in a pure state, and larger volumes may now
be used than formerly. A modification of the original method has been made to
permit measurements on one and the same mixture over a temperature range from
−180 to 400° C.

Experimental method

In general, the method and procedure were the same as in the study of the effect
in hydrogen-deuterium mixtures (Grew 1941). The analysis of the gas mixture
before and after separation was made by a small thermal conductivity gauge to
which a sample of gas of volume 0.2 to 0.4 c.c. was admitted from the diffusion
apparatus by means of a special sampling cock.

The diffusion vessel consisted of a large bulb A, of volume 61.3 c.c., connected by a tube
9.5 cm. long and 3.6 mm. internal diameter to a small bulb B which formed part of a
stopcock, as shown in figure 2. The volume of B was 2.6 c.c. Diffusion proceeded with the
stopcock turned so that the short side tube C was in communication with the bulbs.
Rotation of the cock then spilt the gas in C into the gauge for analysis.

The gas mixtures were prepared in known proportions in a burette attached to the
apparatus and were then admitted to the evacuated bulbs. The pressure was adjusted to
about 30 cm. of mercury at room temperature prior to the high-temperature measurements,
and to about 70 cm. prior to the low-temperature measurements. This ensured that at high
temperatures the pressure did not exceed atmospheric pressure, and at low temperatures
there was sufficient gas in the side tube C for analysis.

The separation was thus measured at pressures which varied with the temperature, but
theoretically the separation is independent of the pressure and experiment has confirmed
this (Grew 1945).
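The choice of filling pressures can be made plausible with an ideal-gas estimate. The sketch below is not from the paper: it assumes a fixed quantity of ideal gas shared between the two bulbs at a common pressure, neglects the connecting tube, and takes room temperature and the small-bulb temperature as 20° C (293.15 K). The function name `pressure_after_heating` is hypothetical; the bulb volumes and filling pressures are those quoted in the text.

```python
def pressure_after_heating(p_fill, t_fill, v_large, t_large, v_small, t_small):
    """Pressure of a fixed quantity of ideal gas shared between two bulbs.

    The gas is admitted at pressure p_fill with both bulbs at t_fill (kelvin);
    afterwards the bulbs are at t_large and t_small. The connecting tube is
    neglected. The result is in the units of p_fill.
    """
    moles_times_r = p_fill * (v_large + v_small) / t_fill
    return moles_times_r / (v_large / t_large + v_small / t_small)

V_A, V_B = 61.3, 2.6   # bulb volumes in c.c., from the text
T_ROOM = 293.15        # assumed room temperature (20 deg C), in kelvin

hot = pressure_after_heating(30.0, T_ROOM, V_A, 673.15, V_B, T_ROOM)
cold = pressure_after_heating(70.0, T_ROOM, V_A, 93.15, V_B, T_ROOM)
print(f"large bulb at 400 C:  {hot:.1f} cm Hg")
print(f"large bulb at -180 C: {cold:.1f} cm Hg")
```

Under these assumptions the 30 cm filling stays below atmospheric pressure (76 cm) with the large bulb at 400° C, and the 70 cm filling falls to roughly a third of its value with the large bulb at −180° C.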

For high temperatures the bulb A was placed in an electric furnace. The support
for the nichrome winding was a thick-walled copper tube on which there were
supplementary windings at either end so that the region near the bulb could be
brought to a uniform temperature. The low-temperature measurements were made
in two ways. In the first method the bulb A (which was now brought below B by
rotation about the ground-joint D) was surrounded by a copper cylinder, a little
longer than the bulb, on which a few turns of nichrome wire were wound to form a
heating coil. The cylinder was supported inside a double-walled vessel of which the
space between the walls could be evacuated. Surrounding this vessel was a vacuum
vessel containing liquid oxygen. Temperatures from −120 to −180° C were thus
readily maintained by adjustment of the heating current and of the vacuum between
the walls of the inner vessel. In the second method, applicable between −20 and
−150° C, the bulb was surrounded with pentane contained in a vacuum vessel, and
cooled, and maintained at constant temperature, by addition of liquid oxygen or
solid carbon dioxide. Manual control sufficed to keep the temperature constant to
within 0.5° C for the 45 min. or so required for equilibrium to be attained and the
samples analyzed. Over the range of temperature common to both methods excellent
agreement was obtained. In those cases in which the boiling-point of the gases
permitted it, the bulb was immersed in liquid oxygen for the lowest temperature.

Figure 2. The diffusion bulbs. (Labels on the figure: water at 20° C; to gauge.)

Temperatures were measured by three copper-constantan couples placed close 
to the upper, middle and lower parts of the bulb A respectively, so that the tempera¬ 
ture variation over the bulb could be measured and a mean temperature found. 
Above 300° C the variation was 4° C; at low temperatures not more than 2° C. The
couple was calibrated frequently, using as reference points the boiling-point of
oxygen, the freezing-point of mercury, and the boiling-points of water, naphthalene
and benzophenone. The temperature of the water circulating about the small bulb
was measured with a mercury thermometer. The temperature was controlled to
within 0.5° C.

Measurement of the separation. The difference in composition of the gas in the
large bulb and in the small bulb when a temperature gradient exists in the connecting
tube is termed the separation. Only the change in composition in the small
bulb was measured, the separation being calculated from this and the relative
volumes and temperatures of the two bulbs. Both bulbs were first brought to the
same temperature and a sample of gas admitted to the gauge, which, as before
(Grew 1941), formed one arm of a bridge across which the potential difference could
be varied. The potential difference to balance the bridge for the uniform mixture
was thus found. The large bulb was then brought to the required temperature, and
after 20 min. or more a sample was withdrawn from the small bulb. From the change
in potential difference required to balance the bridge and the slope of the potential
difference-concentration curve, the change in composition of the gas in the small
bulb was calculated. To ensure that equilibrium had been attained a second sample
was taken later. The separation is then calculated from the following quantities:
$s$ = change in composition in the small bulb; $V_1$ = volume of small bulb;
$V_2$ = volume of large bulb; $T_1$ = temperature of small bulb; $T_2$ = temperature of
large bulb; and $\delta$ = a correction term for the gas in the connecting tube. $\delta$ is a
function of the volumes of the connecting tube and large bulb and of the temperatures
$T_1$ and $T_2$.

In finding $\delta$ it was assumed that the temperature gradient in the connecting
tube was uniform and that $k_T$ could be regarded as constant throughout the gas in
the tube. Both assumptions were sufficiently valid for the calculation of the
correction. $\delta$ increases with the temperature $T_2$; at the highest temperatures used its
value was 0.013.
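How the separation follows from the change observed in the small bulb alone can be sketched from conservation of each component. The code below is an assumption-laden illustration, not the paper's own expression: it treats the gas as ideal at a common pressure, so that the number of molecules in each bulb is proportional to volume divided by temperature, and it includes the connecting-tube allowance only as an optional additive term `delta` whose form is a guess. The function name and the 5% example value are hypothetical.

```python
def total_separation(s, v_small, v_large, t_small, t_large, delta=0.0):
    """Separation between the bulbs from the change s seen in the small bulb.

    Assumes ideal gas at a common pressure, so the number of molecules in
    each bulb is proportional to volume / temperature, and that what one
    bulb gains of a component the other loses. `delta` is an optional
    additive allowance for the connecting tube; its form is an assumption.
    """
    ratio = (v_small / t_small) / (v_large / t_large)
    return s * (1.0 + ratio + delta)

# Bulb volumes (c.c.) from the text; temperatures in kelvin are illustrative.
S = total_separation(s=5.0, v_small=2.6, v_large=61.3,
                     t_small=293.15, t_large=673.15)
print(f"separation = {S:.2f} % for a 5 % change in the small bulb")
```

With the bulb volumes quoted in the text the large bulb contributes only about a tenth of the total even at 400° C, which is why most of the change in composition appears in the small bulb.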

It is estimated that in mixtures containing He or Ne, the error of a measurement
of the percentage separation is ± 0.03. The gauge is less sensitive for A-Kr and A-X
mixtures, and here the error may be ± 0.05.

Preparation of the gases. The inert gases used were all supplied by the British
Oxygen Co. The helium, neon and argon were spectroscopically pure. The krypton
was of German origin and stated to contain 1% xenon. No attempt was made to
purify this gas. The xenon was produced in the U.S.A., and, as its purity was not
given, the gas was fractionated. The gauge readings for the various fractions
indicated that the gas contained a small amount of impurity, about 2% if it was
krypton and a smaller amount if it was a lighter gas. The residue, which alone was
used in the diffusion measurements, contained without doubt only a negligible
amount of impurity.

Results

The values of the separation, when plotted against log $T_2/T_1$ ($T_1$ = constant
temperature of the small bulb, $T_2$ = the varying temperature of the large bulb),
yield in some cases and over a limited range of temperature a straight line,
indicating that the fundamental equation

$$dc = k_T\,d(\log T)$$

may be integrated as if $k_T$ were constant, to give the separation

$$S = c_2 - c_1 = k_T \log T_2/T_1$$

(where $c_1$, $c_2$ are the concentrations at temperatures $T_1$, $T_2$). In general, however,
the linear relation between the separation and log $T_2/T_1$ does not hold if $T_2$ is
sufficiently reduced, and it is necessary to consider the interpretation of the curve of
separation against log $T_2/T_1$ in this case.

$k_T$ is a function of the concentration, and for some types of molecular field of the
temperature also. If the separation is not too large the dependence on concentration
may be ignored and $k_T$ regarded as a function of temperature only. Then

$$S = c_2 - c_1 = \int_{T_1}^{T_2} k_T\,d(\log T),$$

and, since $T_1$ is constant in this method of experiment,

$$dS = k_T\,d(\log T_2).$$

The slope of the curve of separation against log $T_2/T_1$ at any point is therefore a
measure of $k_T$ at the temperature corresponding to that point.
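The slope method can be illustrated with a short finite-difference sketch. It assumes that log denotes the natural logarithm and that separations are expressed as fractions; the function name `local_kt` and the readings are hypothetical, the latter being generated synthetically from a constant $k_T$ of 0.08 rather than measured.

```python
import math

def local_kt(t_small, t_large_values, separations):
    """Estimate k_T from successive points of the separation curve.

    Returns (mean large-bulb temperature, slope dS / d ln(T2/T1)) for each
    adjacent pair of readings. Separations are fractions, not percentages.
    """
    x = [math.log(t2 / t_small) for t2 in t_large_values]
    result = []
    for i in range(len(x) - 1):
        slope = (separations[i + 1] - separations[i]) / (x[i + 1] - x[i])
        t_mid = 0.5 * (t_large_values[i] + t_large_values[i + 1])
        result.append((t_mid, slope))
    return result

# Synthetic readings generated with a constant k_T of 0.08 (not measured data).
T1 = 293.0
T2 = [93.0, 153.0, 233.0, 373.0, 473.0, 673.0]
S = [0.08 * math.log(t / T1) for t in T2]

for t_mid, kt in local_kt(T1, T2, S):
    print(f"{t_mid:6.1f} K  k_T = {kt:.3f}")
```

For real readings, in which $k_T$ falls at low temperatures, the same differences would give a slope that diminishes towards the cold end of the curve.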

The assumption made here that $k_T$ may be considered as independent of the
concentration in the range of separation involved is questionable, as the separation
amounts in some of the mixtures examined to 10%. It is, however, not easily
avoided, and its effect was therefore minimized. First, the mixtures used were those
containing nearly equal proportions of the two components and for which therefore
the variation of $k_T$ with composition is small. Secondly, each observed separation
was corrected so that it applied to a mixture of mean composition the same as that
of the original uniform mixture. In the form of apparatus used, most of the change
in composition occurs in the small bulb. This causes the mean composition of the gas
in the connecting tube to vary with the separation attained, and consideration shows
that there is an apparent decrease in $dS/d(\log T_2)$ as the separation increases, since in all
cases the proportion of the lighter component was less than that for maximum value
of $k_T$. The reverse effect would occur if the change in composition occurred in the
large bulb. For equal changes in composition in the two bulbs, that is, for constant
mean composition of the gas in the connecting tube, the curve of separation against
log $T_2/T_1$ should be linear. Each value of the separation was therefore corrected so that
it applied to a mixture of mean composition equal to that of the original mixture.
For this purpose use was made of the curves showing the separation as a function
of the composition (for constant temperature ratio) given by Atkins et al. (1939). It
is these corrected values which are recorded and from which the values of $k_T$ were
derived. That this treatment is satisfactory is shown by the fact that for those
mixtures in which $k_T$ is expected to be independent of temperature (for example,
He-Ne except at very low temperatures) the plot of separation against log $T_2/T_1$ is a
straight line even though the separation amounts to 8% or more. The apparent variation
of $k_T$ in other cases is therefore a temperature effect.

The results are summarized in table 1, where the values of the separation at
definite values of log $T_2/T_1$ are given for the nine mixtures examined.* $R_T$ is the
ratio of $k_T$ (determined from the slope of the curves of separation against log $T_2/T_1$) to
$k_T(\infty)$, this being the theoretical value of $k_T$ for the same mixture assuming that the
molecules interact like rigid elastic spheres of diameters equal to those deduced
from the viscosity of the pure gases. Expressions for $k_T(\infty)$ have been given by
Atkins et al. (1939). The value of $R_T$ given in the last column is that found by Atkins
et al. for a mixture of the same composition; it is a mean value in the range 15 to 100° C.
There is reasonably good agreement except in the case of He-Ne. The difference
between the present value for this mixture of 0.64 and the value of Atkins et al. of 0.80
is much greater than the experimental error. The cause of this discrepancy is
unknown. In case the gases used had been inadvertently contaminated during
manipulation, new supplies were obtained and further measurements made. The following
are the results:


mixture    % He    k_T      k_T(∞)    R_T
   1       36.8    0.070    0.104     0.67
   2       53.8    0.080    0.124     0.64
   3       62.5    0.078    0.125     0.62
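The three rows above can be checked for internal consistency, reading the third and fourth columns as $k_T$ and $k_T(\infty)$ (an interpretation of the column order, since the ratio of the two reproduces the last column). The sketch below simply recomputes $R_T = k_T/k_T(\infty)$; the helper name `r_t` is hypothetical.

```python
# Rows as tabulated: (mixture, % He, k_T, k_T(infinity), R_T)
rows = [
    (1, 36.8, 0.070, 0.104, 0.67),
    (2, 53.8, 0.080, 0.124, 0.64),
    (3, 62.5, 0.078, 0.125, 0.62),
]

def r_t(k_t, k_t_rigid):
    """Ratio of the observed k_T to the rigid-elastic-sphere value."""
    return k_t / k_t_rigid

for mixture, pct_he, k_t, k_t_rigid, tabulated in rows:
    computed = r_t(k_t, k_t_rigid)
    print(f"mixture {mixture}: computed {computed:.3f}, tabulated {tabulated:.2f}")
```

The recomputed ratios agree with the tabulated ones to within the rounding of the two-figure entries.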


* The method of analysis was not sufficiently sensitive to measure reliably the small
separation in Kr-X mixtures.



Table 1

[Table 1: separation at definite values of log $T_2/T_1$, and $R_T$, for the nine mixtures
examined. The entries are illegible in this copy apart from one row fragment:
50.4% A, $R_T$: 0.13, 0.16, 0.18, 0.29, 0.37, 0.46, 0.17.]





The apparent dependence of $R_T$ on the composition is, at least in its direction, in
accord with theoretical predictions for molecules exerting an inverse power repulsion
(Chapman 1940, p. 58). The higher value of 0.80 is supported by Blüh, Blüh &
Puschner (1937); on the other hand, values deduced from the viscosity of the
pure gases (Clark Jones 1940, p. 116) favour the lower value found in the present
work.


Discussion of results

The results confirm the earlier ones (Ibbs & Grew 1931) in that they show that in
general $R_T$ decreases as the temperature falls. In some mixtures, such as He-Ne,
$R_T$ is constant over almost the whole temperature range; in A-Kr $R_T$ shows a
continuous variation over the whole range. Other mixtures show a more or less extended
temperature range in which $R_T$ is constant, and below this a range in which $R_T$
diminishes as the temperature falls. It is a reasonable conclusion that this behaviour
would be shown by all the mixtures were the temperature range sufficiently extended.
The constancy of $R_T$ over a range of temperature indicates that the molecules are,
in that range, behaving as if they exerted repulsive forces only.

The maximum value of $R_T$ and the corresponding force index. It is remarkable that,
although the temperature at which $R_T$ first becomes constant varies widely from
mixture to mixture, the value of $R_T$ when constant, that is, the maximum value
$R_T$(max.), is much the same for all mixtures, viz. about 0.64. Thus the change in
$R_T$ in passing from one to another of the inert gas mixtures, observed by Atkins et al.
(1939), is now seen to be an accident of the temperature range to which their
measurements were restricted. It appears that at sufficiently high temperatures $R_T$
would be nearly the same for all.

Assuming that, at high temperatures, the molecules behave like centres of
repulsive force only, $F = \kappa r^{-\nu}$, an estimate can be made of the force index
$\nu$ by comparing the experimental value of $R_T$(max.) with the theoretical values of
$R_T$ for different values of $\nu$. A difficulty arises in the comparison, however, in
that theoretical values of $R_T$ are known exactly only for the special case of a mixture
in which the ratio of the molecular masses is very large and the concentration of the
heavier molecule tends to unity (the Lorentzian case).* In the general case only the ratio
of the first approximations to $k_T$, $[k_T(\nu)]_1/[k_T(\infty)]_1$, is known, whereas the
theoretical value of $R_T$ is more properly given by $k_T(\nu)/[k_T(\infty)]_1$ (where
$k_T(\nu)$ is the exact value of $k_T$) to correspond with the experimental value of $R_T$
defined as

$$k_T(\text{experimental})/[k_T(\infty)]_1.$$

For small values of the mass ratio and of $\nu$ the error of the first approximation is
small, but it increases to about 23% in the Lorentzian case with $\nu = \infty$. In the

* Values of $\nu$ have sometimes been estimated from theoretical values of $R_T$ for this
special case, but more recent work (Grew 1941) has shown that this leads to considerable
error, and the procedure here adopted is preferable.



Thermal diffusion in mixtures of the inert gases 411 

absence of an exact knowledge of the variation of the error, it may be assumed that
it increases linearly with the proportionate mass difference defined by

$$M = (m_1 - m_2)/(m_1 + m_2).$$

The error has been estimated for the case $M = 0$ and for $\nu = 5, 9, 13, 17, \infty$ (Grew
1941, p. 397), and it is known in the Lorentzian case ($M = 1$) for these same values
of $\nu$. Linear interpolation thus gives the error for intermediate values of $M$ and $\nu$.
Then using Chapman’s (1940, p. 58) data, curves can be drawn showing $R_T$ as a function
of $\nu$ for various values of $M$. From these curves the values of $\nu$ shown in table 2
were found to correspond with the experimental values of $R_T$(max.). As the mixtures are
not isotopic, $\nu$ derived in this way must be regarded as a mean value for the different
molecular interactions involved. The possible error of 0.01 in $R_T$ introduces an
uncertainty of 0.4 in the values of $\nu$.

Table 2. Values of the proportionate mass ratio $M$, of $R_T$(max.)
and of the corresponding force index $\nu$

mixture        He-Ne  He-A   He-Kr  He-X   Ne-A   Ne-Kr  Ne-X   A-Kr   A-X
$M$            0.67   0.82   0.91   0.94   0.33   0.61   0.73   0.36   0.53
$R_T$(max.)    0.64   0.66   0.67   0.66   0.57   0.64   -      -      -
$\nu$          11.0   10.9   10.8   10.5   10.9   11.2   -      -      -

Thus in the six mixtures for which values can be stated v is close to 11. This suggests 
that, in so far as they can be represented by an inverse power repulsion, that is, at 
high temperatures, the fields of the inert gas atoms are alike in having the same, or 
nearly the same, force index and differ only in the force constant. 
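
The values of $M$ in table 2 follow directly from the definition $M = (m_1 - m_2)/(m_1 + m_2)$. The short sketch below recomputes them. The atomic masses are modern rounded values and are an assumption of this example (the paper does not list the masses it used); the function and variable names are likewise illustrative.

```python
# Proportionate mass difference M = (m1 - m2)/(m1 + m2) for the nine mixtures.
# ASSUMPTION: modern rounded atomic masses; the paper does not state its own.
ATOMIC_MASS = {"He": 4.003, "Ne": 20.18, "A": 39.95, "Kr": 83.80, "X": 131.3}

MIXTURES = ["He-Ne", "He-A", "He-Kr", "He-X", "Ne-A", "Ne-Kr", "Ne-X", "A-Kr", "A-X"]


def proportionate_mass_difference(m1, m2):
    """Return M with the heavier mass taken as m1, so that 0 <= M < 1."""
    heavy, light = max(m1, m2), min(m1, m2)
    return (heavy - light) / (heavy + light)


def mixture_M(mixture):
    """M for a mixture written as 'X-Y', e.g. 'He-Ne'."""
    first, second = mixture.split("-")
    return proportionate_mass_difference(ATOMIC_MASS[first], ATOMIC_MASS[second])


if __name__ == "__main__":
    for name in MIXTURES:
        print(f"{name:6s} M = {mixture_M(name):.2f}")
```

With these masses the computed values agree with the tabulated ones to within about 0.01.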

The variation of $R_T$ with temperature. Reference has already been made to Clark
Jones’s (1940, 1941) calculation of $R_T$ for the Lennard-Jones molecular field, which
includes an inverse power attraction superimposed on a repulsive force: thus
$F = \kappa r^{-\nu} - \kappa' r^{-\nu'}$. The calculated values refer to the special case
of $\nu = 9$, $\nu' = 5$. $R_T$ is given as a function of a quantity $\epsilon/kT$, where $k$
is the Boltzmann constant, $T$ the absolute temperature, and $\epsilon$ is the minimum
potential energy of two interacting molecules. According to Lennard-Jones and Devonshire,
$\epsilon$ is related to the critical temperature $T_c$ of the gas: $\epsilon = kT_c$, or
better $\epsilon = kT_c/1.22$ (cf. Fowler & Guggenheim 1939, p. 345). Using this latter
relation, the curve marked (9, 5) of figure 3a, b has been drawn, representing $R_T$ as a
function of $\log T/T_c$. The curve $(9, \infty)$ is that for a molecule which exerts a
repulsive force only, with index 9. In both cases the values of $R_T$ apply to an isotopic
mixture in which $M$ is small. Also, $R_T$ is here the ratio of the first approximations
$[k_T(9, 5)]_1/[k_T(\infty)]_1$.

To compare the experimental results with these theoretical ones some assumption
must be made about the ‘critical temperature’ to be assigned to the gas mixtures
since they are not isotopic. Clark Jones gives reasons for taking as the critical
temperature $T_c$ of a mixture the mean proportional of the critical temperatures of
the two components. If this is done the following values are found:

mixture      He-Ne  He-A   He-Kr  He-X   Ne-A   Ne-Kr  Ne-X   A-Kr   A-X
$T_c$ (°K)   15     27.5   32.5   38     81.5   96     112    178    209
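
The ‘mean proportional’ used above is the geometric mean, $T_c = (T_{c1} T_{c2})^{1/2}$. The following sketch reproduces the tabulated values approximately. The critical temperatures of the pure gases are modern values and are an assumption of this example (the paper does not quote the figures it used), so small differences of a degree or so from the table are to be expected; all names are illustrative.

```python
import math

# ASSUMPTION: modern critical temperatures of the pure gases, in kelvin.
CRITICAL_T = {"He": 5.2, "Ne": 44.4, "A": 150.7, "Kr": 209.4, "X": 289.7}


def mixture_critical_temperature(mixture):
    """Geometric mean of the component critical temperatures for a mixture 'X-Y'."""
    first, second = mixture.split("-")
    return math.sqrt(CRITICAL_T[first] * CRITICAL_T[second])


def reduced_log_temperature(T, mixture):
    """log10(T/T_c), the abscissa against which R_T is plotted."""
    return math.log10(T / mixture_critical_temperature(mixture))


if __name__ == "__main__":
    for name in ["He-Ne", "He-A", "He-Kr", "He-X", "Ne-A", "Ne-Kr", "Ne-X", "A-Kr", "A-X"]:
        print(f"{name:6s} T_c = {mixture_critical_temperature(name):6.1f} K")
```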





The experimental values of $R_T$ at temperature $T$ (the $T$ of table 1) may now be
plotted against $\log T/T_c$. But as $R_T$(max.) is not the same for all the mixtures,
partly because of the dependence on the mass ratio, the maximum values of $R_T$
for each mixture (except A-Kr, A-X, for which a constant value of $R_T$ is not reached)
have first been made equal to 0.63 and the other values changed in proportion.
These are the values shown in figure 3a, b.




Figure 3a, b. $R_T$ as a function of $\log T/T_c$. Experimental values ○, × etc. (9, 5): Clark
Jones’s theoretical values. $(9, \infty)$: for repulsive field only, index 9. Broken curve: a
reference curve for comparison of figures 3a, b and 4.

It is apparent from the curves that: 

(1) The experimental values of $R_T$ are higher than the theoretical value for the
same value of $\log T/T_c$. In particular, the maximum experimental value is greater
than the maximum theoretical value, and since in this temperature range the
attractive field has little effect, the discrepancy is an indication that the repulsive
force index in these mixtures is greater than 9. It has, indeed, already been estimated
to be 11 when a repulsive field only is assumed to be operative. This value would be
somewhat smaller for a Lennard-Jones field, for Clark Jones’s results show that in
this temperature range the effect of the attractive field is to increase $R_T$ above the
value for the repulsive field only.

(2) In some cases, such as Ne-A, the form of the experimental curve is strikingly
similar to that of the theoretical curve. More frequently, however, $R_T$ changes less
rapidly at the lowest temperatures than at the intermediate ones, so that the
experimental curve becomes convex to the temperature axis. The explanation of this
behaviour may perhaps be given when there are available theoretical data for force
indices other than (9, 5). It is a reasonable supposition, however, that as $\nu'$ increases
above 5, $R_T$ changes sign, if it does so at all, at progressively lower temperatures.
As a probable value of $\nu'$ is 7, this would explain the fact that the experimental results
give no indication of $R_T$ becoming negative.


[Figure 4: $R_T$ (ordinate, 0 to 0.8) plotted against $\log T/T_c$ (abscissa, -0.2 to 1.4); curves for the Ne isotopes and the A isotopes.]

Figure 4. Stier’s experimental values for Ne isotopes and for A isotopes. Broken curve: as
in figure 3.

(3) The experimental curves are coincident (except at the lowest temperatures) 
to a degree which is significant when the range of the ‘ critical temperature ’ and the 
method by which it was calculated are taken into consideration. The formal relation 
between R^ and the critical temperature brought out by Clark Jones has therefore 
this much experimental support.

The variation of $R_T$ in isotopic mixtures. It is interesting to compare with the
results for non-isotopic mixtures the measurements of $R_T$ in the isotopic mixtures
Ne$^{20}$-Ne$^{22}$ and A$^{36}$-A$^{40}$ made by Stier (1942). The curves of figure 4 show Stier’s
values of $R_T$ as a function of $T/T_c$. The A values, and the Ne values too in the
low-temperature range, conform satisfactorily with the others, when it is remembered
that the mass ratio $M$ in these cases is only 0.05 and $R_T$ may in consequence be
10 to 20% smaller than for the non-isotopic mixtures with their larger mass ratio.
The persistent increase of $R_T$ at the higher temperatures in the case of the Ne
isotopes is unexpected.

In spite, then, of the difficulty introduced into their interpretation by the
non-isotopic nature of the mixtures, it seems that the experimental results could be
satisfactorily explained with molecular fields of the Lennard-Jones type if the force
indices were suitably chosen. The results suggest, indeed, that the measurement of
the temperature variation of $R_T$ in isotopic mixtures would permit of the
determination of both indices, $\nu$, $\nu'$, with some precision. A knowledge of these values
combined with the treatment of the second virial coefficient due to Lennard-Jones (1931)
and Buckingham (1938) would enable the force constants $\kappa$, $\kappa'$ to be determined,
and the molecular fields would then be completely known. Further progress depends on the
provision of more experimental data for isotopic mixtures, and of theoretical data
for molecular fields with values of $(\nu, \nu')$ other than (9, 5).

I acknowledge with gratitude a grant from the Carnegie Trust for the Universities 
of Scotland for the purchase of apparatus and materials. I am indebted also to 
Dr R. Fürth and to Dr T. L. Ibbs for discussions and criticism.


References 

Atkins, B. E., Bastick, R. B. & Ibbs, T. L. 1939 Proc. Roy. Soc. A, 172, 142.
Blüh, G., Blüh, O. & Puschner, M. 1937 Phil. Mag. 24, 1103.
Buckingham, R. A. 1938 Proc. Roy. Soc. A, 168, 264.
Chapman, S. 1940 Proc. Roy. Soc. A, 177, 38.
Fowler, R. H. & Guggenheim, E. A. 1939 Statistical Thermodynamics. Camb. Univ. Press.
Grew, K. E. 1941 Proc. Roy. Soc. A, 178, 390.
Grew, K. E. 1945 Nature, 156, 267.
Ibbs, T. L. & Grew, K. E. 1931 Proc. Phys. Soc. 43, 142.
Jones, R. Clark 1940 Phys. Rev. 58, 111.
Jones, R. Clark 1941 Phys. Rev. 59, 1019.
Lennard-Jones, J. E. 1931 Proc. Phys. Soc. 43, 461.
Nier, A. O. 1940 Phys. Rev. 57, 338.
Stier, L. G. 1942 Phys. Rev. 62, 548.



A theory of the dependence of the rate of detonation of solid 
explosives on the diameter of the charge 

By H. Jones, Imperial College, London
(Communicated by Sir Geoffrey Taylor, F.R.S. Received 16 September 1946)

The velocity of a detonation wave, in a cylindrical charge of solid explosive, is shown to be
dependent on the diameter of the charge, and the relation between the velocity and the
diameter is calculated. It is shown that this effect depends upon the rate of the chemical
reaction occurring in the front portions of the detonation wave, and that it is possible,
therefore, to determine this rate of reaction by measuring the velocity of detonation in bare
charges of different diameters. The effect of a metal case surrounding the charge is also
briefly discussed.

Introduction 

A successful theory of the detonation of gases contained in tubes was first given by
D. L. Chapman (1899) and developed in greater detail by Jouguet (1906).

These authors were able to predict with considerable accuracy the velocity of the 
detonation wave in an explosive gas in terms of the heat of the reaction, and such 
quantities as the specific heats of the resultant gases. An interesting feature of the 
theory is that, not only is the velocity of the detonation wave determined, but also 
the pressure and the temperature immediately behind the detonation front. In 
accordance with many observations, the theory is based on the assumption that the 
detonation process is a steady one, i.e. that the velocity of the detonation wave 
remains constant. It is on this account that, in this theory, the rate of the reaction 
occurring in the initial portion of the detonation wave does not enter into the 
calculation of the velocity of propagation. 

It is natural that the Chapman-Jouguet theory should have been applied to the 
detonation of cylindrical charges of solid and liquid high explosives. Many authors 
have attempted this extension of the theory but with only limited success. There
are two main reasons why a theory of detonation in solids is more difficult than in 
gases. First, because the products of the explosion are, in general, more complicated; 
many different molecular species being formed, and, secondly, because the pressure, 
temperature and molecular volume in the detonation wave have values which lie
far outside the range which has been studied in other connexions. Thus very little 
guidance is available for setting up an equation of state applicable to the conditions 
of the detonation wave. 

Apart, however, from the difficulty associated with the equation of state, the
dynamics of the Chapman-Jouguet theory require modification when applied to
solid explosives. This arises from the fact that, while the detonation of a gas in a
rigid tube leads to a flow which may be regarded as strictly one-dimensional, in
solid high explosives no practicable confinement can prevent a certain radial
expansion of the decomposing materials. As far as the determination of the velocity
of detonation is concerned, only the flow within that region of the wave where the




decomposition of the explosive takes place is of significance. The subsequent flow 
of the reaction products has no effect on the steady velocity of the detonation wave. 

The effect of radial expansion within the reaction zone can easily be understood 
in general terms. It is known from experience that the detonation velocity of a solid 
explosive generally increases with the loading density. Hence, if as a result of radial 
expansion the density is reduced before the reaction is complete, part of the explosive 
decomposes at a lower effective loading density, and a velocity of detonation may 
be expected less than that which would occur if no radial expansion took place, and 
therefore less than that given by a straightforward application of the Chapman- 
Jouguet theory. It is with this problem of the reduction of the detonation velocity 
by radial expansion within the reaction zone that this paper is concerned. The main 
interest in this matter lies in the fact that it provides a means of determining
experimentally the rate of the reaction in a detonation wave. Clearly the degree of effective
radial expansion depends upon the confinement of the charge and upon the length
of the reaction zone. A knowledge of the length of the reaction zone gives at once
the time required for the complete decomposition of the explosive and the release
of the chemical energy. In the second and third sections of this paper formulae are
given from which the time of reaction can be obtained from data giving the variation
of the velocity of detonation with the diameter of the charge, or, for cased charges,
with the weight of the casing. 

It is often convenient to consider the detonation process as judged from axes 
moving with the detonation wave. The front portion of this wave is simply a powerful 
shock wave travelling through the unexploded material. It is the compression and 
the temperature in this shock-wave front which initiates the reaction. If we imagine 
the wave to be travelling from right to left, the shock-wave front, from the moving 
axes, appears to be stationary and unexploded material appears to be passing 
through it from left to right with the detonation velocity. It is assumed that, at 
points near the axis, the shock front may be regarded as a plane normal to the axis. 
Since the detonation process is a steady one the velocity of detonation is, therefore, 
by this assumption, determined entirely by the dynamical conditions within a small 
stream tube coaxial with the charge. For any other stream tube, off the axis, the 
expansion of the cross-section is, in general, greater than that of the axial tube in the 
same distance, and therefore leads to a smaller steady velocity. However, since the 
whole wave proceeds without change of form it follows that those parts of the shock 
front near to the cylindrical surface of the charge must be inclined to the axis. This 
paper, however, will not be concerned with the form of the detonation front, but 
only with the way in which the radius of the charge, or confinement by metal tubes, 
affects the velocity of detonation. 

Effect of the radial expansion on the velocity of detonation 

Consider first, from a reference system moving with the detonation wave, a 
stream tube, whose axis coincides with that of the charge, and which begins at the 
shock-wave front. Let the radius of this stream tube at any point further along the





Detonation of gases contained in tubes 


jet be denoted by $r$ in units of its initial radius. It is convenient to work entirely with
dimensionless variables such as $r$. Thus if $p$ denotes the pressure in excess of one
atmosphere, $U$ the velocity of the detonation wave, $\Delta$ the loading density of the
explosive, and $\rho$ the density of the fluid at any point beyond the shock-wave front,
then let

$$y = \Delta/\rho, \qquad z = p/U^2\Delta. \qquad (1)$$


The reaction is initiated at the shock-wave front and develops over a certain distance
of the jet; this region is referred to as the reaction zone. Let the fraction of the
explosive which has decomposed up to a certain point in the reaction zone be $\epsilon$, so
that $\epsilon$ varies from 0 to 1 from the shock-wave front to the point where the reaction
is complete. If, at any point in the reaction zone, $E$ denotes the internal energy per
unit mass measured relative to the energy of the unexploded material at atmospheric
pressure, and $Q$ the chemical energy which has been liberated per unit mass by the
reaction up to that point, then both the quantities $E$ and $Q$ are functions of $\epsilon$, $y$
and $z$, and each has the dimensions of the square of a velocity.

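The conservation laws, applied across the discontinuity of the shock-wave front,
lead, as is well known, to the equation

$$E_0 U^{-2} = \tfrac{1}{2} z_0 (1 - y_0), \qquad (2)$$

where the suffix 0 refers to the high pressure side of the shock front.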


The work done by an element of the fluid by expansion, per unit mass, as it passes
through the detonation wave is given by $\int p\,d(1/\rho)$, where the integration is taken
from the value of the density $\rho_0$ at the high pressure side of the shock front to the
value $\rho$ at the particular point under consideration. Thus, if a dimensionless function
$f$ is defined by the equation

$$f(\epsilon, y, z) = U^{-2}[E - Q], \qquad (3)$$

the equation expressing the conservation of energy can be written in the form

$$f(\epsilon, y, z) = \tfrac{1}{2} z_0 (1 - y_0) - \int_{y_0}^{y} z\,dy, \qquad (4)$$

where $z_0$ and $y_0$ correspond to the pressure and the density at the high pressure side
of the shock front, i.e. at the front of the reaction zone. In this equation it is assumed
that no appreciable amount of energy is transmitted from one element of the fluid
to another by heat conduction.

As has already been explained, in all cases the relative radial expansion of the
stream tube which is being considered is small, and therefore it is a legitimate
approximation to assume that at every point of the cross-section the pressure is
the same. Also, of course, to the same approximation the density and the fluid
velocity are constant over the cross-section. The fluid velocity, relative to the
detonation wave at rest, is denoted by $q$, so that the equation of continuity is

$$qU^{-1} = y r^{-2}, \qquad (5)$$


and Bernoulli’s equation is

$$\tfrac{1}{2} q^2 + \int \frac{dp}{\rho} = \text{constant}. \qquad (6)$$

One cannot immediately carry out the integration in this equation, because here
$\rho$ is not merely a function of $p$, as for adiabatic expansion, but depends also upon
$\epsilon$. Using (1) and (5) and the fact that $r = 1$ when $z = z_0$, i.e. at the front of the
reaction zone, (6) can be expressed

$$\tfrac{1}{2} y^2 r^{-4} + \int_{z_0}^{z} y\,dz = \tfrac{1}{2} y_0^2. \qquad (7)$$

Applying the laws of the conservation of mass and momentum across the
shock-wave front gives, in terms of the non-dimensional variables,

$$z_0 = 1 - y_0. \qquad (8)$$

Differentiating (7) one can write

$$dz = \tfrac{1}{2} y\,d(r^{-4}) - d(y r^{-4}),$$

so that integrating and using (8) gives

$$z = 1 - y r^{-4} - 2\int_1^r y r^{-5}\,dr. \qquad (9)$$
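
The intermediate steps between Bernoulli’s equation and (9) can be written out as follows. This is a sketch, assuming the definitions $y = \Delta/\rho$ and $z = p/U^2\Delta$, the continuity equation in the form $q = U y r^{-2}$, and the shock relation $z_0 = 1 - y_0$; it adds nothing beyond the algebra.

```latex
\begin{align*}
\tfrac12 q^2 + \int \frac{dp}{\rho} &= \text{constant},
  \qquad q = U y r^{-2}, \qquad \frac{dp}{\rho} = U^2\, y\, dz, \\
\tfrac12 y^2 r^{-4} + \int_{z_0}^{z} y\, dz &= \tfrac12 y_0^2
  \qquad (r = 1,\ y = y_0 \text{ at } z = z_0), \\
y r^{-4}\, dy + \tfrac12 y^2\, d(r^{-4}) + y\, dz &= 0
  \quad\Longrightarrow\quad
  dz = \tfrac12 y\, d(r^{-4}) - d(y r^{-4}), \\
z - z_0 &= -2 \int_1^{r} y r^{-5}\, dr - y r^{-4} + y_0, \\
z &= 1 - y r^{-4} - 2 \int_1^{r} y r^{-5}\, dr
  \qquad (\text{using } z_0 = 1 - y_0).
\end{align*}
```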

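If the values of the variables $r$, $y$, $z$, when $\epsilon = 1$, are denoted by $r_1$, $y_1$,
$z_1$, and if a mean value $\bar y$ of $y$ throughout the reaction zone is defined by the
equations

$$\int_1^{r_1} y r^{-5}\,dr = \bar y \int_1^{r_1} r^{-5}\,dr = \tfrac{1}{4}\bar y\,(1 - r_1^{-4}), \qquad (10)$$

then (9) becomes, for $\epsilon = 1$,

$$z_1 = 1 - y_1 r_1^{-4} - \tfrac{1}{2}\bar y\,(1 - r_1^{-4}). \qquad (11)$$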


In the Chapman-Jouguet theory it is assumed that the gas velocity relative to 
the detonation wave, at the point where the reaction is complete, is just equal to 
the velocity of sound in the product gases at that point. This so-called Chapman- 
Jouguet condition has been the subject of much discussion and it now appears to 
be well justified both theoretically and also experimentally in so far as it leads to 
correctly calculated detonation velocities. This condition will also be applied here 
in the case where there is a small radial expansion within the reaction zone. This 
should, in general, be substantially correct though in the rather special case of an 
explosive containing a very slowly reacting component it may need modification. 
Accordingly 

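$$\frac{q_1^2}{U^2 y_1^2} + \frac{dz}{dy} = 0, \qquad (12)$$

where $dz/dy$ denotes the adiabatic variation of pressure with volume. When $\epsilon = 1$,
equation (4) represents the adiabatic variation of $y$ with $z$ so that

$$\frac{dz}{dy} = -\left[z + \frac{\partial f}{\partial y}\right] \Big/ \frac{\partial f}{\partial z}, \qquad (13)$$

and the Chapman-Jouguet condition becomes, using (5),

$$r_1^{-4}\left(\frac{\partial f}{\partial z}\right)_1 = z_1 + \left(\frac{\partial f}{\partial y}\right)_1. \qquad (14)$$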



When $f$ is a known function of $y$ and $z$, equations (11) and (14) determine $y_1$ and $z_1$
in terms of $r_1$ and $\bar y$. If these values are substituted into equation (4) one obtains
a relation which determines $U$ as a function of $r_1$. To do this first simplify (4)
by using (7) and so obtain

$$f(1, y_1, z_1) = \tfrac{1}{2} - y_1 z_1 - \tfrac{1}{2} y_1^2 r_1^{-4}. \qquad (15)$$

From this point no further progress can be made without some knowledge of the
form of $f$ for the product gases of the detonation. It is not an unsatisfactory
approximation to assume that the internal energy is proportional to the temperature, and
it will be shown that it is not necessary to know the constant of proportionality, for
this can be eliminated from the final result. The dependence of $f$ upon the pressure
and the density is, however, needed and this requires an equation of state applicable
to the conditions existing in the explosion products. For the purpose of calculating
the detonation velocity from thermochemical data a rather complicated form is
needed to give good results, but for the present purpose, which is to obtain the
correction called for by radial expansion, it should be sufficient to use an equation
depending simply on a constant co-volume, especially as this co-volume can be
eliminated from the final result. Therefore it may be assumed that

$$f(1, y_1, z_1) = k z_1 (y_1 - b) - Q_1 U^{-2}, \qquad (16)$$

where $k$ and $b$ are two constants, the former being related to the specific heats of the
gaseous products and the latter being the co-volume in units of $1/\Delta$.

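Substituting from (16) into (14), and solving the resultant equation together with
(11) for $y_1$ and $z_1$, one obtains

$$(1 + 2k)\,y_1 = (1 + k)\,r_1^{4}\{1 - \tfrac{1}{2}\bar y (1 - r_1^{-4})\} + bk, \qquad (17)$$

$$(1 + 2k)\,z_1 = k\{1 - \tfrac{1}{2}\bar y (1 - r_1^{-4})\} - bk\,r_1^{-4}. \qquad (18)$$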
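
As a numerical check, the expressions for $y_1$ and $z_1$ can be substituted back into the two conditions from which they were obtained. The sketch below assumes that (11) reads $z_1 = 1 - y_1 r_1^{-4} - \tfrac12\bar y(1 - r_1^{-4})$ and that the Chapman-Jouguet condition with the co-volume form of $f$ reduces to $r_1^{-4} k (y_1 - b) = (1 + k) z_1$; both forms are reconstructions, and the function names are illustrative only.

```python
def post_reaction_state(k, b, y_bar, r1):
    """y1 and z1 at the end of the reaction zone, from (17) and (18)."""
    s = r1 ** -4                          # r1^(-4)
    A = 1.0 - 0.5 * y_bar * (1.0 - s)     # the recurring bracket {1 - y_bar(1 - r1^-4)/2}
    y1 = ((1.0 + k) * A / s + b * k) / (1.0 + 2.0 * k)
    z1 = (k * A - b * k * s) / (1.0 + 2.0 * k)
    return y1, z1


def residuals(k, b, y_bar, r1):
    """Residuals of (11) and of the Chapman-Jouguet condition; both should vanish."""
    y1, z1 = post_reaction_state(k, b, y_bar, r1)
    s = r1 ** -4
    res_11 = z1 - (1.0 - y1 * s - 0.5 * y_bar * (1.0 - s))
    res_cj = s * k * (y1 - b) - (1.0 + k) * z1
    return res_11, res_cj


if __name__ == "__main__":
    # Illustrative values of the order of those quoted for T.N.T.
    print(post_reaction_state(0.95, 0.237, 0.75, 1.02))
    print(residuals(0.95, 0.237, 0.75, 1.02))
```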

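Introducing these values of $y_1$ and $z_1$ into (15) and using equation (16) one obtains

$$2(1 + 2k)\frac{Q_1}{U^2} = k^2(1 - b)^2 + (1 + k)^2\left[r_1^{4}\{1 - \tfrac{1}{2}\bar y(1 - r_1^{-4})\}^2 - 1\right] + k^2 b(\bar y - b)(1 - r_1^{-4}). \qquad (19)$$

When there is no radial expansion in the reaction zone, i.e. $r_1 = 1$, the velocity of
detonation may be denoted by $U_0$ and from (19) one obtains

$$2(1 + 2k)\,Q_1/U_0^2 = k^2(1 - b)^2. \qquad (20)$$

Eliminating $Q_1$ from (19) and (20) and rearranging the terms finally results in

$$\left(\frac{U_0}{U}\right)^2 = 1 + (r_1^{4} - 1)\left[\frac{(1 + k)^2\{1 - \bar y + \tfrac{1}{4}\bar y^2(1 - r_1^{-4})\} + k^2 b(\bar y - b)\,r_1^{-4}}{k^2(1 - b)^2}\right]. \qquad (21)$$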

It is worth remarking that this equation remains valid even when $(r_1 - 1)$ is not small.
The form of the equations of motion and continuity, (6) and (5), only requires that
the rate of expansion along the stream tube shall be small. However, in practical
applications, $(r_1 - 1)$ is usually very small compared with unity, i.e. $U$ is, in general,
only a little less than the maximum possible velocity $U_0$. Hence, one may, to a
sufficient approximation, write $r_1 = 1$ for all terms inside the square brackets. These
terms are denoted by $c$ and therefore

$$c = \frac{(1 + k)^2(1 - \bar y) + k^2 b(\bar y - b)}{k^2(1 - b)^2}. \qquad (22)$$

For $r_1 = 1$ equation (17) becomes

$$(1 + 2k)\,y_1 = 1 + k + bk, \qquad (23)$$

and, using this equation, (20) may be written

$$1 + 2k = \frac{2Q_1}{U_0^2}(1 - y_1)^{-2}. \qquad (24)$$

$U_0$ is known with considerable accuracy for many explosives, and a very good idea
of the magnitude of $Q_1$ can be obtained from experiments such as those of Robertson
& Garner (1933). These experiments do not give $Q_1$ exactly because the end products
after adiabatic expansion are, in general, rather different from the products in the
detonation wave front. The most uncertain quantity on the right-hand side of (24)
is $y_1$. The reciprocal of this, $1/y_1$, is the compression ratio, i.e. the ratio of the density
of the explosion products in the detonation front to the loading density of the
explosive. Detailed calculations according to the Chapman-Jouguet theory usually
give values of this ratio about 1.33, varying somewhat from one explosive to another.
It will therefore be assumed that $y_1 = \tfrac{3}{4}$. For $U_0$ take $6.8 \times 10^{5}$ cm./sec., which
is about the value for T.N.T., and for $Q_1$ 1000 cal./g., which is also about right for
T.N.T. These values give, according to (24), $k = 0.95$ and according to (23) $b = 0.237$.
Thus (22) becomes

$$c = 7.24(1 - \bar y) + 0.41(\bar y - 0.237). \qquad (25)$$

At the beginning of the reaction zone $\rho_0$ must be rather greater than the true crystal
density of the explosive, since in the shock front the pressure is very high, being of
order $10^{5}$ atm. Now for cast explosives $\Delta$ is usually not very much less than the
true density, so that for such explosives $y_0$ cannot be a small fraction. As we know
from the dynamics that $y \le y_1$ throughout the reaction zone, it may be assumed that
for cast explosives $\bar y$ is approximately equal to $y_1$. In this limiting case, in which
$\bar y = \tfrac{3}{4}$, we have, from (25), $c = 2.02$.

For charges made up of powdered explosives the loading density may be a half,
or less, of the true density, in which case $\bar y$ may be somewhat less than 0.75. If the
value $\bar y = 0.6$ is taken, then (25) gives $c = 3.05$. Thus it must be concluded that the
numerical value of the coefficient $c$ in the formula

$$\left(\frac{U_0}{U}\right)^2 = 1 + c\,(r_1^{4} - 1) \qquad (26)$$

is approximately equal to 2.0 for cast explosives, but may be a little greater for
powdered charges of low loading density.
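
The numerical values quoted above can be recovered from the T.N.T. data in a few lines. The sketch below assumes 1 cal = 4.184 × 10^7 erg, takes $k$ from $1 + 2k = (2Q_1/U_0^2)(1 - y_1)^{-2}$ and $b$ from $(1 + 2k)y_1 = 1 + k + bk$, and evaluates $c$ as the bracketed terms at $r_1 = 1$. The exact ratio $(U_0/U)^2$ is obtained by dividing the general energy relation by its $r_1 = 1$ limit; the comparison with $1 + c(r_1^4 - 1)$ is a first-order form assumed for this example. All function names are illustrative.

```python
CAL_TO_ERG = 4.184e7  # ASSUMPTION: thermochemical calorie, in erg


def constants_from_detonation_data(U0_cm_s, Q1_cal_g, y1):
    """Return (k, b) from the ideal detonation velocity, heat of explosion and y1."""
    Q1 = Q1_cal_g * CAL_TO_ERG                      # erg/g, i.e. (cm/s)^2
    one_plus_2k = 2.0 * Q1 / (U0_cm_s ** 2 * (1.0 - y1) ** 2)
    k = 0.5 * (one_plus_2k - 1.0)
    b = (one_plus_2k * y1 - 1.0 - k) / k
    return k, b


def coefficient_c(k, b, y_bar):
    """The bracketed terms evaluated at r1 = 1."""
    return ((1 + k) ** 2 * (1 - y_bar) + k ** 2 * b * (y_bar - b)) / (k ** 2 * (1 - b) ** 2)


def velocity_ratio_squared(k, b, y_bar, r1):
    """Exact (U0/U)^2: the general energy relation divided by its r1 = 1 limit."""
    s = r1 ** -4
    A = 1.0 - 0.5 * y_bar * (1.0 - s)
    general = (k ** 2 * (1 - b) ** 2
               + (1 + k) ** 2 * (A * A / s - 1.0)
               + k ** 2 * b * (y_bar - b) * (1.0 - s))
    return general / (k ** 2 * (1 - b) ** 2)


if __name__ == "__main__":
    k, b = constants_from_detonation_data(6.8e5, 1000.0, 0.75)
    print(f"k = {k:.3f}, b = {b:.3f}")
    for y_bar in (0.75, 0.6):
        print(f"y_bar = {y_bar}: c = {coefficient_c(k, b, y_bar):.2f}")
    r1 = 1.01
    exact = velocity_ratio_squared(k, b, 0.75, r1)
    approx = 1.0 + coefficient_c(k, b, 0.75) * (r1 ** 4 - 1.0)
    print(f"r1 = {r1}: exact {exact:.4f}, first-order {approx:.4f}")
```

For a 1% expansion of the stream tube the first-order form differs from the exact ratio only in the third decimal place, which illustrates why setting $r_1 = 1$ inside the brackets is adequate.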





The radial expansion of a bare cylindrical charge 

In this section, the value of $r_1$ is calculated for a bare cylindrical charge, of radius
$R$, detonating in a vacuum. Instead of actually finding the expansion in the reacting
gases, which would be very difficult, the corresponding radial expansion in a jet of
the product gases is found, assuming an adiabatic pressure volume relation. The
value of $r_1$ is taken to be the relative expansion of the axial stream tube in a distance
$X$ equal to the length of the reaction zone. The dependence of the degree of expansion
on $X$ and the charge radius $R$ should be very similar indeed in the two cases and,
therefore, this approximation should give a satisfactory account, with equation (26),
of the dependence of $U$ upon $X$ and $R$.



The method of calculation is best explained by reference to a figure. In figure 1
let AB denote the plane of a nozzle from which gas streams into an evacuated space
from left to right. The velocity at AB is equal to the local velocity of sound. If $p_0$
and $\rho_0$ are the pressure and density at AB, it is assumed that elsewhere, in the stream,
the relation between $p$ and $\rho$ is given by

$$p/p_0 = (\rho/\rho_0)^{\gamma}. \qquad (27)$$

The appropriate value to take for $\gamma$ in this equation can be obtained as
follows. Consider the expansion just behind the detonation front in the purely



one-dimensional case in which there is no radial expansion, then, according to the
Chapman-Jouguet condition,

$$q^2 = \frac{dp}{d\rho} = \gamma\,\frac{p}{\rho}, \qquad (28)$$

where the form (27) is used for the adiabatic relation. According to (5), for $r = 1$,
it is found that $q = yU$ and therefore (28) gives $y_1 = \gamma z_1$, which together with (9),
again for $r = 1$, gives

$$\gamma = y_1/(1 - y_1). \qquad (29)$$

As in the previous section, take $y_1 = \tfrac{3}{4}$, from which results $\gamma = 3$. The adiabatic
relation (27) with $\gamma = 3$ will, of course, be only approximately correct for small
expansions from the initial condition at AB. This, however, is all that is required
in the present connexion since we are only concerned with distances along the stream
less than a radius in length. The physical reason for the high value of $\gamma$ is that the
explosion products form initially a very imperfect gas.

The flow in the immediate neighbourhood of a point such as A or B is identical
with the flow round a straight edge, which may be analyzed simply, as was first 
shown by Meyer. It will be assumed that the form of the stream lines, and the 
pressure along them, near the circular edge through A can be taken from Meyer’s 
solution. It appears, for points not more than a radius downstream and not further 
from the edge than is indicated by the point D, that this approximation is not too 
unsatisfactory for the present purpose. Consider a tube generated by the 
rotation of a stream line of Meyer’s solution such as DO about the axis, then within 
this region calculate the flow and the pressure, using equations (5) and (6). The
pressure along the same line DO given by the two solutions will not, of course, be 
quite the same. That stream line is chosen, therefore, which minimizes the integral 
of the magnitude of the pressure difference along a distance R of the jet. 

Let the components of the velocity at O be denoted by $u$ and $v$, $u$ referring to the
component along the radius vector $s$, i.e. from A to O, and $v$ the component at right
angles to $s$ counted positive in the direction of $\theta$ increasing. $\theta$ is the angle which $s$
makes with the plane of the shock front denoted by AB. If $V$ denotes the velocity
of the issuing gas, which in this case is equal to the local velocity of sound, Meyer’s
solution is expressed by the following equations:

$$v = V\cos\lambda\theta, \qquad (30)$$

$$u = (V/\lambda)\sin\lambda\theta, \qquad (31)$$

where

$$\lambda = \{(\gamma - 1)/(\gamma + 1)\}^{1/2}. \qquad (32)$$

$v$ is given in terms of the density by the relation

$$v^2 = \frac{dp}{d\rho}. \qquad (33)$$

The equation of a stream line in polar coordinates $(s, \theta)$ is as follows:

$$s = s_0(\cos\lambda\theta)^{-1/\lambda^2}. \qquad (34)$$

Expressed in terms of $r$ and $\theta$ this equation becomes

$$r = 1 + \omega\{1 - (\cos\lambda\theta)^{-1/\lambda^2}\cos\theta\}, \qquad (35)$$

where

$$\omega = s_0/(R - s_0). \qquad (36)$$

The pressure along the stream line as given by Meyer’s solution is denoted by $p_m$.
Thus, according to (27), (30) and (33),

$$p_m/p_0 = (\cos\lambda\theta)^{2\gamma/(\gamma - 1)}. \qquad (37)$$

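Now consider the flow inside a tube generated by rotating the stream line (34)
about the axis of symmetry. It will be assumed that inside this tube the pressure
may be regarded as constant over planes normal to the axis. Thus Bernoulli’s
equation may be written as follows:

$$\tfrac{1}{2} q^2 + \frac{\gamma}{\gamma - 1}\,\frac{p}{\rho} = \frac{\gamma + 1}{2(\gamma - 1)}\,V^2, \qquad (38)$$

and this, together with (27) and the equation of continuity

$$\rho q r^2 = U\Delta, \qquad (39)$$

gives at once

$$r^{-4} = \left(\frac{p}{p_0}\right)^{2/\gamma}\left[\frac{\gamma + 1}{\gamma - 1} - \frac{2}{\gamma - 1}\left(\frac{p}{p_0}\right)^{(\gamma - 1)/\gamma}\right]. \qquad (40)$$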

Using (35) to determine r, we find from (40) the pressure along the inside of the stream
tube as given by the solution based on (38) and (39). This pressure is denoted by $p_i$.
When γ = 3 the formulae simplify, and if non-dimensional variables are introduced
as follows:

$$\pi_i = (p_i/p_0)^{2/3}, \qquad \pi_m = (p_m/p_0)^{2/3}, \tag{41}$$

one gets from (35) and (40)

$$\pi_i(2 - \pi_i) = \{1 + \omega(1 - \cos^{-2}(\theta/\sqrt{2})\cos\theta)\}^{-4} \tag{42}$$

and from (37)

$$\pi_m = \cos^2(\theta/\sqrt{2}). \tag{43}$$

The factor $(1 - \cos^{-2}(\theta/\sqrt{2})\cos\theta)$, which is denoted by g(θ), is small for all
values of θ less than about 60°, in fact g(60°) = 0.0824. Thus, since ω is less than unity,
expansion of the fourth power in (42) results in

$$\pi_i = 1 - 2\omega^{1/2}g^{1/2}(\theta). \tag{44}$$
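The smallness of g(θ) and the quality of the expansion can be checked numerically. The sketch below assumes γ = 3, takes g(θ) = 1 − cos θ / cos²(θ/√2), and compares the root π ≤ 1 of π(2 − π) = (1 + ωg)⁻⁴ with the leading term 1 − 2(ωg)^{1/2}. The function names and the trial value ω = 0.85 are illustrative choices, not part of the paper.

```python
import math

SQRT2 = math.sqrt(2.0)

def g(theta):
    """g(theta) = 1 - cos(theta) / cos^2(theta / sqrt 2), the gamma = 3 form."""
    return 1.0 - math.cos(theta) / math.cos(theta / SQRT2) ** 2

def pi_exact(omega, theta):
    """Root with pi <= 1 of pi * (2 - pi) = (1 + omega * g(theta))**-4."""
    rhs = (1.0 + omega * g(theta)) ** -4
    # pi * (2 - pi) = 1 - (1 - pi)^2, so 1 - pi = sqrt(1 - rhs)
    return 1.0 - math.sqrt(max(0.0, 1.0 - rhs))

def pi_series(omega, theta):
    """Leading term of the expansion: 1 - 2 * (omega * g(theta))**0.5."""
    return 1.0 - 2.0 * math.sqrt(max(0.0, omega * g(theta)))

if __name__ == "__main__":
    print(f"g(60 deg) = {g(math.radians(60.0)):.4f}")
    for deg in (15, 30, 45, 60):
        th = math.radians(deg)
        print(f"{deg:2d} deg  exact {pi_exact(0.85, th):.4f}  series {pi_series(0.85, th):.4f}")
```

The two columns stay close at small angles and separate gradually as θ approaches 60°, which is the range over which the expansion is used.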

Now find the stream line which gives the best approximation, i.e. the best
value of ω to give the closest agreement between $p_i$ and $p_m$ over a distance along the
axis of the order of the length of one charge radius. This is done by minimizing the
integral

$$\int_0^{1.0} (\pi_i - \pi_m)^2\,d\theta \tag{45}$$

with respect to ω.





H. Jones 


The range of θ from 0 to 1.0 radian corresponds to a distance along the axis of
about one charge radius. Minimizing (45) is simpler than minimizing the integral
of $(p_i - p_m)^2$ and is as effective for the present purpose. In this way

$$\omega^{1/2} = \frac{\int_0^{1.0} g^{1/2}(\theta)\,\sin^2(\theta/\sqrt{2})\,d\theta}{2\int_0^{1.0} g(\theta)\,d\theta} \tag{46}$$

is obtained, and a numerical integration gives $\omega^{1/2}$ = 0.922, and therefore ω = 0.85.
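The numerical integration can be repeated in a few lines. The sketch below assumes γ = 3, so that the Meyer pressure variable is cos²(θ/√2) and the stream-tube variable is 1 − 2ω^{1/2}g^{1/2}(θ); the least-squares condition over 0 ≤ θ ≤ 1 then gives ω^{1/2} as a ratio of two integrals. Simpson's rule and the function names are choices made here for illustration, not the method of the paper.

```python
import math

SQRT2 = math.sqrt(2.0)

def g(theta):
    """g(theta) = 1 - cos(theta) / cos^2(theta / sqrt 2), the gamma = 3 form."""
    return 1.0 - math.cos(theta) / math.cos(theta / SQRT2) ** 2

def simpson(f, a, b, n=2000):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0

def best_sqrt_omega(theta_max=1.0):
    """Value a = omega**0.5 minimising the integral over 0..theta_max of
    (sin^2(theta / sqrt 2) - 2 * a * sqrt(g(theta)))**2."""
    def weighted(t):
        # sqrt(g) * (1 - pi_m), with 1 - pi_m = sin^2(theta / sqrt 2)
        return math.sqrt(max(0.0, g(t))) * math.sin(t / SQRT2) ** 2
    return simpson(weighted, 0.0, theta_max) / (2.0 * simpson(g, 0.0, theta_max))

if __name__ == "__main__":
    a = best_sqrt_omega()
    print(f"omega^(1/2) = {a:.3f}, omega = {a * a:.3f}")
```

The result agrees with the quoted values to within the accuracy to be expected of a hand integration.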
Denoting distance along the axis by x gives, as can be seen immediately from the
figure and equation (34),

$$\frac{x}{R} = \frac{\omega}{1+\omega}\,\frac{\sin\theta}{\cos^2(\theta/\sqrt{2})}. \tag{47}$$

Hence, substituting the numerical value of ω into (35) and (47), the following
equations, which give r as a function of x/R in terms of the parameter θ, are obtained:

$$r = 1.85\{1 - (x/R)\cot\theta\}, \tag{48}$$

$$\frac{x}{R} = 0.92\left(\frac{\sin\theta}{1 + \cos\sqrt{2}\,\theta}\right). \tag{49}$$

The following table shows numerically the relation between r and x/R given by these
equations.

x/R          0.2    0.3    0.4    0.5    0.6    0.7    0.8    0.9    1.0

(r-1)10^2    0.16   0.48   1.20   2.40   4.08   6.24   8.85   11.80  15.10
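The relation between r and x/R can be traced numerically from the parametric form. The following sketch assumes ω = 0.85 and the γ = 3 expressions x/R = {ω/(1 + ω)} sin θ / cos²(θ/√2) and r = 1 + ω{1 − cos θ / cos²(θ/√2)}. The bisection routine and all function names are illustrative choices, not part of the paper.

```python
import math

OMEGA = 0.85  # value of omega quoted in the text
SQRT2 = math.sqrt(2.0)

def x_over_R(theta, omega=OMEGA):
    """Axial distance x/R along the stream line, from x = s * sin(theta)."""
    return omega / (1.0 + omega) * math.sin(theta) / math.cos(theta / SQRT2) ** 2

def r_of_theta(theta, omega=OMEGA):
    """Relative radius r = 1 + omega * (1 - cos(theta) / cos^2(theta / sqrt 2))."""
    return 1.0 + omega * (1.0 - math.cos(theta) / math.cos(theta / SQRT2) ** 2)

def theta_for(x, lo=0.0, hi=1.5, iters=60):
    """Bisection for theta at a chosen x/R (x/R increases with theta on this range)."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if x_over_R(mid) < x:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def expansion_table(xs):
    """Return (x/R, (r - 1) * 100) pairs for the requested values of x/R."""
    return [(x, (r_of_theta(theta_for(x)) - 1.0) * 100.0) for x in xs]

if __name__ == "__main__":
    for x, pct in expansion_table([0.1 * k for k in range(2, 11)]):
        print(f"x/R = {x:.1f}   (r - 1) * 100 = {pct:.2f}")
```

For the larger values of x/R the figures obtained in this way are close to the tabulated ones. The smallest entries are sensitive to rounding, since r − 1 arises as the difference of two nearly equal quantities.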


The percentage diminution in the detonation velocity due to radial expansion within
the reaction zone may be taken as $10^2(U_0 - U)/U$, and from (26), with c = 2.0 and
remembering that $(r_1 - 1) \ll 1$,

$$10^2(U_0 - U)/U = 4(r_1 - 1)10^2$$

approximately. Thus the percentage diminution of the velocity of detonation below
its maximum value is given, for small values of $(r_1 - 1)$, by multiplying the figures
of the lower line of the above table by 4. For instance, if the length of the reaction
zone is one-half of the radius of the charge, the velocity will be 9.6 % below the
maximum value as given by the one-dimensional Chapman-Jouguet theory, or as
observed in charges of very large diameter.
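The rule of multiplying the lower line of the table by 4 is easily mechanized. The sketch below stores the tabulated values of (r − 1)10² as read from the table and applies the factor 4, which holds only for c = 2 and small expansions; the dictionary and the function name are illustrative.

```python
# Lower line of the table, (r - 1) * 100, keyed by x/R (values as read from the table).
EXPANSION_PERCENT = {0.2: 0.16, 0.3: 0.48, 0.4: 1.20, 0.5: 2.40, 0.6: 4.08,
                     0.7: 6.24, 0.8: 8.85, 0.9: 11.80, 1.0: 15.10}

def velocity_deficit_percent(zone_over_radius):
    """Percentage fall of U below U0, taken as 4 * (r1 - 1) * 100 (c = 2, small expansion)."""
    return 4.0 * EXPANSION_PERCENT[zone_over_radius]

if __name__ == "__main__":
    for x in sorted(EXPANSION_PERCENT):
        print(f"X/R = {x:.1f}   deficit = {velocity_deficit_percent(x):.1f} %")
```

For X/R = 0.5 this reproduces the 9.6 % quoted above. For the larger values of X/R the expansion is no longer small and the linear rule should be treated with caution.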

It is useful, for some purposes, to define a reaction time τ by the relation X = Uτ,
though it must be remembered, if one wants to connect τ with the actual progress
of the reaction, that the velocity of an element in the reaction zone, relative to the
shock-wave front, is not U but q, so that the true time of reaction is $\int_0^X dx/q$, which
is roughly $\tfrac{4}{3}\tau$.


The radial expansion of a cased cylindrical charge

In this section the effect on the velocity of detonation of a metal tube surrounding
the charge is considered. The present discussion is limited to a rather special
case; a fuller consideration of confined charges will be given in another paper.




For the present, it will be supposed that the metal case can be represented by a thin 
flexible sheath of the same mass per unit length. It is not a serious approximation 
to neglect the strength of the metal, except in very special cases, since the yield
stress, even of steel, is small compared with the stresses arising near the detonation
front. However, the neglect of the finite thickness and, therefore, of the compression
of the metal is a serious approximation, and it is with this that the later paper will
deal. 

It can be safely assumed that the expansion of the metal tube is entirely radial,
so that the mass per unit length, denoted by m, is constant throughout. Moreover,
for small expansions, it may be assumed that $d^2r/dt^2 = U^2\,d^2r/dx^2$, so that in this case
the equation of motion for the metal case may be written

$$\frac{d^2r}{dx^2} = \frac{2\pi\Delta}{m}\,zr. \tag{50}$$


This equation could not be solved exactly without a detailed knowledge of the way
in which the rate of the reaction depends upon the temperature and pressure, for
only then could we determine the dependence of z upon x and r from the equations
of the first section. For small expansions, however, z may be assumed to have the
same form as in the one-dimensional case. It follows that, by (9), z = 1 − y, where y
varies between $\Delta/\rho_0$ at the beginning of the reaction zone and $\Delta/\rho_1$ at the end. It
has been seen previously that for cast explosives both these ratios are approximately
3/4. Hence (50) will give a good idea of the variation of r with x, if z is taken to
be constant and equal to 1/4. Equation (50) may now be written

$$\frac{d^2r}{dx^2} = \frac{\Delta}{4\sigma R}\,r, \tag{51}$$


where σ is the mass per unit area of the confining tube before expansion, i.e.
2πRσ = m. The solution of (51) for which dr/dx = 0 and r = 1 at x = 0 is

$$r = \cosh\left\{\frac{x}{2}\left(\frac{\Delta}{\sigma R}\right)^{1/2}\right\}. \tag{52}$$


Thus, if X denotes the length of the reaction zone, 


(53)


results from (26). When the percentage diminution of the velocity of detonation,
due to radial expansion, is small, (53) may be written, assuming c = 2,

$$\frac{U_0 - U}{U} = \frac{X^2\Delta}{2\sigma R}. \tag{54}$$


As a numerical example, consider an iron tube 1 cm. internal radius and 4 mm.
thick, containing an explosive at density 1.6 g./cm.³ and for which X = 5 mm.
These figures give X/R = 0.5 and ΔR/σ = 0.5 from which, by (54), the detonation
velocity is found to be 6.25 % below $U_0$. It may be noted that the same charge
completely bare would, according to the formulae of the preceding section, have a
velocity of detonation 9.6 % less than $U_0$. Thus for such a charge the confining tube
increases the rate of detonation by a little over 3 %.
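The arithmetic of this example can be set out as a short calculation. The sketch below treats the case as a thin sheath, so that σ is the wall thickness multiplied by the density of the metal, and takes the fractional fall in velocity as 4(r₁ − 1) with r₁ = cosh{(X/2)(Δ/σR)^{1/2}}, together with its small-expansion form ½(X/R)²(ΔR/σ). The density of iron (7.86 g./cm.³), the values c = 2 and z = 1/4, and the function names are assumptions made here for illustration.

```python
import math

def sheath_mass_per_area(thickness_cm, metal_density):
    """sigma: mass per unit area of the case treated as a thin sheath (g/cm^2)."""
    return thickness_cm * metal_density

def cased_deficit(X_over_R, loading):
    """Fractional fall of U below U0 for a cased charge.

    loading is the ratio Delta * R / sigma. Returns (cosh form, small-expansion form),
    both assuming c = 2 and z = 1/4.
    """
    r1 = math.cosh(0.5 * X_over_R * math.sqrt(loading))
    return 4.0 * (r1 - 1.0), 0.5 * X_over_R ** 2 * loading

if __name__ == "__main__":
    sigma = sheath_mass_per_area(0.4, 7.86)   # 4 mm of iron; density is an assumed value
    loading = 1.6 * 1.0 / sigma               # Delta * R / sigma for R = 1 cm
    full, small = cased_deficit(0.5, 0.5)     # rounded figures used in the text
    print(f"Delta R / sigma = {loading:.2f}")
    print(f"deficit: cosh form {100 * full:.2f} %, small-expansion form {100 * small:.2f} %")
```

With the rounded figures X/R = 0.5 and ΔR/σ = 0.5 the small-expansion form gives 6.25 %, and the cosh form differs from it only in the third significant figure.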

The writer is indebted to the Director-General of Scientific Research (Defence), 
Ministry of Supply, for permission to publish this paper, and also to Sir Geoffrey 
Taylor for many helpful discussions. 


References

Chapman, D. L. 1899 Phil. Mag. 47, 90.

Jouguet, E. 1906 J. Math. pures appl. 6 (Series II), 5.

Robertson, R. & Garner, W. E. 1923 Proc. Roy. Soc. A, 103, 539.



Antoine Laurent Lavoisier 
26 August 1743—8 May 1794 

By Sir Harold Hartley

(Delivered 16 November 1944 —Received 15 January 1946) 

[Plates 10 and 11] 

We have met to-day to honour the memory of a great Frenchman. Antoine Laurent 
Lavoisier is one of the immortals. In the whole history of science there is no
transformation so swift and dramatic as when in 1789, in his great Treatise, he gave
chemistry its modern form, sweeping away the cobwebs of centuries which obscured
its progress. He was a child of his age. Bred in France at a time when the writings
of Voltaire and Rousseau were stirring men’s minds, and Diderot and the
Encyclopaedists were widening their vision, Lavoisier was a great reformer. It is difficult
within the compass of an hour to do justice to his many-track mind. Ambition,
curiosity, humanity and love of action took him into many fields, and in each his 
creative genius saw an opportunity for constructive work. The six great volumes 
of his collected writings display an amazing intellectual and practical achievement. 
In chemistry, in physics, in physiology, in chemical engineering, in agriculture, in 
geology, in education, in statistics and in finance he was a pioneer, and in each his 
contribution has the modern touch both in thought and phrase. The marvel is that 
with his widespread interests it was he alone, in a band of brilliant contemporaries 
like Black, Cavendish, Priestley and Scheele, who had the vision of modern
chemistry and gave it life. Alone among the intellectuals Lavoisier was a man of 
affairs, skilled in administration and finance, and that is in some measure the 
secret of his success. For it can be said of him that he applied not only the balance 
but the principle of the balance sheet to chemistry and physiology. 

Lavoisier’s life was such a tangled skein of occupations that they cannot be seen
in their true perspective under separate headings. Their reactions on each other 
are the clue to so much that, even at the sacrifice of logical sequence, I shall try to 
picture the daily current of his life flowing in its many channels. 

Lavoisier was a Foreign Member of the Royal Society and, although he never 
came to England, at many critical moments of his scientific work he was influenced 
directly or indirectly by Fellows of our Society. You will see that Robert Boyle, 
John Mayow, John Locke through his disciple Condillac, Stephen Hales, that most 
ingenious vicar of Teddington, Black, Priestley and Cavendish all had their influence
on Lavoisier.

Early life

He was born in Paris on 26 August 1743. His father was a lawyer, and his
mother and grandmother came of legal stock, so that he had the law in his blood,
and no one could draft an opinion more skilfully. After a classical education at the 


Vol. 189. A. (3 June 1947) 



Mazarin College, he turned to science, and was fortunate in his teachers. Mathematics
and astronomy he learnt from La Caille, chemistry from Rouelle, geology
from Guettard, and botany from de Jussieu. From their teaching not only did he gain 
a broad general scientific background, but he was bitten with the idea of research. 

He qualified as a lawyer, but he could afford to choose his own career, and his 
ambition turned to science. There was a touch of Francis Bacon in his make-up, a 
belief that by making systematic measurements the truth would emerge. He began 
like Dalton as a meteorologist, and his first published paper was on an aurora (1).
Throughout his life he kept careful records of daily thermometric and barometric 
observations, made by himself and by correspondents in different parts of the 
world. He was always meaning to collate them, but that was never done, and his 
only meteorological paper was on forecasting, in which he suggested the value of a 
daily weather forecast (2). It was in meteorology that he served his apprenticeship 
in physical measurement, investigating the accuracy of the barometer and thermometer,
and the best construction of the portable instruments which he sent to his
correspondents. 

Geology 

In 1763, when he was twenty, Guettard interested him in his project of a 
geological map of France, the first of its kind. It appealed to Lavoisier’s systematic 
mind, and he threw himself into the scheme with characteristic energy, writing to 
ministers for support, and drawing up questionnaires to go to the provinces. For 
seven years Lavoisier was making geological tours to collect data for the survey, 
and his journeys gave him a cross-section of social conditions all over France that 
left its mark on his mind. 

He travelled on horse-back with his servant carrying his barometer, thermometer,
hydrometer, and box of reagents. His observations included both mineral
deposits and the thickness and characteristics of the various strata at different
exposures. He measured in each case their height above sea-level with his barometer,
so that he knew the levels at which they occurred in different places. His
material formed the basis of sixteen plates in the first geological map of France 
which was published in 1784 by Guettard’s successor Monnet without proper 
acknowledgement of Lavoisier’s work. His only substantial geological paper, on 
recent sedimentary rocks (3), was not published until 1788. In it he distinguishes
between littoral and pelagic banks which were formed at different distances from 
the land and consisted of distinct kinds of sediments and organisms. His sections 
gave the first outline of a correct classification of the Tertiary deposits of the Paris 
region. Geikie said of Lavoisier that 'if he had not given himself up to chemistry, 
he might have become one of the most illustrious among the founders of geology’. 

Public lighting

Throughout his life Lavoisier’s interests were divided between pure science and 
its application to practical problems. He had a deep sense of responsibility that
science should be used in the service of man, and he was always ready to turn 





aside from his researches to some practical task. In 1765 the Academy offered a 
prize for a paper on the lighting of the streets of a large city and Lavoisier decided 
to compete. He made an elaborate investigation, both practical and theoretical, 
of various methods of illumination; of the relative cost of lighting Paris with candle
or oil lamps, of the advantage of various types of reflexion, of the time of burning of 
different kinds of oil, and of the use of dissolved resin to prevent the freezing of oil 
in winter. The prize was divided between three manufacturers, but Lavoisier’s 
paper, an early and admirable piece of industrial research, was awarded a gold 
medal given by the King and was published by the Academy (4).

Académie des Sciences

1768 was the fateful year of Lavoisier’s life. In February the great promise he 
had shown as an investigator was recognized by his election as a member of the 
Academy of Sciences. From the day he joined that august body as 'adjoint 
chimiste’ his wide knowledge and experience, his diligence and his ability in 
drafting memoranda made him one of its most active members. Throughout his 
life a considerable portion of his time was devoted to its service, and Lavoisier 
was a most staunch defender of its privileges. He became a full member in 1778,
director in 1785, and treasurer in 1791 until its suppression in 1793. The Government
was constantly remitting questions to the Academy for reports, and Lavoisier’s
was the guiding hand in their preparation. He wrote over fifty memoranda on
subjects ranging over The Great Frost of 1776, Prison Reform, Mesmerism,
Water-Divining, the Creuzot Works, and The Development of Balloons (5).

The Ferme

A few days after he had joined the Academy he became a member of the Ferme,
the financial corporation that leased from the Government for periods of six years 
the privilege of collecting the indirect taxes. The annual payments had to be made 
in advance, so that membership of the Ferme involved a considerable investment 
of capital, the return on which depended on the ability of the administration and 
the state of trade. For Lavoisier it was not merely an investment, as he threw 
himself into the task of administration with his usual energy. His practical ability 
and his conscientious management soon made him one of the leading members of
the Corporation. Lavoisier’s scientific friends shook their heads at his new venture,
but said cynically 'Well, at any rate he will give us better dinners’.

The collection of customs duties, octroi, and salt tax and the sale of tobacco, 
involving an annual income approximating 200 million livres, required an organization
of nearly 30,000 officials working all over France. This was supervised by a
number of committees drawn from the sixty members of the Corporation, their 
work involving a series of visits to the provinces. Lavoisier served on some of 
these, and he was specially charged with inspection of the tobacco factories and of 
the frontier guards to prevent smuggling. These journeys occupied a large part of his 
time from 1769 to 1771. 



Marriage 


In 1771 Lavoisier married the daughter of Jacques Paulze, one of his colleagues 
in the Ferme. His wife was only fourteen years old, but she was a girl of character 
and intelligence and devoted her life to helping her husband (see plate 10). She 
learnt languages and translated Priestley and Cavendish for him, and later she 
published a translation of the essays of Kirwan. She was constantly in the laboratory, 
and many pages of the records of his experiments are in her handwriting. She studied 
painting with David, and two drawings showing Lavoisier’s experiments on respiration
with Seguin are by her hand (see plate 11). She also drew and engraved all
the plates for his Traité Élémentaire de Chimie. Benjamin Franklin, in writing to
thank her for his portrait, said: ‘It is allowed by those who have seen it to have
great merit as a picture in every respect; but what particularly endears it to me is 
the hand that drew it.’ 

We get a glimpse of her in Arthur Young’s Diary in 1787 (6). 'To Monsieur
Lavoisier by appointment. Madame Lavoisier, a lively, sensible, scientific lady,
had prepared a déjeuné Anglois of tea and coffee, but her conversation on Mr
Kirwan’s Essay on Phlogiston, which she is translating from the English, and on
other subjects, which a woman of understanding, that works with her husband in 
his laboratory knows how to adorn, was the best repast.’ 

Gouverneur Morris, the United States Minister in Paris, had to discuss with 
Lavoisier the price the Ferme should pay for Virginian tobacco and also a debt 
that was owing by his Government. His Diary of the French Revolution (7) gives us
a picture of Lavoisier as a business man and of Madame Lavoisier as his hostess: 

2 June 1789: Return Home, and then call on Monsr. Lavoisier.... Talk of a future Contract.
Thinks it should exist for a Part of the Supply. I agree with him and add that I would rather
contract for the best Quality and leave the inferior Kinds to the Chance of Marketts. He asks
at what Price. I tell him that what I say must not draw to any Consequence but be considered
merely as Conversation. That it may be from 32 to 33. He thinks this much too high.

8 June 1789: Dine with Mr. de Lavoisier; as I am leaving he tells me that the Farm are 
determined to stand Suit and that he is sorry for it. Madame appears to be an agreable 
Woman. She is tolerably handsome, but from her Manner it would seem that she thinks her 
forte is the Understanding rather than the Person. 

26 June 1789: Return Home and take a little Medicine and then go to Dinner at Mr. Lavoisier’s. 
They are just returned from Versailles and Madame gives me the News.... At Dinner, Mr.
Lavoisier, who seems to be a sensible and well informed Man, tells me that the usual Produce
of France is from 5 to 6 Times the Seed sown.

25 September 1789: Go to the Opera according to my Promise and arrive towards the Close 
of the Piece at the Loge of Madame Lavoisier.... Go to the Arsenal and take Tea with Madame
Lavoisier en attendant le Retour de Monsieur who is at the Hôtel de Ville. As Madame tells me
that she has no Children I insist that she is une Paresseuse, but she declares it is only
Misfortune.... Monsieur comes in and tells us of the Obstination of the Bakers. This Corps threatens
the Municipality of Paris with a Discontinuance of their Occupation unless a Confrere, justly
confined, is released. Thus the new Authority is already trampled on. I take Mr. Lavoisier 
apart and propose to him to negotiate with the Farm for the Debt of the United States. I offer 
him also a Concern. He will I think accept it, tho he rather objects. He wishes to know my 
Terms. I speak vaguely. 




25 July 1791: Call on Mr. Franklin and we go together to Made, de Lavoisier’s to Dinner. 
As the Party is rather High-Flying they conclude Haphazard that the Riot at Birmingham 
has been occasioned by the Government. This is ridiculous enough. 

7 November 1791: Mr. Franklin dines here, and we go together after Dinner to see Made. 
Lavoisier where there are a Number of Gens d’Esprit who are in general but so so Company.

Few men can have been more happily married than Lavoisier. His wife shared 
in all his activities, she helped him to fight his scientific battles, and in the dark 
days to come she risked everything to try and save him. 


Early chemical researches 

Even before his election to the Academy Lavoisier’s interests were turning from 
geology to chemistry. His first two chemical papers were on gypsum, its composition
and its solubility in water (8). Lavoisier used a hydrometer to show that the
solution of gypsum was denser than water, and this led him to a systematic study 
of the subject of hydrometry in which he devised a new form of constant-immersion 
hydrometer, a small portable version of which he used to examine the densities of 
a large number of natural waters during his geological tour, in order to determine 
their content of dissolved salts. His practical mind saw the importance of water 
supplies to cities, and he was a strong supporter of de Parcieux’s scheme for improving
the water supply of Paris, on which he read papers to the Academy in 1769.

In order to standardize his hydrometers Lavoisier studied the effect of repeated
distillation of water on its density and also the variation of density with temperature,
in which he observed the increase of density to a maximum above the freezing- 
point. He was unaware that this had been discovered in Italy in the seventeenth 
century. 

In 1768 he began to investigate the supposed conversion of water into earth 
which had been a matter of so much controversy since the time of Boyle. Lavoisier’s
method of attack is characteristic of his experimental methods (9). He weighed a
retort, introduced into it a known amount of fresh rain water, and luted on a closed 
receiver to act as a reflux condenser, after heating the retort to expel some of the 
air. The whole apparatus was then carefully weighed and the retort heated for 
110 days in an oil-bath to about 80° C. A deposit of earth slowly formed in the
retort, but the weight of the apparatus at the end was unchanged, showing that it 
had gained no fire material nor lost any water through the glass. This must have had 
an important influence on Lavoisier’s ideas by strengthening his tacit assumption 
of the conservation of mass, and by showing him that in chemical changes involving 
exposure to heat the ‘fire material’ did not necessarily, as was often supposed, 
cause any change of weight. The solid was then collected, the water evaporated 
and the weight of the total dry solid residue was compared with that of the loss of
weight of the empty retort during the operation. The solid weighed 20.4 grains,
while the retort had lost only 17.4 grains. Lavoisier could not explain the greater
weight of the solid residue (caused, as Meldrum has pointed out, by the absorption




by alkali of carbon dioxide from the air), but he concluded rightly that in spite of 
this discrepancy the solid must have come from the glass and not from the water. 

This was the most significant of Lavoisier’s early papers, and it was followed 
early in 1772 by an investigation with Macquer and Cadet of the effect of heat on 
diamonds (10). The varying resistance of gem stones to heat had excited the curiosity
of experimenters for centuries, and the causes of the loss of weight of the diamond
on heating had been continuously a matter for inquiry. Was it due to true volatilization
as Boyle thought, or to a species of combustion as Macquer’s recent observations
indicated, or to a decrepitation into small fragments caused by contact with cold 
air ? Those were the rival views which Lavoisier and his friends set out to investigate 
by comparing the effects of heating diamonds in open and closed vessels. Heated in 
an open retort to a high temperature for some hours they were found to lose weight 
and become discoloured. On the other hand, diamonds heated in a clay pipe filled
with charcoal and closed with a lute, the pipe itself being enclosed in a nest of three 
crucibles with a similar lute to exclude air, lost no weight and showed only a slight 
superficial darkening and no loss of polish after exposure to the highest temperature
available. Hence they concluded that the loss of weight depends on contact with 
the air and is due either to combustion or to some mechanical effect of the air in 
causing decrepitation. These experiments were continued later by Lavoisier using 
the great burning glass of Tschirnhausen, but the paper was not printed until 1776,
and it is not clear at what date it was presented to the Academy (11). Lavoisier’s
notebook shows that these experiments were still in progress in 1773. 

Combustion 

The experiments on diamonds and probably papers by Sage and Cigna on the 
burning of phosphorus gave Lavoisier an interest in combustion, and his knowledge 
of the work of Boyle and Hales shows that he was studying the literature. At this 
time Stahl’s theory of phlogiston still held the field. It was generally accepted by
chemists that substances are inflammable because they contain this mysterious 
principle, which was liberated with heat and light so long as air was there to 
remove it. Respiration was thought to be a similar process, the air removing 
phlogiston from the lungs. It was a looking-glass hypothesis, a relic of alchemy, 
but it had an astonishing hold on chemists’ minds until Lavoisier broke it. 

The critical moment for Lavoisier came on 10 September 1772(12) when, having 
bought an ounce of phosphorus, he found it could be burnt in glass vessels without 
breaking them and began to examine the absorption of air already noticed by
Cigna and others during its combustion. On 20 October he sent a sealed note to the 
Secretary of the Academy giving the results of his experiments which showed that 
air is absorbed when phosphorus is burnt and that the phosphoric acid weighs more 
than the phosphorus (13). On 1 November he sent another note describing his
discovery that sulphur, like phosphorus, gains in weight during combustion, and 
that this increase of weight is due to a prodigious quantity of air which is fixed 
during combustion (14).





This discovery, which I have established by experiments that I regard as decisive,
has led me to think that what is observed in the combustion of sulphur and phosphorus might
well take place with all bodies that gain weight by combustion and calcination;... Experiment
has completely confirmed my conjectures; I have carried out the reduction of litharge in closed
vessels, with the apparatus of Hales, and I have observed that, at the moment of the passage
of the calx into metal, a considerable quantity of air is disengaged, and that this air forms a
volume a thousand times greater than the quantity of litharge employed. This discovery
appearing to me one of the most interesting of those which have been made since Stahl, I have
thought it right to secure my property in it by depositing the present note in the hands of the
Secretary of the Academy, to remain secret until the moment when I shall publish my experiments.

Figure 1. The page from Lavoisier’s notebook, 10 September 1772, recording his
experiments in burning phosphorus.




It was a moment of great elation for Lavoisier. To his practical mind increase in 
weight meant gaining something, not losing it, as the phlogistics argued. He saw a 
vision of something much bigger than a theory of combustion, ‘a revolution in
chemistry and physics’. He saw, too, that its realization depended on a detailed 
investigation of the gases that were given off or absorbed in the chemical changes. 
It was no easy task, as Lavoisier had had comparatively little chemical experience, 
but he attacked it with courage on a wide front, and the next twelve months saw 
the most concentrated and sustained scientific work of his career. He was confident
of his objective, and that confidence kept him going through the perplexities
and disappointments of twelve crowded years. 

Opuscules physiques et chimiques 

The record of his next year’s work was published in a separate volume, Opuscules 
physiques et chimiques, as it was too long for the Memoirs of the Academy (15). It is
described in the preface as the first of a series of investigations covering the wide 
field that he already had in view. It was Lavoisier’s apprenticeship to the study of 
gases. He began with a careful study of the literature, and half the volume consists 
of a history of previous work. Hales, Black and Priestley are the three chemists 
to whom he pays special attention. Through Hales he learned the experimental 
methods of John Mayow, and his own apparatus, though more elaborate, is obviously
based on that of Hales. His systematic mind appears in the tables in which
he assembles all the quantitative results of Hales’s ingenious but almost random
experiments. From Black he learned perhaps more than he said. Black’s paper on
the alkalis was a quantitative study after his own heart, and broadly speaking he 
was to do for metals and oxides just what Black had done for mild and caustic 
alkalis. 

Having finished his survey of the literature, on 22 February 1773 he began his 
laboratory notebook with a remarkable forecast of the work it was to record (16):

Which has seemed to me destined to bring about a revolution in physics and in chemistry. I have
felt bound to regard all that has been done before me as no more than indications: I have
proposed to repeat it all with new safeguards, in order to link what we know of the air
that is fixed in bodies, or is released from them, with the other knowledge acquired, and to
form a theory.

Lavoisier’s own experiments, which are described in the second half of the volume, 
consist mainly of careful measurements of the changes in weight and volume that 
accompany the evolution or absorption of gas in the formation or decomposition 
of carbonates, the neutralization of acids, the reduction of metallic oxides, and the 
calcination of metals. His observations were thus concerned only with carbon 
dioxide and oxygen, but at that time Lavoisier had formed no definite view as to 
the nature of different gases. He was convinced of their fundamental importance 
and his aim was to bring them all into a common theory. 

The experimental part of the paper begins with a repetition and testing of 
Black’s work, using the results to establish the composition of quicklime, slaked





lime and chalk. This was then extended to the carbonates and hydroxides of the 
alkali metals and ammonium, and to the precipitation of metallic solutions by 
alkalis. The amounts of gas evolved by heating metallic calces with charcoal were 
measured and the gas was shown to be identical with that from chalk, but owing to
faulty experimental methods the gas in each case was mixed with air, which must 
have complicated Lavoisier’s conclusions. Finally, he measured the absorption of 
air when metals are calcined or phosphorus burnt in bell-jars over water. 

Lavoisier’s most important conclusion was that in confined spaces only a limited 
amount of calcination takes place, the increase in weight being roughly proportional 
to the volume of air absorbed. Thus it followed that metals could only fix part of 
the air, and the same result was obtained with phosphorus. But no conclusion is 
hazarded as to the nature of the gas which supports combustion, and the last 
sentence of the paper reveals one of the difficulties that must have been puzzling 
Lavoisier’s mind. He says: 

I was curious, in connexion with views of which I shall give an account at another time,
to see whether the admixture of one-third of the elastic fluid of effervescences would correct
the air that had served for the combustion of phosphorus, and restore to it the property of
supporting burning bodies. The mixture made, I filled a narrow jar with it and introduced
a candle, but it was extinguished at once.

The paper as a whole is such a clear exposition of a logical sequence of quantitative 
experiments that it is disappointing that it should end on so inconclusive a note. 
Luckily we have in Lavoisier’s notebook the clue to his perplexities. Misled perhaps 
by Black’s name of 'fixed air’ for carbon dioxide (which he abandoned in the 
following year), Lavoisier thought that the gas which was 'fixed’ in alkalis was 
identical with the gas 'fixed’ by metals in their combustion to form calces. This is 
proved by several entries in his notebook; for example, on 1 July 1773 he wrote: 
‘Persuadé que la combustion du phosphore absorbe l’air fixe contenu dans l’air, 
ou plutôt le soupçonnant, j’ay pensé qu’en rendant de l’air fixe à cet air, on pourrait 
peut-être le rendre air commun.’ He tries, of course, without success, but fails to 
push the matter further (17). 

A comparison of the notebook with the paper is interesting in another way. The 
experiments seem to have been made without any planned sequence, but in the 
paper the results are marshalled in perfect order, showing the systematic grasp 
Lavoisier already had of the whole field, his doubts only being recorded in the last 
sentences. It was a remarkable pioneer achievement in experimental technique, 
but progress had to wait until Lavoisier knew the relationship between carbon 
dioxide and oxygen. He had still much to learn, despite his brilliant intuition. 

Calcination of metals 

Lavoisier was unshaken in his belief that increase in weight meant chemical 
combination, and immediately the Opuscules was published he began experiments 
on the calcination of tin and lead in sealed vessels. Boyle had done similar 
experiments but had only reweighed the vessels after opening them, when he found 
an increase in weight which he explained by the passage of fire material through the 
glass (18). Lavoisier saw the opportunity for a crucial experiment by weighing the 
vessels both before and after they were opened. A summary of his results was 
deposited with the Academy on 24 April 1774, but was not read until the public 
session on 14 November (19), following Lavoisier’s usual plan of keeping a tit-bit 
for that occasion. Without giving any numerical details Lavoisier said that in 
sealed vessels the calcination of lead and tin ceased after an hour’s heating, the 
amount of calx formed being greater in larger vessels. In every case the weight of 
the vessel was unaltered if it was weighed before it was opened. When the seal was 
broken, air was heard to rush in, and the weight of the vessel was then greater, 
the increase being 'exactly proportional to its capacity’. Hence it was clear that 
the increase in weight was due not to the passage of fire-stuff through the glass but 
to something the metal had ‘borrowed’ from the air in the vessel which had 
converted it into a calx. 

Lavoisier ended with a significant sentence about the nature of the air that 
remained after metal had been calcined in it: 

Cet air ainsi dépouillé de sa partie fixable (je pourrois presque dire de la partie acide qu’il 
contenoit); cet air, dis-je, est en quelque façon décomposé; et il m’a paru résulter de cette 
expérience un moyen d’analyser le fluide qui constitue notre atmosphère; et d’examiner les 
principes qui le constituent. Quoique je ne sois pas arrivé à cet égard à des résultats entièrement 
satisfaisans, je crois cependant être en état d’assurer que l’air aussi pur qu’on puisse le 
supposer, dépouillé de toute humidité et de toute substance étrangère à son existence et à sa 
composition, loin d’être un être simple, un élément, comme on le pense communément, doit 
être rangé au contraire tout au moins dans la classe des mixtes, et peut-être même dans celle 
des composés. 

Lavoisier found that his experiments were not as new as he had thought, and 
he added a note saying that Beccaria had made the same observations on tin 
fifteen years earlier, and that Priestley had pointed out the limited extent of 
calcination in sealed vessels. But the ultimate value of an observation lies in the 
deduction from it, and here Lavoisier alone saw the truth. 

A full account of these experiments was not given until 1777, when Lavoisier 
had to admit that owing to their difficulty only two with tin were completely 
successful and none with lead (20). However, the numerical results of the two 
successful experiments showed the care and skill with which they had been carried 
out, and justified the conclusion that the weight of the sealed vessels was unaltered, 
within the errors of experiment, after calcination had occurred. By then Lavoisier 
had further evidence about the nature of air, which he describes as a mixture of a 
salubrious portion which combines with metals during calcination, leaving ‘une 
espèce de mofette’ which will not support life or combustion, and is itself ‘fort 
composé’. From his results he suspected that the former is the denser of the two, 
air being intermediate. His experiments were, in fact, sufficiently accurate to 
justify this conclusion. 





These experiments on the calcination of tin must have strengthened Lavoisier’s 
confidence in the correctness of his theory, and from March 1774 onwards his notebook 
shows that he was widening his front of attack, for combustion was only one 
aspect of the revolution he had in mind (21). This year saw the beginnings of work on 
a number of problems whose solution eluded him for the moment, as the background 
of his knowledge was as yet insufficient. In each there was a quantitative 
approach to determine both the composition of substances and the nature of the 
chemical changes that they undergo. 

He begins by burning inflammable air (hydrogen) which ought on his theory to 
increase in weight, and he was puzzled to find a loss owing to the escape of water 
vapour. This was a problem that continued to perplex him for nine years. 

Next came a study of the formation of nitrous ether, also abortive, then an 
attempt to determine the composition of nitre by exploding a mixture of it with 
charcoal, and analyzing the gaseous products. Knowing the percentage of caustic 
alkali in the nitre he tried to calculate its composition, but he did not know how 
much the charcoal contributed to the weight of fixed air. 

In October he is repeating a number of Priestley’s experiments on various gases, 
including his production of an acid by passing electric sparks in air. But it was 
characteristic of Lavoisier’s lack of qualitative intuition that in spite of his interest 
in acids he does not follow up this clue, and the explanation had to wait for Cavendish 
ten years later. 

Next come some revealing thoughts on ‘vegetable analysis’ and plans for future 
work, the birth of organic analysis. 

Nous ignorons: 1° Quelle est la qualité de cette immense quantité d’air qui se dégage 
pendant la distillation: il y a probablement de l’air fixe et de l’air inflammable; 2° ce que c’est 
que l’huille: il paraît que par la combustion on peut la réduire en air et en eau; mais nous ne 
savons rien au delà. Déterminer les proportions des deux en brûlant une lampe en un vase 
clos. 3° Ce que c’est que le charbon. Nous savons bien qu’en brûlant il convertit l’air évidemment 
en air fixe: mais nous ne savons pas s’il donne lui-même de l’air fixe. 

Lavoisier is still not quite free from the incubus of phlogiston, and he suggests 
burning charcoal in a sealed vessel to see if any phlogiston escapes through the 
glass during combustion with a corresponding loss of weight. 

Priestley and oxygen 

Lavoisier’s notebooks show that in 1774 his mind was roving over a wide range 
of problems of great significance but too complex to be solved with his present 
knowledge. He was still puzzled about the relation of fixed air to the air absorbed 
in calcination, and had tried to find a calx which could be reduced and give up this 
air in the absence of charcoal. It seems clear that Priestley gave him the clue he 
needed when he was in Paris in 1774. On 1 October he told Lavoisier that by 
heating the calx of mercury he had got a gas in which a candle burnt particularly 
brightly. It was not until the following 1 March that Priestley realized the 
remarkable properties of this gas and thus discovered oxygen. Lavoisier got some 
calx of mercury from Cadet, and after some preliminary experiments in November 
three months elapsed until he completed them in three days’ work with Trudaine 
at Montigny. He described the results at the public session of the Academy in 
April 1775 in a paper on 'The nature of the principle that combines with metals 
during their calcination and augments their weight’ (22). 

His main interest was to compare the fixed air he got by heating the calx of 
mercury with charcoal with the gas he got by heating it alone, which to his surprise 
was quite different and 'more respirable, more combustible and consequently 
purer than ordinary air’. His difficulty about the relation of these two gases was 
therefore solved. But qualitative experiments were never Lavoisier’s strong suit, 
and in this paper for once he is inconsistent. Perhaps it was written hurriedly for 
the public session. At one point he describes the 'principle’ absorbed in calcination 
as 'neither one of the constituents of the air, nor a particular acid existing in it, 
but the air itself without alteration or decomposition’. And he has the curious 
idea that the air can be purified whenever it is absorbed in a calx and then set free 
again. In another part of the paper he says that the 'principle’ is the purest part 
of the air. Lavoisier is rarely guilty of such inconsistency, but in this paper he 
clearly failed to realize the full significance of the facts he had observed. 

The paper was published in Rozier’s Journal and soon drew a criticism from 
Priestley (23), whose observations were much more accurate: 

Having mentioned the paper of Mr Lavoisier’s, ... I would observe, that it appears by it, 
that, after I left Paris, where I procured the mercurius calcinatus abovementioned, and had 
spoken of the experiments that I had made, and that I intended to make with it, he began 
his experiments upon the same substance, and presently found what I have called dephlogisticated 
air, but without investigating the nature of it, and indeed, without being fully apprised 
of the degree of its purity. ... He therefore inferred, as I have said that I myself had once 
done, that this substance had, during the process of calcination, imbibed atmospherical air, 
not in part, but in whole. ... As a concurrence of unforeseen and undesigned circumstances 
has favoured me in this inquiry, a like happy concurrence may favour Mr Lavoisier in another; 
and as, in this case, truth has been the means of leading him into error, error may, in its turn, 
lead him into truth. 

It is curious that Lavoisier failed to see at once that the experiments with the 
calx of mercury gave the support he needed for his theory, and that he was so slow 
to complete them. From May until September 1775 his laboratory notebook is a 
blank. The reason for all this was a new preoccupation in his mind. 


Régie des poudres 

Turgot, a friend of Lavoisier, had just been made Controller-General, the one 
man who might perhaps have saved France from a Revolution, if vested interests 
had not been too strong. Lavoisier was in touch with every side of the administration, 
and his critical eye had noted the inefficiency of the national production of 
gunpowder which was made under contract by a financial company. Lavoisier 
saw that their methods of collecting nitre and of making gunpowder were costly 
and wasteful, and that production was insufficient. He remembered, too, that the 
prohibitive cost to France of imported powder during the Seven Years’ War was 
one cause that had led to the peace of 1763. He succeeded in persuading Turgot 
that it would not only pay the government to cancel the existing contracts with 
compensation and take over the manufacture itself, but that it was essential for 
the national safety. This was done by a decree in March 1775, and Lavoisier and 
three of the former officials were put in charge of the powder factories. 

Lavoisier at once concentrated all his energies on the new task. He moved to the 
Arsenal, where he lived and had his laboratory until 1792. He then began a thorough 
investigation into the methods of making and collecting nitre, nitrification being 
the only source of saltpetre apart from some deposits which he spent three months 
in surveying. His laboratory notebooks for 1775 and 1776 are full of records of 
various investigations into the manufacture of nitre and gunpowder. 

In 1777 he drew up comprehensive instructions for the construction and operation 
of nitrification plants embodying many improvements, which represented a fine 
piece of chemical engineering (24). With his usual eye to finance he calculated that 
the plants should yield a net revenue of 15 % on the capital. Great care was taken 
to select suitable staff, who had to pass an examination in chemistry, mathematics 
and the construction of powder mills. 

The results were even better than Lavoisier had predicted. In three years the 
improvement of the powder had increased the range of French weapons by 60 %. 
Production increased steadily, and by 1788 there was a reserve of five million 
pounds in the magazines, and the economies had amounted to 20 million livres. 

However, the effect of Lavoisier’s work did not end with the Napoleonic wars. 
He had interested young Irénée Du Pont, the son of the physiocrat, in chemistry, 
and later he became Lavoisier’s assistant at the Arsenal. Irénée and his father 
were constitutionalists, and in 1792 they were fighting beside the Swiss Guard at 
the Tuileries. Saved by a miracle, they emigrated later to America. There on the 
banks of the Brandywine River at Wilmington, Irénée Du Pont built another 
powder factory destined twice to become a great arsenal of the Allied Nations. 

The discoveries of 1776 and 1777 

In September 1775 experiments in the laboratory were resumed, though most of 
the work in the autumn was concerned with the manufacture of nitre. But 
gradually Lavoisier began to take up his scientific work again. He dissolves mercury 
in nitric acid, evaporating the solution to dryness and measuring the gases evolved 
when the mercuric nitrate is heated. This he uses as a method of analyzing nitric 
acid, as the experiment ends with the same weight of mercury as at the start. 
From the proportions of nitrous gas and pure air he calculates the percentage 
composition of nitric acid, assuming the other component to be water. The results 
were read to the Academy in 1776 in a paper ‘On the existence of air in nitric 
acid’ (25), which he takes as an illustration of his general thesis that ‘non-seulement 
l’air, mais encore la portion la plus pure de l’air, entre dans la composition de tous 
les acides sans exception; que c’est cette substance qui constitue leur acidité’. 
He shows how he gets a mixture of nitrous air and pure air from nitric acid, and 
how by recombining them he gets nitric acid back again. He always tried to verify 
his conclusions both by analysis and synthesis. 

In this paper he twice pays a tribute to Priestley as the original discoverer of 
many of the facts he describes: ‘comme les mêmes faits nous ont conduits à des 
conséquences diamétralement opposées, j’espère que, si on me reproche d’avoir 
emprunté des preuves des ouvrages de ce célèbre physicien, on ne me contestera 
pas au moins la propriété des conséquences’. 

This was the only paper he published in 1776, but it was a very busy year in the 
laboratory. Many of Priestley’s experiments were repeated. Work was in progress 
on the analysis of air by the calcination of mercury, on the composition of various 
acids, on respiration, on the burning of a candle, on the latent heat of vaporization, 
and on the heat produced in the neutralization of acids. 

Lavoisier is now quite clear about the nature of the air and of combustion, and 
for the first time he openly challenges the theory of phlogiston. During 1777 the 
evidence he had accumulated in support of his theory was presented to the Academy 
in nine papers, one of which was not read until 1779, and none was printed until 
1780. Air, he says, is a mixture, four-fifths being an inert gas ‘mofette’ which takes 
no part in respiration or combustion, and one-fifth a gas ‘eminently respirable’, a 
constituent of all acids, which he therefore named oxygen. The compositions of 
carbonic acid, nitric acid, phosphoric acid, sulphuric and sulphurous acids had 
been determined. Respiration and the burning of a candle both convert oxygen 
into carbonic acid or ‘acide crayeux aériforme’, the name Lavoisier has substituted 
for fixed air. Slow combustion in the lungs is suggested as the source of animal 
heat. Finally, the existence of the three states of matter is shown to depend on their 
heat content, and the heat of combustion is explained by the heat given out by 
gaseous oxygen in its change of state and combination. 

Supported by this evidence Lavoisier attacks the theory of phlogiston, and shows 
that his own explanation is much simpler, involving no unwarranted assumptions 
such as the presence of an immense quantity of fire material in solids like the 
diamond, or that substances gain in weight while losing part of their contents. 

It was a year of great achievement; the problem of combustion was solved, but 
Lavoisier’s theory gained no adherents. It was incomplete, and there were many 
other simple reactions he could not explain. He was still puzzled by inflammable 
air, and he had not tried to discover the nature of the inert constituent of the 
atmosphere. 

Lavoisier’s experimental farm 

The clue to both problems was to come eventually from Cavendish, and 
Lavoisier’s notebooks of 1778-82 contain no records of new and important discoveries. 
His hands must have been full enough with the Arsenal, the collection of 
taxes and the business of the Academy. To these he now added a fresh activity: 
an experimental farm. On his many journeys through France Lavoisier, like 
Arthur Young, had seen the poor state of French agriculture and the hard lot of 
the peasants. The staple crop was wheat grown three years out of four, and the 
head of livestock was too small to yield the manure required for good cultivation. 
Lavoisier saw the advantages of the British system of mixed farming, with its 
larger capital investment, more livestock, and wheat grown only one year in 
four. 

So in 1778 he started an experimental farm at Fréchines near Blois, in a district 
where the standard of farming was low, in order to investigate and demonstrate 
the possibilities of improvement. Lavoisier always took the big view, and his 
aim now was to increase the wealth of France and to ease the life of the 
peasants. 

The experiments were carried out as far as possible with the same strict control 
as if they had been made in the laboratory. He kept records at Paris of every field, 
its size, soil, crop and yield, with an annual debit and credit account. 

His main objective was to increase the amount of fodder crops and raise the 
number and quality of his livestock. He had some disappointments at the start— 
lucerne failed and clover did badly in dry years, but sainfoin was a success. He 
introduced ley farming on the fallows, catch crops and root crops, and gradually 
increased his livestock. The folding of sheep was another successful innovation. 
No point was too small for Lavoisier, and one of his papers describes in great detail 
a simple way of making hurdles. 

After nine years’ development the farm was in much better shape, but Lavoisier 
was disappointed at the slowness of progress, and he saw no prospect of getting a 
5% return on his investment (26). This and the shortness of the leases explained 
why French farming suffered so badly from a lack of capital. However, that was 
not the end, as by 1793 his wheat crop had doubled, his stock had multiplied five 
times, and his neighbours were copying his improved methods of farming. 

When Calonne appointed a Consultative Committee on Agriculture in 1785, 
Lavoisier was its secretary and drafted its reports. Among its schemes were 
projects for experimental farms, a museum of agricultural implements, and a 
school of textiles made from home-grown materials, which was actually started in 
Paris. Among Lavoisier’s friends were the leading physiocrats, including Du Pont 
de Nemours and Malesherbes. While he sympathized with their views on freedom 
of trade, and regarded agriculture as the primary source of national wealth, he 
recognized that industry, too, had its place in the national economy. One of his 
memoranda dealt with the bad effects of restrictive regulations on agriculture. 
Another, dealing with the many relics of feudalism which handicapped the French 
farmer, reads like pages from Arthur Young’s Diary. A third, ‘Instructions to the 
Provincial Assemblies for improving agriculture’, recommends many of the new 
practices Lavoisier had tried himself at Fréchines. In the end, however, little was 
done owing to the lack of interest of the ministers. 




Lavoisier’s day 

Lavoisier’s immense output of work in so many fields during this period was 
only made possible by his quickness of mind and memory, his power of concentration 
on the matter in hand and his methodical habits. While he was living at the 
Arsenal six hours a day were given to science, in the early morning from 6 o’clock 
to 9 and from 7 o’clock to 10 after dinner. The rest of the day was spent in dealing 
with the business of the Ferme or the powder factories, in the meetings of the 
Academy or of the many Commissions of which Lavoisier was a member. Sunday 
was the happiest day of the week, as he spent the whole of it in his laboratory, 
which soon became a rendezvous for intellectual society in Paris. Scientists, 
ministers, economists and distinguished visitors from abroad went there to see 
Lavoisier’s latest experiment and to join in the discussion of its significance. This 
was the usual preliminary to one of his papers at the Academy. The younger men 
interested in science were equally welcome with their seniors, and they often helped 
Lavoisier with his experiments. The memories of those brilliant gatherings in his 
laboratory, which played no unimportant part in scientific history, remained fresh 
in the minds of those who shared in them. 

Thermochemistry 

One of Lavoisier’s great services to chemistry was to disentangle the chemical 
and physical aspects of chemical change. The phlogiston theory had been responsible 
for much loose thinking, since it assumed that this mythical principle 
could modify both the chemical and the physical properties of matter. Lavoisier 
concentrated first on the alterations in weight accompanying chemical change, 
particularly combustion. His belief in the conservation of matter was strengthened 
by his experiments in sealed vessels which showed no evidence of changes in weight 
due to the gain or loss of fire material. This was confirmed by unpublished experiments 
which proved that the weight of sealed tubes of water remained unchanged 
when they were frozen. The distinction between chemical and physical changes 
simplified his task greatly, and by 1777 his evidence on the gravimetric side was 
sufficient to enable him to give a satisfactory explanation of the nature of air and 
its role in the chemical changes involved in combustion and respiration. 

Lavoisier was now striving to give a similar picture of the physical changes 
accompanying chemical action. He recognized the dependence of the three states 
of matter on their heat content, and he explained the heat of combustion by the 
change in the heat content of the oxygen concerned. He had no idea of the conception 
of energy, and he did not connect the evolution of heat with the affinities 
of the elements for each other. The difference in the heat of neutralization of 
caustic and mild alkalis (hydroxides and carbonates) he explained by the absorption 
of heat in the liberation of carbon dioxide as a gas. 

Lavoisier had always been interested in thermometry and heat changes since the 
start of his scientific work. In 1772, when Desmarest told the Academy of Black’s 
work on latent heat, Lavoisier described an experiment made the preceding year 
in which he had discovered independently the same phenomenon, finding to his 
surprise that on mixing water and crushed ice the temperature of the water 
remained at zero until all the ice was melted (27). In 1773, when studying the 
crystallization of sodium sulphate by cooling solutions, he observed the temperature arrest 
which occurred when crystallization began and lasted until it had finished. This he 
explained correctly by the heat of crystallization of the salt (28). 

[Plate: Portrait of Lavoisier and his wife] 

In his physical work he had the advantage of partnership with Laplace, who 
worked in his laboratory at intervals from 1777 to 1785. In 1777 they were 
measuring the vapour pressure of liquids, and determining latent and specific heats 
by the method of mixtures. Black’s unpublished work was known to them, and 
de Luc and Crawford were working in the same field. In 1782, finding the method 
of mixtures unsatisfactory for measuring heats of reactions, they invented a new 
method, the ice calorimeter, in which the amount of heat given out is measured by the 
amount of ice melted (plate 11). The calorimeter consisted of two concentric vessels 
each containing crushed ice, the purpose of the outer vessel being to insulate the 
inner one against any gain or loss of heat from outside. There was a space in the 
inner vessel to hold the substance or reaction under examination, the amount of 
heat enclosed being measured by the amount of water that ran down from the ice 
as it melted. It was a beautifully simple and accurate apparatus which they used 
to measure specific heats, including the specific heats of gases by a flow method, 
heats of reaction, heats of combustion of phosphorus, carbon, ether and oil, and 
the heats of detonation of nitre mixed with charcoal or sulphur. 
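
The principle of the ice calorimeter described above can be put as a short calculation: the heat given out is taken as proportional to the mass of ice melted. The sketch below is only an illustration in modern units. The latent heat of fusion used (about 334 joules per gram) is a modern value assumed here, not a figure from Lavoisier and Laplace, and the function name is hypothetical.

```python
# Illustrative sketch of the ice-calorimeter principle: heat is inferred
# from the mass of ice melted. The constant is a modern value (assumption),
# not a number taken from the 1783 paper.
LATENT_HEAT_FUSION_J_PER_G = 334.0


def heat_from_melted_ice(mass_melted_g: float) -> float:
    """Return the heat (in joules) needed to melt the given mass of ice at 0 C."""
    if mass_melted_g < 0:
        raise ValueError("mass of melted ice cannot be negative")
    return mass_melted_g * LATENT_HEAT_FUSION_J_PER_G


if __name__ == "__main__":
    # Hypothetical reading: 100 g of melt water collected from the inner vessel.
    print(heat_from_melted_ice(100.0), "J")
```

Because the ice stays at its melting point throughout, the reading depends only on the quantity of water collected, which is what made the method independent of thermometer errors.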

Thermochemistry dates from Lavoisier and Laplace’s great paper ‘Sur la 
Chaleur’ of 1783 (29), and one of their generalizations is a partial anticipation of 
Le Chatelier’s Theorem which came a century later: ‘Dans les changements 
causés par la chaleur à l’état d’un système de corps, il y a toujours absorption de 
chaleur; en sorte que l’état qui succède immédiatement à un autre, par une addition 
suffisante de chaleur, absorbe cette chaleur sans que le degré de température du 
système augmente.’ 

More measurements were made in the following winter, including the heats of 
combustion of hydrogen and of wax, which Lavoisier showed was nearly equal to 
the sum of the heats of combustion of the hydrogen and carbon it contained. 

The ice calorimeter thus gave Lavoisier precise data for the heat changes involved 
in the chemical reactions in which he had already determined the weight 
changes involved. He now had a complete picture of these two different aspects 
which enabled him to show how unnecessary it was to assume the existence of 
phlogiston, as everything could be explained much more simply without its aid. 

In 1777 Lavoisier had explained the source of animal heat as being the heat 
evolved when carbonic acid (air crayeux) is produced in the lungs from the oxygen 
of the air during respiration. This was an entirely new point of view, as Haller 
had accepted Stahl’s explanation that the heat of the body was due to the friction 
of the blood in the arteries, though he was baffled by the constancy of body 
temperature. The ice calorimeter gave the opportunity to verify this speculation 
by measuring the heat given out by a guinea-pig and comparing it with the heat 
evolved in the formation of the volume of carbonic acid which an animal of similar 
weight expired during the same period. 

First the amount of carbonic acid expired by guinea-pigs in a period of ten hours 
was measured by absorbing it in potash bulbs. From the mean of their experiments 
Lavoisier and Laplace calculated that the combustion of the corresponding amount 
of carbon would have melted 10.38 oz. of ice. A guinea-pig was then kept in a 
calorimeter for the same period and the heat given out by it melted 13 oz. of ice. 
Part of the excess, they said, might have been due to the cooling of its extremities— 
it might well have had cold feet! From the results they concluded: ‘la conservation 
de la chaleur (animale) est due, au moins en grande partie, à la chaleur que produit 
la combinaison de l’air pur respiré par les animaux avec la base de l’air fixe que le 
sang lui fournit’. The maintenance of a constant body temperature under different 
climatic conditions they ascribed to differences in the rate of evaporation of 
moisture rather than to differences in the amount of carbonic acid formed during 
respiration. 
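
The comparison behind this conclusion is simple arithmetic on the two figures quoted above (10.38 oz. of ice predicted from the carbon burnt, 13 oz. melted by the animal). The short sketch below merely restates it; the function and variable names are illustrative and not taken from the paper.

```python
# Figures quoted in the text, both in ounces of ice melted over ten hours.
ICE_PREDICTED_FROM_CARBON_OZ = 10.38  # from the carbonic acid expired
ICE_MELTED_BY_ANIMAL_OZ = 13.0        # guinea-pig in the calorimeter


def fraction_accounted_for(predicted_oz: float, observed_oz: float) -> float:
    """Share of the observed heat explained by the predicted combustion heat."""
    if observed_oz <= 0:
        raise ValueError("observed quantity must be positive")
    return predicted_oz / observed_oz


if __name__ == "__main__":
    share = fraction_accounted_for(
        ICE_PREDICTED_FROM_CARBON_OZ, ICE_MELTED_BY_ANIMAL_OZ
    )
    excess = ICE_MELTED_BY_ANIMAL_OZ - ICE_PREDICTED_FROM_CARBON_OZ
    print(f"share explained by respiration: {share:.1%}")
    print(f"unexplained excess: {excess:.2f} oz of ice")
```

On these numbers roughly four-fifths of the animal's heat is accounted for by the combustion of carbon, which is consistent with the authors' cautious wording 'au moins en grande partie'.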

This was one of Lavoisier’s most brilliant papers, remarkable for the beauty of 
the experimental technique, for the directness of attack, and for its verification of 
his theory of animal heat, which gave a new significance to metabolism. No doubt 
the paper owed much to Laplace, who was the first convert to Lavoisier’s theories. 
The full significance of their eight years of partnership has not had the attention it 
deserves. 


The nature and composition of water 

His notebooks show repeatedly how puzzled Lavoisier still was as to the nature 
of the inflammable air he got in various ways, by the solution of metals in acids, 
or the distillation of vegetable substances. In 1774 he had tried and failed to 
ascertain the increase in weight when inflammable air was burnt. He expected 
inflammable air to give an acid on combustion but could find none. Since the 
inflammable air came from sulphuric acid and metals Lavoisier thought it should 
give sulphuric acid. Bucquet thought it should give fixed air. Experiments in 
1774 and 1777 showed that neither acid was produced. He returned to the problem 
again in 1781 and 1782, when he and Gingembre burnt a jet of oxygen in inflammable 
air and found neither carbon dioxide nor any acidity, which still surprised 
Lavoisier ‘que l’analogie m’avait porté invinciblement à conclure que la combustion 
de l’air inflammable devait également produire un acide’. Hence in 1782 he had 
no idea that water was the product. 

Meanwhile other chemists were busy with the same problem, and Cavendish had 
found that when a mixture of inflammable air and common air is exploded in a 
closed vessel water is formed without any change in weight. When Blagden, the 
Secretary of the Royal Society, was in Paris, he told Lavoisier of Cavendish’s 
experiments. On 23 June 1783, Lavoisier took two gas holders containing inflammable 
air and oxygen and joined them up to a jet at which the mixture could 
be burnt for a considerable time in a bell-jar over mercury. The first experiment 
was made by Lavoisier and Laplace in the presence of Blagden and a number of 
other scientists. A considerable quantity of pure water collected on the surface of 
the mercury, and although it was not possible to establish directly that the weight 
of the water found was equal to that of the two gases Lavoisier had no doubt of it. 
He wrote ‘comme il n’est pas moins vrai en physique qu’en géométrie que le tout
est égal à ses parties, de ce que nous n’avions obtenu que de l’eau pure dans cette
expérience, sans aucun autre résidu, nous nous sommes crus droit d’en conclure
que le poids de cette eau étoit égal à celui des deux airs qui avoient servi à la
former’ (30).

The next day the experiment was described to the Academy. ‘Nous ne
balançâmes pas à en conclure que l’eau n’est point une substance simple, et qu’elle
est composée poids pour poids d’air inflammable et d’air vital.’ Lavoisier then
heard that Monge had made similar experiments. Having determined the combining
volumes of the two gases, Monge had found that the weight of the water was
almost equal to that of the two gases, the specific gravities of which he had
determined.

Lavoisier must have seen in a flash the wide significance of this new view of the 
nature of water, which gave him the clue to so many of his unsolved problems. It 
explained the source of the water he got in organic combustions, and he saw the
possibility of using them to determine the composition of organic bodies. 

Additional evidence was obtained by studying the decomposition of water by 
long standing over cold iron, and by passing steam over hot iron or charcoal, as 
well as from experiments on its formation by passing inflammable air over heated 
metallic calces, thus extending some observations made by Priestley. 

Lavoisier was naturally anxious to know the percentage composition of water, 
as it would enter into so many of his calculations. In his first paper to the Academy 
he gives the following figures based on experiments made with Meusnier to determine
the combining volumes of the two gases and their densities: 

                                                              Livres
Air vital, ou plutôt principe oxygine                         0.86866273
Air inflammable, ou plutôt principe inflammable de l’eau      0.13133727
Total                                                         1.00000000

Lavoisier at this stage has a preference, like Boyle, for the word ‘principle’ 
rather than ‘element’. The name of hydrogen was not adopted until the whole 
nomenclature was revised four years later. 

There are numerous references to a new apparatus that was constructed to prove 
that the weight of water was equal to that of the gases from which it was formed, 
and to determine its composition. Arthur Young saw it in Lavoisier’s laboratory 
in 1787. ‘In the apparatus for aerial experiments’, he writes in his Diary (31),
‘nothing makes so great a figure as the machine for burning inflammable and vital
air, to make or deposit water; it is a splendid machine. Three vessels are held in
suspension with indexes for marking the immediate variations of their weights;
two that are as large as half hogsheads contain the one inflammable, the other the
vital air, and a tube of communication passes to the third, where the two airs
unite and burn; ... If accurate (of which I confess I have little conception) it is a
noble machine.’

It has been questioned whether Lavoisier ever used this apparatus successfully, 
as the results were never published by him. The experimental details, however, are 
given in the Traité. Hydrogen was burnt at a fine jet in a glass globe containing
oxygen, both gases having been dried by passing over potash (Lavoisier says that
calcium nitrate or chloride would have been better). The increase in weight of the
globe together with measurements of the volumes of the gases consumed, and a
knowledge of their densities, gave Lavoisier the data he needed. Details of one
experiment were published in an unsigned article in the Journal Polytype (32), an
ephemeral scientific journal in which Lavoisier was interested. The weight of water
found was within 1 % of that of the two gases, and its composition is given as
85 % oxygen and 15 % hydrogen, which are the figures Lavoisier used subsequently 
in all his calculations. The hydrogen content was too high owing to errors in the 
determination of its density. 
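
As a rough check on the figures quoted above, the following sketch compares the hydrogen fractions reported in the text with the value implied by modern atomic weights. The atomic weights (H about 1.008, O about 15.999) and all function and variable names are assumptions introduced for illustration; they do not come from the lecture.

```python
# Hydrogen mass fractions of water quoted in the text, compared with the
# value implied by modern atomic weights (assumed here, not from the lecture).

H_ATOMIC_WEIGHT = 1.008   # assumed modern value
O_ATOMIC_WEIGHT = 15.999  # assumed modern value

# Figures reported in the text (fraction of the weight of water).
QUOTED_HYDROGEN_FRACTIONS = {
    "first Academy paper (with Meusnier)": 0.13133727,
    "Journal Polytype experiment": 0.15,
}


def modern_hydrogen_fraction() -> float:
    """Mass fraction of hydrogen in H2O from the assumed atomic weights."""
    hydrogen = 2 * H_ATOMIC_WEIGHT
    return hydrogen / (hydrogen + O_ATOMIC_WEIGHT)


def excess_over_modern(quoted_fraction: float) -> float:
    """How far a quoted hydrogen fraction lies above the modern value."""
    return quoted_fraction - modern_hydrogen_fraction()


if __name__ == "__main__":
    print(f"modern value: {modern_hydrogen_fraction():.4f}")
    for source, fraction in QUOTED_HYDROGEN_FRACTIONS.items():
        excess = excess_over_modern(fraction)
        print(f"{source}: {fraction:.4f} (excess {excess:+.4f})")
```

On these assumptions both quoted fractions lie above the modern figure of roughly 11.2 % hydrogen by weight, which agrees with the remark that the hydrogen content was too high.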

It was a very difficult experiment and a great test of Lavoisier’s skill. The final 
solution had to wait until Morley’s classical investigations just a century later. 

Lavoisier lost no time in exploiting this new view of the composition of water. 
He saw that it explained the formation of water in organic combustions, and he was 
now able to use them to determine the composition of organic bodies. He
determined the heat of combustion of hydrogen and saw that the oxidation of hydrogen
as well as carbon in the body would account for more oxygen being consumed in 
respiration than the expired carbonic acid could account for. This would also 
explain why the amount of heat produced by an animal was greater than that 
calculated from its output of carbonic acid. 

The new chemistry and the reform of nomenclature

The recognition of hydrogen as an element was a great advance. It explained 
many of Lavoisier’s perplexities, and shortly afterwards Cavendish discovered that 
the inert part of the atmosphere, Lavoisier’s ‘mofette’, gave nitric acid when 
sparked with oxygen, and Berthollet found that ammonia is a compound of
‘mofette’ with hydrogen. 

The way was now clear. With the recognition of the elements oxygen, hydrogen, 
nitrogen (or azote as the French chemists called it), carbon, sulphur, phosphorus 
and the metals, Lavoisier could explain quite simply the composition of chemical 
compounds, their reactions and their quantitative relations. There were no longer 
obscurities to serve as a stronghold for phlogiston. Laplace and some of the 
younger physicists were already on his side. Of the chemists, Berthollet announced
his conversion at the Academy in August 1785, followed soon by Fourcroy and
Guyton de Morveau. The latter had published proposals for a reform of chemical 
nomenclature in 1782, but these were based on phlogiston. Discussions between 
the four chemists in 1786 led to their publication jointly of a new method of 
chemical nomenclature based on Lavoisier’s theory (33). In the introduction 
Lavoisier acknowledges his debt to Condillac, the disciple of John Locke. From
Condillac’s Logic he had learnt the value of clarity of expression, of language as an 
instrument of analysis, of words as the symbols in the algebra of thought. 

Till then phlogiston had been the only common basis. Names had had little 
reference to the nature or relationships of the substances that bore them, and many 
were relics of alchemy. Lavoisier’s work had made a system possible, in which the 
names of elements like hydrogen and oxygen told something about them, and the 
names of compounds like sulphates and sulphites indicated their composition. The 
system devised by the four chemists, with few exceptions, is the one we use to-day, 
and its adoption helped greatly towards the general acceptance of the new theory. 


Traité élémentaire

The nomenclature having been agreed upon, Lavoisier was drawn irresistibly, as he
tells us, to use it to give a logical account of his new theory. His Traité élémentaire
de chimie (34) was finished early in 1789. Its publication marks the birth of modern
chemistry, and it threw a new light on all the related sciences. It was no ordinary 
textbook, as the first person singular appears on every page. It was in fact an 
autobiography of Lavoisier’s work since 1772. The discoveries of seventeen years 
were compressed into a few pages, and the experimental basis of the new chemistry 
presented in such a clear compelling fashion that it won immediate acceptance. 
It was chemistry as Lavoisier saw it without any history or even a mention of 
phlogiston. 

The first part outlines Lavoisier’s theory of chemistry and gives the results of 
the crucial experiments on which it was based. ‘C’est elle seule qui contient
l’ensemble de la doctrine que j’ai adoptée; c’est à elle seule que j’ai cherché à
donner la forme véritablement élémentaire.’

After giving his views on the nature of heat, he describes the experiments which 
established the composition and chemical nature of the atmosphere, water and a 
number of acids, gradually building up his picture on experimental results. He 
then sets out his views on organic chemistry, his experiments on fermentation, and 
his work on the analysis of organic compounds by combustion. 

Oxygen is the central element, a constituent, Lavoisier thought, of all acids and 
all bases. Hence he predicted rightly the existence of the alkali metals isolated
twenty years later by Davy. But his assumption of the presence of oxygen in all 
acids was less happy, and here again Davy established the facts. 

The Traité was the forerunner of the textbooks of the nineteenth century,
and some of Lavoisier’s views, the dualistic nature of salts and the conception of
compound radicals, were to have a profound influence on chemical thought and to
be the issues on which many battles were to be fought. 

The idea of the chemical equation had been implicit in all Lavoisier’s quantitative 
work: 

Car rien ne se crée, ni dans les opérations de l’art, ni dans celles de la nature, et l’on peut
poser en principe que dans toute opération, il y a une égale quantité de matière avant et
après l’opération; que la qualité et la quantité des principes est la même, et qu’il n’y a que des
changemens, des modifications ... je puis considérer les matières mises à fermenter et le
résultat obtenu après la fermentation, comme une équation algébrique.

Figure 2. Frontispiece of the Traité, 1789. [Title page: Traité élémentaire de chimie,
présenté dans un ordre nouveau et d’après les découvertes modernes; avec figures.
Par M. Lavoisier. Tome premier. À Paris, chez Cuchet, libraire, rue et hôtel Serpente.
M. DCC. LXXXIX.]

The first time that Lavoisier actually used the familiar form of the chemical 
equation was in the chapter on alcoholic fermentation when he wrote: 

Le moût de raisin = acide carbonique + alkool.

The second part of the Traité contains lists of salts with general methods for
their preparation from each acid. It is a useful summary containing nothing ‘qui
me soit propre’.

The first part gave few experimental details: ‘J’ai reconnu ... que des descriptions
minutieuses ... figuroient mal dans un ouvrage de raisonnement; qu’elles
interrompoient la marche des idées, et qu’elles rendoient la lecture de l’ouvrage
fastidieuse et difficile.’ In the third part Lavoisier gives a full account of the
construction of the apparatus and the experimental methods which he had gradually
developed. It is of intense interest in revealing the care and skill he devoted to his
experiments, most of which were made with his own hands. Its value is enhanced by 
many illustrations drawn and engraved with loving care by Madame Lavoisier who 
had so often seen the apparatus in use in the laboratory. The book has the same 
individual quality as Faraday’s Chemical Manipulation. It begins with an appeal 
for the use of the decimal system and ends with the prototype of future tables of 
physico-chemical constants. 

The publication of the Traité marks the end of the phlogistic period and the
beginning of modern chemistry. The clear logic of its presentation immediately won
almost general acceptance. It was translated into several languages and became
the accepted method of teaching. It had been a long fight, but in the end Lavoisier
won an almost bloodless victory. In 1791 he wrote to Chaptal: ‘Toute la jeunesse
adopte la nouvelle théorie et j’en conclus que la révolution est faite en chimie.’
Even Kirwan in 1792 wrote to Berthollet: ‘Enfin, je mets bas les armes et
j’abandonne la phlogistique.’

Respiration and metabolism

After the publication of the Traité Lavoisier’s interests turned to organic and
physiological chemistry. Almost his last scientific work was on respiration and
metabolism. In 1785, in a brilliant lecture to the Society of Medicine, he described
experiments on birds and guinea-pigs, showing that they could live for long periods
in pure oxygen and that, to his astonishment, the rate of production of carbon
dioxide was practically unaltered (35). He found later that they could live equally
well in an atmosphere in which hydrogen was substituted for the nitrogen of the
air. He saw that water was formed as well as carbon dioxide in the process of
metabolism, thus accounting for what is now called the ‘respiratory quotient’,
and he was able to calculate the amount of water produced. The heat evolved in 
its formation explained why his measurements with Laplace had shown that an 
animal gave out more heat than could be accounted for by the heat of formation 
of the carbon dioxide it expired. 

From his observations on the distress of animals breathing in confined spaces as
the concentration of oxygen diminished and that of carbon dioxide increased,
Lavoisier’s practical mind travelled to the state of the atmosphere in crowded
rooms. He measured its composition in hospital wards and in the Comédie
Française. Finding a deficit of oxygen and an increase of 2 or 3 % of carbon dioxide,
he urged the need for better ventilation. He calculated the rate at which the
atmosphere deteriorated, producing that ‘impatience machinale’ in an audience
which was such a sad handicap to the last speaker at a meeting of the
Academy.

Finally, he saw the danger that emanations from the lungs might spread disease 
in crowded places, and he urged the need for knowledge of the ways in which 
infection spreads in order to protect the health of people in large towns. The lecture 
was a brilliant forecast of the problems of ventilation. 

Experiments on a human being were the culmination of Lavoisier’s work on 
respiration and the subject of his last two communications at public sessions of the 
Academy in 1789 and 1790 (36). Seguin, his co-author, was the experimental subject. 
Measurements were made of his consumption of oxygen, his output of carbon 
dioxide, his rate of respiration and his pulse rate under different conditions. We know 
from sketches made by Madame Lavoisier the general experimental arrangement 
(plate 11). The results revealed the main factors regulating metabolism: the
temperature of the environment, the output of work, and the digestion of food. Lowering
the room temperature from 80 to 61° F raised the absorption of oxygen by a man at
rest by 11 %, the digestion of food raised it by 60 %, doing work at the rate of
37,000 ft.lb. an hour raised it by 160 %, and the digestion of food simultaneously
with work at the rate of 39,000 ft.lb. an hour raised it by 280 %. Lavoisier showed
also that the increase in the pulse rate was proportional to the amount of work done, 
and that the amount of oxygen absorbed was proportional to the product of the 
pulse rate and the number of inhalations. 
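
To put the figures above into more familiar terms, the sketch below turns the quoted percentage increases into multipliers of oxygen uptake and converts the quoted units. The conversion factors are standard, but treating every percentage as an increase over one common resting baseline is an assumption, as are all the names used.

```python
# Unit conversions and ratios for the respiration figures quoted in the text.
# Assumption: every percentage is an increase over one common resting baseline.

FT_LBF_IN_JOULES = 1.3558179483  # one foot-pound force expressed in joules
SECONDS_PER_HOUR = 3600

# Percentage increases in oxygen absorption reported in the text.
OXYGEN_INCREASE_PERCENT = {
    "room cooled from 80 to 61 deg F, at rest": 11,
    "digestion of food": 60,
    "work at 37,000 ft.lb. an hour": 160,
    "digestion with work at 39,000 ft.lb. an hour": 280,
}


def uptake_multiplier(increase_percent: float) -> float:
    """Oxygen uptake as a multiple of the assumed resting baseline."""
    return 1 + increase_percent / 100


def ft_lb_per_hour_to_watts(rate: float) -> float:
    """Convert a work rate in foot-pounds per hour to watts."""
    return rate * FT_LBF_IN_JOULES / SECONDS_PER_HOUR


def fahrenheit_to_celsius(temp_f: float) -> float:
    """Convert a Fahrenheit temperature to Celsius."""
    return (temp_f - 32) * 5 / 9


if __name__ == "__main__":
    for condition, percent in OXYGEN_INCREASE_PERCENT.items():
        print(f"{condition}: x{uptake_multiplier(percent):.2f}")
    for rate in (37_000, 39_000):
        print(f"{rate} ft.lb. an hour = {ft_lb_per_hour_to_watts(rate):.1f} W")
    for temp_f in (80, 61):
        print(f"{temp_f} deg F = {fahrenheit_to_celsius(temp_f):.1f} deg C")
```

On this reading, the work rates quoted correspond to an output of roughly 14 to 15 W, and the room was cooled from about 27 to about 16 °C.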

He saw at once the secret of the constancy of the body temperature under varying 
conditions of climate and occupation that had so long remained a mystery. It is 
regulated automatically by three balancing factors: respiration which produces
heat by oxidation of carbon and hydrogen, transpiration which increases and 
diminishes as needed to remove any excess of heat by evaporation of water from the 
skin, and digestion which replaces the matter lost in the first two processes. The 
directness of the attack, the swift recognition of the wide implications of the 
results, are characteristic of Lavoisier at his best. 

Finally, he discusses their bearing on the state of the body in health or disease. 
Fevers he thought were nature’s method of restoring any disturbance of the 
equilibrium. The art of the doctor, he said, often consists in letting nature deal with
herself, helped by suitable diet and purgatives. Contaminated atmospheres were, 
he thought, the cause of endemic disease and hospital and prison fevers. The best 
remedies were open air, freer breathing and a change of environment. 

He left open the question as to whether oxidation actually took place in the lungs 
or in the course of circulation. Investigations already in hand on digestion and 
transpiration would throw further light on this.

The first paper with Seguin on transpiration (37) described an attempt to get a
complete balance of the respiratory processes and ascertain the amounts of water 
lost through the skin and through the lungs. During the experiment Seguin was 
enclosed in a rubber bag, and his loss of weight was measured both with and
without the bag. In addition, his absorption of oxygen and output of carbon 
dioxide were determined. Lavoisier could then calculate the loss of weight due to 
each cause on the assumption that all the oxygen absorbed formed carbon dioxide 
or water. If oxidation took place in the circulatory system he was uncertain 
whether this assumption was justified, and regarded the results as provisional. 

A second paper on transpiration published after Lavoisier’s death deals specially 
with the moisture content of air as determining comfort conditions, as to which, 
he said, a thermometer reading may be quite misleading (38). The paper deals also 
with the purpose and design of clothing. 

The trend of Lavoisier’s thoughts at this time is shown by the programme
drafted by him for the Academy Prize of 1794 (39). His introduction deals with the
problem of nutrition, the animalization, as he calls it, of vegetable and animal food.
He points out the lack of knowledge of the changes that take place in the various
stages of digestion, which had led the Academy to choose an investigation into the
functions of the liver and bile as the subject of the Prize. A broad treatment was
suggested, including the comparative anatomy of the liver and gall bladder, the
chemical nature of the bile, and the pathology of the liver. The investigations were 
to cover chemical researches, especially the new methods of organic analysis, 
Lavoisier’s own invention. The programme was almost a forecast of some of the 
major developments in physiological chemistry in the nineteenth century. 

Lavoisier was the first to submit the vital functions to an exact physico-chemical 
analysis. He laid the foundations of physiological chemistry. His investigations 
into the source of animal heat first disclosed the full significance of metabolism.
If he had lived on, what contributions he might still have made! But it was not 
to be. 


Politics and the Revolution

No Frenchman saw more clearly than Lavoisier the dangers that were gathering 
for France: the state of her finances, her outworn social structure with all its
inequalities and injustice, and the selfishness of privilege with its lack of patriotism.
No one had done more than Lavoisier to forecast the reforms that had to come, and
when Necker in 1787 set up the Provincial Assemblies to replace the effete local
Governors he saw a great opportunity. Chosen to represent the Third Estate at
Orleans he was soon its outstanding figure. ‘It is Lavoisier who does everything,
enlivens everything, and is everywhere’, said Léonce de Lavergne. We still have
his memoranda—clear, practical, convincing, and amazingly modern in outlook— 
on the state of agriculture, the freedom of commerce, old age insurance, savings 
banks, infant mortality, the need for a geological survey, and above all the reform
of taxation on an equitable basis (40). But privilege stood in the way and little
was done. 

The tide was running swiftly, and when the Estates General were summoned by 
the King in 1789 Lavoisier had fresh hopes. To his great disappointment he was 
only elected as a substitute deputy, but again he threw himself into the battle. 
He was a leading member of the Club of ’89 which stood for constitutional monarchy.
His memoir on the Territorial Wealth of France was ordered to be printed by the
National Assembly (41). It was an attempt to estimate the national income and its
taxable capacity, a pioneer effort in statistics, made possible by the information 
Lavoisier had collected from every province through the organization of the 
Ferme. 

Many of the reforms Lavoisier had foreseen were soon accomplished, but early 
in 1790 there is already an anxious note in a letter to Benjamin Franklin (42):

Après vous avoir entretenu de ce qui se passe dans la chimie, ce serait bien le cas de vous
parler de notre révolution politique; nous la regardons comme faitte et bien faitte sans
retour; ... Les personnes modérées et qui ont conservé leur sang-froid dans cette effervescence
générale, pensent que les circonstances nous ont entraînés trop loin ... et qu’il est à craindre
que l’établissement de la nouvelle constitution n’éprouve des obstacles de la part de ceux
mêmes en faveur de qui elle a été faitte. ... Nous regrettons bien dans ce moment votre
éloignement de France; vous auries été notre guide et vous nous auries marqué les bornes
que nous n’aurions pas dû franchir.

In almost the last of Lavoisier’s scientific papers there was a strangely prophetic 
passage: 

Faisons des vœux surtout pour que l’enthousiasme et l’exagération qui s’emparent si
facilement des hommes réunis en assemblées nombreuses, pour que les passions humaines
qui entraînent la multitude si souvent contre son propre intérêt, et qui comprennent dans
leur tourbillon le sage et le philosophe comme les autres hommes, ne renversent pas un
ouvrage entrepris dans si belles vues, et ne détruisent pas l’espérance de la patrie.

The violence Lavoisier feared came quickly. Members of the Club of ’89 were 
marked men because of their moderation. He himself was attacked by Marat, 
whose ridiculous pamphlet, Traité du Feu, he had criticized. One by one he had to
give up the offices which he had served so long and ably. In 1791 the administration
of taxes was taken from the Ferme, and in 1792 he resigned from the Arsenal. 
Even the Academy was not safe. Lavoisier as its treasurer fought hard for its 
existence, but some of its own members turned against it, and it was suppressed 
in 1793. 

Knowing the financial difficulties of France he offered his services in various
capacities, but was refused. In 1792, however, the King wished to nominate him
as Minister of the Public Funds, an office Lavoisier would have occupied so gladly
in happier days. He knew it was too late, that he was suspect as a member of the
old regime, and no politician. He refused, saying, ‘je ne suis ni jacobin, ni feuillant’.

Lavoisier found another outlet for his activities in these difficult years. In 1791 
the National Assembly had set up a Consultative Committee on Arts and Crafts 
to advise the government on various questions, including useful inventions.
Lavoisier, as usual, seems to have drafted all the reports, the most important of
which was entirely his own (43). It was on a system of public education, a subject
to which he had given much thought, having started at his own expense a primary 
school at Villefrancoeur. 

The report dealt with the general principles of education, and its national
importance. It proposed a scheme covering the whole field from the primary school
to the Lycée. Under his plan all children at the age of six were to enter a primary
school, and at eleven were to go on to an elementary school teaching either the
arts or the arts and sciences. Finally, there were to be twelve lycées giving the
highest form of public education with a wide choice of subjects. Lavoisier had an 
eye to practical subjects like hygiene, the weather and simple surveying. Country 
schools were to teach the elements of agriculture and town schools the elements of 
commerce. 

Lavoisier is emphatic as to the importance of education in every walk of life.

Le mot industrie n’exprime pas toujours un emploi de forces, ni même d’adresse, il exprime
le plus souvent un emploi des facultés de l’esprit. ... Le cultivateur qui prospère le plus n’est
pas toujours celui qui est physiquement le plus fort et le plus adroit; c’est celui qui est le
plus intelligent. ... Organisez l’instruction publique dans toutes ses parties; donnez du
mouvement aux arts, aux sciences, à l’industrie, au commerce.

In 1791 the National Assembly had entrusted to the Academy the task of 
establishing a uniform system of weights and measures, a very necessary reform 
which resulted in the metric system. A commission was set up consisting of Borda, 
Lagrange, Laplace, Monge and Condorcet with Lavoisier as secretary and 
treasurer. The work was divided between various groups; Lavoisier and Haüy
were to determine the density of distilled water at zero temperature. Lavoisier 
was also responsible for the administration of the commission, which continued 
after the suppression of the Academy. It was his last scientific task (44). 

Meanwhile, in the general attack on the ancien régime, feeling was running high
against the members of the Ferme, and in 1793 they were arrested on charges of
maladministration. Lavoisier asked to be allowed to continue his work on the
standards, and Haüy and Borda pleaded for him in vain to the Convention,
risking their own lives. All his wife’s efforts, too, were unsuccessful.

When the charges against the Ferme were formulated, it was Lavoisier who drew 
up their defence (45). He gave a masterly review of their financial transactions, 
showing that they had done their work efficiently and without undue profit to
themselves. But it was the height of the Terror, the trial was a mockery of justice, 
and Lavoisier and twenty-seven of his colleagues were guillotined a few hours later. 
Lavoisier followed his father-in-law, Paulze, to the scaffold, and met death with 
the courage and philosophy of his generation. 

Thus died this great Frenchman in his fifty-first year at the height of his
achievements. The loss to France and to the world was immeasurable. The Terror had
nearly run its course; within a few years came reconstruction, when under Napoleon
science once more came into its own. What a contribution Lavoisier might have
made to the Industrial Revolution in France, with his vision of her needs, his
imaginative use of science in industry, agriculture and hygiene, and his
administrative skill and experience.

If in the fields of politics and economics he was cut off before his work came to
fruition, he had foreseen the changes soon to come in industry, in social legislation 
and in education. His eloquence and writings must have left their mark on the 
minds of those who were to remake France. 

In science it was otherwise. There his work was done with almost incredible 
effectiveness and speed. No other revolution in scientific thought has ever been
so complete, so swift, or has done so much to clear men’s minds of the cobwebs of 
centuries. He quickened the advance of science over its whole front. Not only 
chemists, but workers in physics, biology, medicine and agriculture were given for 
the first time a clear view of the various forms of matter we call elements, and of 
the distinction between chemical and physical changes. Lavoisier put new and 
potent weapons into their hands, just when they were needed for science to play 
its full part in the nineteenth century. Compare the clear-cut vision of Lavoisier’s 
great treatise, based on quantitative measurement, on the methods of analysis and 
synthesis, and on the notion of the chemical equation, with the confused picture 
of chemistry in Macquer’s textbook, with no general conception but the theory 
of phlogiston. It was no wonder that progress was quickened. Dalton’s atomic 
theory, Davy’s discovery of the alkali metals, his recognition of the halogens as 
elements, and the massive contributions of Berzelius followed swiftly, completing
the picture and giving chemistry its modern form. 

It was, indeed, a revolution, and I have tried to show how Lavoisier fought the 
battle for the New Chemistry almost single-handed, for he had no school, with 
many other preoccupations in one of the busiest of lives. He was fortunate in his 
great contemporaries, Black, Priestley and Cavendish, who so often gave him the 
clue he was searching for, but he alone saw the significance of their discoveries. 

What was the secret of his greatness? Lavoisier was above all things a reformer 
with an intense desire to improve everything he saw around him, whether it was 
the confused state of chemistry, the system of taxation, the lot of the peasant, the 
making of gunpowder or the methods of agriculture. He had the logical systematic 
mind of his countrymen, joined with great constructive power and a creative urge. 
Burning curiosity made him a born researcher. Every problem was to him a 
challenge for research and experiment, and for him research meant measurement, 
the method of modern science. He had an essentially modern outlook. He was
not bound by tradition or authority. He looked his problems squarely in the face,
and judged them on evidence with the shrewd common sense he showed as a man
of affairs. 

Lavoisier’s great strength was his realism. He saw each problem in its smallest 
details, and no one examined them with greater care, for he was a great
experimenter; but he also had great vision and saw his problems in their widest
implications. He gave his visions quantitative form, in chemistry with the balance, in
finance with statistics. He applied to any practical problem the same analysis and
measurement that he used in the laboratory. It is often said that scientists are too 
remote from real life. Lavoisier is the outstanding example of the value of the 
laboratory mind in action. That is his lesson for us to-day. 


I am greatly indebted to previous biographers of Lavoisier, and especially to the late 
Professor A. N. Meldrum for his scholarly researches on Lavoisier’s early work. I have to
thank also my secretary, Miss Josephine Wasse, without whose never-failing help the
preparation of this lecture during wartime would have been impossible.


Notes 


(1) Œuvres de Lavoisier, 1864-93, Paris: Imprimerie Impériale.
(2) Œuvres, 3, 765.
(3) Œuvres, 5, 187.
(4) Œuvres, 3, 1.
(5) Œuvres, 3 and 4.
(6) Arthur Young, Travels in France and Italy, 1934, p. 78. London: Dent.
(7) Gouverneur Morris, A Diary of the French Revolution, 1939, passim. London: Harrap.
(8) Œuvres, 3, 106, 128.
(9) Œuvres, 2, 1.
(10) Œuvres, 2, 38.
(11) Œuvres, 2, 64.
(12) A. N. Meldrum, The Eighteenth Century Revolution in Science - the First Phase, 1930. Calcutta: Longmans, Green. Lavoisier’s three Notes on Combustion: 1772. Archeion, 1932, 14, 16.
(13) A. N. Meldrum, op. cit.
(14) Œuvres, 2, 103.
(15) Œuvres, 1, 439.
(16) Berthelot, La Révolution Chimique: Lavoisier, 1890, p. 48. Paris: Germer Baillière.
(17) Berthelot, op. cit. p. 246.
(18) Robert Boyle, Essays of the Strange Subtility, Determinate Nature, Great Efficacy of Effluviums. To which are annext New Experiments to make fire and flame ponderable: together with a Discovery of the perviousness of glass, 1673, passim. London: W. G. for M. Pitt.
(19) Lavoisier, Observations sur la Physique, edited Abbé Rozier, 1774, 4, 448.
(20) Œuvres, 2, 105.
(21) Berthelot, op. cit. p. 257.
(22) Œuvres, 2, 122.
(23) Joseph Priestley, Experiments and Observations on Different Kinds of Air, 1774, 2, 320. London: J. Johnson.
(24) Œuvres, 5, 391.
(25) Œuvres, 2, 129.
(26) Œuvres, 2, 812.
(27) Lavoisier, Observations sur la Physique, 1772, 2, 510.
(28) Lavoisier, Observations sur la Physique, 1773, 1, 10.
(29) Œuvres, 2, 283.
(30) Œuvres, 2, 334.
(31) Arthur Young, op. cit. p. 78.
(32) Journal Polytype, 5 February 1786.
(33) Guyton de Morveau, Lavoisier, Berthollet and de Fourcroy, Méthode de Nomenclature Chimique, 1787. Paris.
(34) Œuvres, 1, 1.
(35) Œuvres, 2, 676.
(36) Œuvres, 2, 688; Annales de Chimie, 1814, 91, 318.
(37) Œuvres, 2, 704.
(38) Lavoisier, Annales de Chimie, 1814, 90, 6.
(39) Œuvres, 6, 33.
(40) Œuvres, 6, 238.
(41) Œuvres, 6, 403.
(42) E. Grimaux, Lavoisier, 1899, p. 201. Paris: Germer Baillière.
(43) Œuvres, 6, 516.
(44) Œuvres, 6, 670.
(45) Œuvres, 6, 570.


Determination of the velocity coefficients for 
polymerization processes. 

I. The direct photopolymerization of vinyl acetate 

By G. M. Burnett and H. W. Melville, F.R.S.

Chemistry Department, University of Aberdeen

(Received 6 November 1946. Read 6 February 1947)

This paper describes a method whereby the individual velocity coefficients of a vinyl
polymerization reaction have been determined by a detailed analysis of the kinetics of the
photo-polymerization of liquid vinyl acetate. The evaluation of the coefficients is made by the
development of an accurate method for measuring the life time of the active polymeric 
molecules employing intermittent radiation. In addition, the kinetic chain length and the 
number average molecular weight of the polymers have been measured. It is shown that 
each absorbed quantum of radiation activates one vinyl acetate molecule probably as a 
diradical. The chain terminates by a disproportionation reaction between the polymer 
radicals. The size of the growth or propagation bimolecular coefficient decreases only slightly 
as the molecule grows, the energy of activation is 4400 cal. and the steric factor of the growth 
reaction is 10~®. In the termination reaction there is likewise only a small decrease of velocity 
coefficient with increasing molecular size; no energy of activation is required and the steric
factor has the surprisingly large value of 10“®. 


INTRODUCTION

The three fundamental reactions controlling the velocity of a chain polymerization
process are: (a) initiation reaction, (b) growth reaction, and (c) termination of
growth. Hitherto the kinetic analysis has made use of two measurable quantities, 
namely, (i) the overall rate of reaction, and (ii) the rate of initiation. The ratio of 
(i) to (ii) will give the kinetic chain length which is, in some cases, equal to the degree 
of polymerization of the polymer. There are, however, three unknowns involved in 
the analysis. Until a third parameter can be measured, further analysis is impossible, 
and, in particular, the values of the individual growth and termination coefficients 
cannot be obtained. The third parameter which requires to be measured is the
stationary concentration of the active polymer. If this problem is soluble it is
practicable to calculate in moles, litres and seconds the value of the growth coefficient 
and thereby obtain a numerical measure of the tendency of a molecule to polymerize, 
according to a given mechanism. Then it should be possible to see much more clearly 
how chemical constitution affects reactivity. Similar information automatically 
becomes available regarding the termination coefficient, thereby providing further 
data about the ultimate fate of the highly active polymer molecules. Precise 
measurements of the kinetic chain length, i.e. the number of monomeric units 
polymerized for each molecule originally activated, and the degree of polymerization 
will enable transfer reaction to be detected, for in such a case the chain length will be 
greater than the degree of polymerization. 

By standard technique the value of the chain length can easily be varied, and 
therefore it may be possible to find whether the growth and termination coefficients 
change with the size of the molecule. This information makes it possible to compute 
accurately what the distribution of the molecular sizes will be in the resultant
polymer. 

An outline of how this problem can be approached has already been given (Jones & 
Melville 1940). The present paper describes an accurate method by which the
above-mentioned problems may be quantitatively solved for the case of the
polymerization of pure liquid vinyl acetate. 


EXPERIMENTAL

In many polymerization reactions it has been established that the presence of
oxygen has a considerable effect on the course of the reaction, the effect depending
to some extent on the length of time in which the monomer is in contact with the
oxygen. In view of this observation all the experiments were carried out in vacuo
and all the materials were carefully freed from air before use. Evacuation of the
apparatus was effected by means of a mercury diffusion pump backed by a rotary oil
pump, with the customary liquid-air trap and drying tube in position. Reservoirs of
the monomer and various solvents were sealed to the main vacuum line, and taps
leading to the reaction system were covered by mercury cut-offs. In certain cases,
silicone tap grease, thickened with talc, was used on the taps, otherwise Apiezon L
was employed.

The reaction vessel was made of fused silica and was of the design shown in
figure 1. The ends of the vessel were optically flat to enable accurate measurement
of the quantum input to be made. The vessel was calibrated so as to act as a
dilatometer. For reactions at high temperatures and involving solvents sealed tubes
were used and were sealed off in vacuo. Thermostatic control was achieved by
surrounding the reaction vessel with a copper bath, fitted with silica windows,
through which water from a thermostat could be circulated.

Figure 1. Reaction vessel.




The lamp used throughout the experiments was a G.E.C. ‘Osira’, from which the
outer glass envelope was removed. The lamp then consisted of a high-pressure
mercury arc which ran at 90 V and 1.5 amp. Fluctuations in the output were
avoided by feeding the lamp from a constant-voltage transformer and by efficient
shielding from draughts. The shield was made so as to act as a parabolic reflector
also, giving, therefore, an almost parallel beam of light which was focused on the
reaction vessel by means of a silica lens of short focal length. The polymerization
reaction depends to a great extent on the wave-length of the light employed. Thus
at very short wave-lengths there appears to be very considerable decomposition of
the monomer which seems to result in the production of either a retarder or inhibitor
of the polymerization process. To avoid this as far as possible a filter of 50 % acetic
acid in water was inserted in the path of the light, thus effectively removing all light
of low wave-length.

Monomeric vinyl acetate of commercial grade was obtained from Shawinigan Ltd.
Hydrolysis of the monomer results in the formation of acetaldehyde which has been 
shown to be an inhibitor of the gas-phase reaction so that its complete removal is 
essential. Purification was carried out by the fractional distillation of the crude 
monomer until the distillate gave only a very faint coloration with Schiff’s reagent 
on standing for 5 min. Only the cut between 72 and 73° C was retained. This was 
redistilled at atmospheric pressure and stored in a tightly stoppered dark bottle. 
A suitable quantity of the purified monomer was transferred to the reservoir on the 
vacuum apparatus and sealed off. The monomer was frozen in liquid air and the 
reservoir evacuated, then isolated from the pumps and the liquid air removed. The 
dissolved air escaped and the process was repeated until no trace of air was noted. 
Finally, the first 5 c.c. or so of the monomer was distilled directly to the liquid-air 
trap further to ensure purity. Small quantities of monomer received from Boake 
Roberts Ltd. and I.C.I. Ltd. were similarly treated and gave the same experimental
results, thereby establishing the method of purification as being adequate. 



Figure 2. The ultra-violet absorption spectrum of vinyl acetate.
ε = molar decadic extinction coefficient.

Benzoquinone which was used as an inhibitor was purified by steam distillation, 
followed by thorough drying and sublimation. For convenience it was made up 
temporarily in freshly distilled acetone.

The ultra-violet absorption spectrum of the monomer

The absorption spectrum was photographed on a Hilger quartz spectrograph
using n-hexane of spectroscopic purity as the comparison liquid. The solvent and
solution were contained in microcells, 5 mm. in length. The usual technique was
followed in mapping the absorption spectrum which is shown in figure 2. It will be
seen that the value of the extinction coefficient falls fairly rapidly at first and then
more slowly. This is somewhat the same as in the case of methyl methacrylate
(Melville 1937) and was then associated with the double bond and the -COO.CH3
group present in that molecule. Similar results with other molecules which do not
contain an ester group suggest, however, that the two parts of the curve are
characteristic of the molecule as a whole rather than of a part of it.

The bulk photopolymerization of vinyl acetate

Since the density of the polymer formed is independent of the molecular weight
of the macromolecule, it is convenient to use the density change as an accurate
measure of the course of the reaction. This is most readily accomplished by measuring
the contraction of the monomer as it polymerizes, the total volume contraction being
26.82 % for 100 % polymerization at 25° C (Starkweather & Taylor 1930). The
reaction vessel was of such dimensions that a drop of 0.01 mm., which could be
detected by the cathetometer used, corresponded to an extent of polymerization of
about 0.03 %. Checks of the final amount of polymer were made either by the
determination of the refractive index of the mixture, using the results of Breitenbach &
Raff (1939), or by the evaporation of the monomer under high vacuum from the
mixture of the monomer and polymer, and weighing the polymer remaining.

The course of the reaction over the initial stages is shown in figure 3. It will be
seen from table 1 that the reaction is of zero order with respect to the monomer
concentration. It was decided that it was not worth carrying the reaction further
than this stage, since the increasing viscosity of the solution made the meniscus
shape change considerably, and in the viscous liquid temperature control became
much more difficult because of the hindrance of the convection currents and the
poor thermal conductivity of the reaction mixture. Also as the conversion becomes
greater there is the possibility of an explosion-like polymerization (Schulz 1941),
and that this does occur is shown in figure 4. The zero order of the reaction over such
a large percentage of reaction is, at first sight, surprising, but the explanation would
appear to be as follows. It will be shown in a later paper that the rate of reaction is
proportional to the first power of the monomer concentration (naturally this
cannot be tested with the pure monomer, since the value of the monomer concentration
is fixed by the density of the liquid). During polymerization the monomer
concentration remains practically unaltered, for the gradual accumulation of




Figure 3. The direct reaction (initial stages).


Table 1. The course of polymerization
Intensity of illumination = 5 x hv/sec. Temperature = 15.9° C.

time (min.)    contraction    conversion     C/T
    (T)             %            % (C)
     0              -              -           -
     4            0.070          0.26        0.065
     8            0.142          0.53        0.066
    12            0.218          0.81        0.067
    20            0.360          1.34        0.067
    25            0.440          1.64        0.066
    35            0.617          2.30        0.066
    45            0.805          3.00        0.067
    60            1.080          4.01        0.067



Figure 4. The direct reaction showing the ‘gel effect’. 



polymer causes no significant reduction in the vapour pressure of this mixture owing
to its high molecular weight. Similar results have been obtained in the catalyzed
polymerization of methyl methacrylate (Norrish & Brookman 1937).
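
The conversion and C/T columns of table 1 follow from the dilatometer readings by simple proportion. The following is a minimal sketch of that arithmetic, assuming only that contraction is linear in conversion and reaches the quoted 26.82 % at complete polymerization; the function and variable names are illustrative and are not the authors'.

```python
# Convert dilatometric contraction (%) to conversion (%) and form C/T.
# Assumption: contraction is linear in conversion, with 26.82 % contraction
# at 100 % polymerization (the figure quoted from Starkweather & Taylor).
FULL_CONTRACTION_PERCENT = 26.82


def conversion_percent(contraction_percent: float) -> float:
    """Percentage of monomer polymerized for a given volume contraction."""
    return 100.0 * contraction_percent / FULL_CONTRACTION_PERCENT


# (time in min, contraction in %) as listed in table 1
readings = [(4, 0.070), (8, 0.142), (12, 0.218), (20, 0.360),
            (25, 0.440), (35, 0.617), (45, 0.805), (60, 1.080)]

rates = []
for minutes, contraction in readings:
    conversion = conversion_percent(contraction)
    rates.append(conversion / minutes)
    print(f"{minutes:3d} min  conversion {conversion:5.2f} %  "
          f"C/T {conversion / minutes:.3f}")
```

A nearly constant C/T over the whole run is what is meant by zero order with respect to the monomer.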

If the absorption of the activating radiation is dependent on the concentration 
of monomer, the results shown in table 2 may be readily obtained. This shows 
definitely that it is not possible with certainty to derive information on the nature 
of the termination reaction without investigating the overall kinetics to a much 
greater extent. 

Table 2

termination    lowest order of reaction      highest order of reaction
reaction       (absorption independent       (absorption ∝ (M))
               of monomer conc.)
spontaneous              1                             2
monomer                  0                             1
mutual                   1                             1.5

The intensity exponent of the polymerization will give a definite clue to the 
nature of the chain-breaking process. For either of the first two cases in table 2 the 
rate of reaction is proportional to the light intensity, while for the third case the 
rate is proportional to the square root of the intensity of absorbed radiation. Hence 
a distinction can be readily drawn. 
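
The exponents quoted for the three cases can be checked by a short stationary-state argument. The sketch below is an illustration and is not taken from the paper: the symbols $k_p$, $k_t$, $(P)$ and $I_{abs}$ are introduced here for the purpose, and coefficients independent of molecular size are assumed.

```latex
% Assumed notation (not the authors'): (P) = total concentration of active
% polymer, I_abs = rate of initiation, k_p and k_t = growth and termination
% coefficients taken as independent of molecular size.
\begin{align*}
\text{rate} &= k_p (P)(M) \\
\text{spontaneous termination:}\quad I_{abs} = k_t (P)
  &\;\Rightarrow\; \text{rate} = (k_p/k_t)\, I_{abs}\,(M) \\
\text{termination by monomer:}\quad I_{abs} = k_t (P)(M)
  &\;\Rightarrow\; \text{rate} = (k_p/k_t)\, I_{abs} \\
\text{mutual termination:}\quad I_{abs} = k_t (P)^2
  &\;\Rightarrow\; \text{rate} = k_p k_t^{-1/2}\, I_{abs}^{1/2}\,(M)
\end{align*}
```

On these assumptions the orders in monomer are 1, 0 and 1 when the absorption is independent of $(M)$; if $I_{abs}$ is itself proportional to $(M)$ they rise to 2, 1 and 1.5, which are the entries of table 2.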

The intensity of the light was varied by means of perforated screens whose
transmission was measured by means of a photocell. The distance apart of the holes was
such that no interference could be caused due to distinct beams of light, as has been
described (Jones & Melville 1940). In each case the rates under the total radiation
and with the screen in position were measured, and in figure 5 the log relative value
of the rate is plotted as a function of the log relative value of the intensity. It is easy
to show that the relationship ought to be a straight line whose slope gives the value
of the intensity exponent, since

$$R = k\,[f(I)]^{n},$$

where R is the rate, k a proportionality constant, f(I) some function of the light
intensity, and n the intensity exponent. Putting the above expression in logarithmic
form gives

$$\log R = \log k + n \log f(I),$$

so that the log plot must obviously be a linear one as described above. From the
graph of figure 5 the value of the intensity exponent works out at 0.50.
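
The slope of such a log-log plot can be obtained by least squares. The sketch below is illustrative only: the readings of figure 5 are not reproduced in the text, so the transmissions and rates used here are hypothetical values generated from an exact square-root law, and the function name is not the authors'.

```python
import math


def intensity_exponent(rel_intensity, rel_rate):
    """Least-squares slope of log(rate) plotted against log(intensity)."""
    xs = [math.log10(i) for i in rel_intensity]
    ys = [math.log10(r) for r in rel_rate]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


# Hypothetical screen transmissions and relative rates; NOT the data of
# figure 5. The rates are generated from rate = intensity ** 0.5.
transmissions = [1.0, 0.5, 0.25, 0.1]
relative_rates = [t ** 0.5 for t in transmissions]

exponent = intensity_exponent(transmissions, relative_rates)
print(f"intensity exponent = {exponent:.2f}")
```

With measured rates in place of the synthetic ones, the same function would return the experimental exponent.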

Using the kinetic theory of Melville & Gee (1944) this implies that the termination
reaction involves the mutual destruction of two of the growing polymer molecules.
Assuming that the chain carrier is a free radical (evidence for which will be given in
a subsequent paper), it is obvious that termination may take place by two distinct
mechanisms. First, the radicals may simply combine to give one inert or ‘dead’
polymer, by the scheme

$$\mathrm{-CH_2.CHX- \;+\; -XHC.CH_2- \;\longrightarrow\; -CH_2.CHX.XHC.CH_2-},$$

or, secondly, the reaction may be such that one of the free radicals dehydrogenates
the other so that both molecules are stabilized, thus

$$\mathrm{-CH_2.CHX- \;+\; -XHC.CH_2- \;\longrightarrow\; -CH_2.CH_2X \;+\; CHX{=}CH-}.$$

If, as is most likely in this case, the initial step is the formation of a free diradical,

$$\mathrm{CH_2{:}CHX} + h\nu = \mathrm{-CH_2.CHX-},$$

then it is obvious that the most likely, if not the only, mechanism for mutual
termination is the second of the above. Unfortunately, the value of the intensity
exponent does not lead to any conclusive mechanism, since for both the above
mechanisms the value of the exponent is 0.5.



Figure 5. The intensity exponent.

It is interesting that in this case the value is so close to 0.5, since it excludes the
possibility of any other type of termination mechanism operating simultaneously.
In the case of the vapour phase polymerization of vinyl acetate the value of the
intensity exponent is appreciably greater than 0.5, which would suggest that there
is a considerable amount of termination by some mechanism other than mutual,
probably by the walls of the vessel. For the catalyzed polymerization of vinyl
acetate again it is found that the rate of reaction is accurately proportional to the
square root of the catalyst concentration, again implying that a mutual termination
mechanism is solely responsible for the breaking of chains (Cuthbertson,
Gee & Rideal 1937).

The quantum yield

As a quantum counter in these experiments a Baird television photocell, fitted
with a silica envelope, was used. In the set-up employed the photocurrent flowed
through a number of resistances in series in order to facilitate the use of the instrument
over a wide range of light intensities. The potential difference over a suitable
resistance was measured on a Marconi type valve voltmeter, capable of reading to
0.1 mV. The high-tension supply for the photocell was an ordinary 120 V dry battery,
the voltage of which was frequently checked in order to ensure constant running
conditions for the photocell. All the comparative measurements were made on the
linear portion of the characteristic curve for the photocell.

The photocell was calibrated for various lines in the mercury spectrum against
uranyl oxalate, using the following technique. The cell was mounted in a brass tube
which was fitted with an aperture about 7.5 mm. in diameter. Light from a high-pressure
arc was focused on the aperture by means of a pair of silica lenses in such a
way that the diameter of the beam was just equal to that of the opening, and the
image fell centrally on the sensitive plate of the photocell. Using a suitable filter,
a given line or narrow band in the ultra-violet region could be isolated. After a
reading had been obtained on the voltmeter, the photocell was removed and replaced
by a shallow dish containing a known amount of uranyl oxalate solution. After a
definite period of illumination, usually about 6 hr., the oxalate was titrated
against a standard solution of potassium permanganate at 80° C. Comparison with
a blank titration made the computation of the light intensity possible with reference
to the quantum yields of the oxalate reaction. It was found that the sensitivity of
the cell increased rapidly at first and then less rapidly and finally reached a maximum
in the visible blue region.

Losses of light in these experiments could result from (a) reflexion at the various
interfaces and (b) absorption by the water of the thermostat bath. Losses by
reflexion were found to be negligible by comparison of the light transmitted through
the empty reaction vessel and with the vessel filled with n-hexane of spectroscopic
purity. This is not altogether surprising, since the refractive indices of the liquid
and silica are not very much different. Losses due to absorption by the water
amounted to some 7 % of the total radiation, by a similar comparative measurement.

A silica lens was used to bring the light into a slightly converging beam of
approximately the same diameter as the reaction vessel. The final image again fell centrally
on the sensitive plate of the photocell. The light beam required to be slightly
converging in order to fulfil the final condition. All light which did not pass through the
reaction system was prevented from reaching the photocell by means of a light metal
shield, with a sleeve, which fitted round the end of the reaction vessel. The intensity
of the ultra-violet light passing through the cell was measured both with the
cell empty and filled with monomer, from which the intensity of the absorbed
radiation could be found. Frequent checks were made in order to ensure that
the running of the lamp was perfectly steady and the light absorbed was always
constant.

The quantity of polymer produced was determined by measurement of the
refractive index of the mixture on a Pulfrich refractometer, using the sodium D
line as a source of illumination. Table 3 gives the results obtained from these
experiments.

The figure in the bracket in column 3 of the table was obtained by evaporating off
the monomer under high vacuum and weighing the residual polymer. It will be noted
from this table that the values of the kinetic chain length are in the ratio of
6.6 : 1, while the ratio of the square roots of the corresponding intensities is 7 : 1,
which is in good agreement with the theory.
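
The comparison of the two ratios may be sketched as follows. With mutual termination the kinetic chain length should vary inversely as the square root of the intensity. The intensities below are read as 9.83 × 10^13 and 4.90 × 10^15 quanta per second, which is an assumption about a damaged table (the exponents are partly illegible in the source); the chain lengths are the unbracketed entries of table 3.

```python
import math

# Kinetic chain lengths as listed in table 3 (unbracketed values).
chain_low_intensity = [1500, 1460, 1440]
chain_high_intensity = [220, 228, 222]

# Assumed reading of the two intensities (quanta/sec); the exponents 13 and
# 15 are inferred, since they are partly illegible in the source table.
intensity_low = 9.83e13
intensity_high = 4.90e15

mean_low = sum(chain_low_intensity) / len(chain_low_intensity)
mean_high = sum(chain_high_intensity) / len(chain_high_intensity)

chain_ratio = mean_low / mean_high
sqrt_intensity_ratio = math.sqrt(intensity_high / intensity_low)

print(f"ratio of chain lengths         {chain_ratio:.1f} : 1")
print(f"ratio of square roots of I     {sqrt_intensity_ratio:.1f} : 1")
```

On this reading the two ratios come out near 6.6 and 7.1, matching the figures quoted in the text.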

Table 3. The quantum yield
Temperature = 15.9° C. λ = 2800 to 3200 Å.

intensity of      time of         wt. of polymer     kinetic chain
illumination      illumination    (g.)               length
(hν/sec.)         (sec.)
9.83 × 10^13      24,800          0.456              1500
                  39,600          0.714              1460
                  11,500          0.227 (0.222)      1440 (1460)
4.90 × 10^15       3,600          0.476              220
                   1,500          0.204              228
                   3,600          0.481              222 (223)


Assuming that no transfer takes place in this reaction, it should be possible to
obtain a value for the primary quantum efficiency from a determination of the
number average molecular weight of the final product. This is, however, complicated
by the fact that the nature of the termination reaction is not known with certainty.
If the breaking of the chains should result in the formation of two molecules of
‘dead’ polymer, then this method will give the true value of the primary quantum
efficiency. If, on the other hand, the termination reaction should bring about the
formation of only one inactive molecule due to the combination of two of the
growing chains, then the molecular weight would obviously be, on the average,
twice the kinetic molecular weight, so that the value of the quantum efficiency would
be half of the true value. Osmometric measurements on the samples obtained from
the first three experiments are quoted in table 3. The osmometer used was of the type
suggested by Fuoss & Mead (1943) and had a vertical capillary. The membrane used
was bacterial cellulose, details of whose production, treatment and use have already
been given (Cruickshank, Masson, Melville & Menzies 1946). The results obtained by
this method give a molecular weight of 120,000 ± 4000 or a chain length of 1400 ± 50,
which is in comparatively good agreement with the kinetic chain length and suggests
that the primary quantum efficiency is probably unity. This means that the number
of chains started per second is equal to the quantum input per second.
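
The step from molecular weight to chain length is a division by the weight of the monomer unit. A minimal sketch follows; the value 86.09 for vinyl acetate (C4H6O2) is supplied here as an assumption, since the text does not state the figure used.

```python
# Chain length (degree of polymerization) from the osmotic molecular weight.
# Assumption: monomer unit C4H6O2 with molecular weight 86.09; this figure
# is not given in the text.
MONOMER_MW = 86.09


def chain_length(number_average_mw: float) -> float:
    """Number of monomer units in a polymer of the given molecular weight."""
    return number_average_mw / MONOMER_MW


central = chain_length(120_000)
spread = chain_length(4_000)
print(f"chain length = {central:.0f} +/- {spread:.0f}")
```

The result, a little under 1400 with a spread of a little under 50, rounds to the 1400 ± 50 quoted above.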

In order to verify this conclusion as completely as possible it is necessary to devise
a means of ensuring that the number of chains started can be counted. By the choice
of a sufficiently strong inhibitor this becomes possible. It has long been known that
p-benzoquinone acts as an inhibitor towards the thermal polymerization of styrene
(Foord 1946). Since it is highly probable that the chain carriers in this case and
in the photopolymerization of vinyl acetate are of the same type, p-benzoquinone
was tried as an inhibitor of this reaction. The ultra-violet absorption spectrum of
the quinone has a minimum in the region under discussion, so that there can be no
internal filter effect (Marchlewski & Moroz 1924). The following general observations
were made on the behaviour of the quinone in this reaction. Inhibition is complete
over a considerable period whose duration depends on the concentration of the
quinone initially present, and during this period the colour of the solution fades and
finally disappears. On the disappearance of the colour the rate of polymerization is
the same as that which is found without the inhibitor. This means that the
inhibition cannot result in the formation of a compound which itself acts as an
inhibitor or even retarder of the polymerization reaction. Table 4 gives the results
for the experiments, carried out using a light intensity of 5-0 x 10^® Av/sec. at 25° C. 

Table 4. Inhibition by quinone

Weight of monomer = 11.15 g.

wt. of quinone    duration of       T/W
(mg.) (W)         inhibition
                  (min.) (T)
2.4                48               20.0
5.0               103               20.6
7.5               148               19.9

In figure 6 is plotted the course of a number of reactions which shows clearly the
effect of the addition of the quinone. The curves obtained by Foord for the thermal
polymerization of styrene follow a somewhat similar course for benzoquinone
inhibition. Foord found that other quinones were not so effective, but, unfortunately,
this cannot be tested here because the absorption spectra of the other members
of the quinone family contain maxima in the region of 3000 Å.



Figure 6. Inhibition by p-benzoquinone.


Experiments of the same type were carried out under the conditions employed for 
the quantum yield experiments. The same proportionality was observed, and the 
results of the experiments are shown in table 5. 





In the calculation of the primary quantum efficiency in table 5 it has been
tacitly assumed that only one molecule of the quinone is required for the breaking of
the chain. The stoichiometric analysis of the reaction by Goldfinger et al. (1943) tends
to justify this assumption. Also it is evident from the shape of the curves in figure 6
that the initial growth before stopping cannot be beyond probably a dimer, but even
if a dimer is formed this has no effect on the calculations given above. Chemically
the reaction still requires further investigation, although some points may be made
here in order to clarify the situation.

Table 5. Primary quantum efficiency from quinone inhibition

wt. of quinone    duration of       primary
(mg.)             inhibition        efficiency
                  (min.)
0.75              670               1.00
0.13              112               0.98
0.29              250               0.98

The mode of action of quinone appears to be twofold in the inhibition of polymerization
reactions. If the reaction is initiated thermally or photochemically then
the results which have been given here are applicable. If, however, initiation is by
a catalyst, such as a peroxide, the final state is reached only slowly. There is initially
a period of complete inhibition followed by polymerization, and the rate of polymerization
increases slowly towards a maximum. We suggest that the difference is
due to a difference in the nature of the chain carrier. In the first case it is almost
certainly a diradical and in the second a monoradical. The dimeric diradical could
undergo a modified Diels-Alder reaction with the quinone, thus:

[Structural formula: adduct of the dimeric diradical with quinone.]

Now it is known that the addition of a further such diradical is unlikely, so that
the fact that inhibition or retardation is not caused by the product is accounted
for. In the case of styrene some progress was made along these lines, and a
compound of molecular weight 310 and constitution C22H18O2 was isolated. This
conforms to the above if X = C6H5. It is interesting to note that in the case of
the catalyzed reaction considerable quantities of hydroquinone were isolated.
This is an efficient retarder of the photopolymerization of vinyl acetate (Vernon &
Taylor 1931), so that further support is given to the ideas which have just been
put forward.



The normal reaction between a free monoradical and a quinone molecule will
account for the formation of hydroquinone. This reaction may be shown schematically
as

[Reaction scheme: monoradical R with quinone.]

but this appears to have some weakness in that a substituted quinone is formed
which has probably a high colour, which is not supported experimentally, and
more than one quinone molecule is used to stop the chain.


The energy of activation 

In gas-phase polymerization processes the measured energy of activation always
appears to be negative, which is not in keeping with the general kinetic conceptions.
This may be due to the fact that in such reactions the formation of the polymer
implies the formation of a new phase upon which adsorption of the monomer may
take place. As the heat of adsorption is, in most cases, unknown, no correction can
be applied. In a system which remains homogeneous during the course of the reaction,
it would be expected that normal values of the energy of activation would be
obtained.

In this case the rate of the reaction was measured in sealed tubes at temperatures
from 18 to 45° C. Corrections were applied to allow for the density changes at the
different temperatures although these were not very great. The reaction velocity
was found to increase as the temperature increased, thus giving rise to a positive
value for the energy of activation. The usual plot of log rate against the reciprocal of
the absolute temperature gives a value of 4.40 kcal./g.mol. for the energy of activation.
The results are set out in detail in table 6.


Table 6. The energy of activation

temperature    relative rate    log                E
(° K)                           (relative rate)    (kcal./g.mol.)
318            2.02             0.305              4.21
310            1.71             0.232              4.48
302            1.42             0.154              4.40
295            1.19             0.078              4.48
288            1.00             0.000
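
The plot of log rate against reciprocal temperature may be reduced numerically as sketched below. This is an illustration using the temperatures and relative rates of table 6 with an ordinary least-squares line; the gas constant of 1.987 cal. per degree per g.mol. is an assumed value, and the authors' own graphical procedure may have differed in detail.

```python
import math

R_CAL = 1.987  # gas constant, cal. per degree per g.mol. (assumed value)

# Absolute temperatures and relative rates as listed in table 6.
temps_k = [318, 310, 302, 295, 288]
rel_rates = [2.02, 1.71, 1.42, 1.19, 1.00]


def activation_energy_kcal(temps, rates):
    """Energy of activation from the least-squares slope of ln(rate) vs 1/T."""
    xs = [1.0 / t for t in temps]
    ys = [math.log(r) for r in rates]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    # Arrhenius: ln(rate) = const - E / (R T), so E = -slope * R.
    return -slope * R_CAL / 1000.0


energy = activation_energy_kcal(temps_k, rel_rates)
print(f"E = {energy:.2f} kcal./g.mol.")
```

A straight-line fit of this kind gives about 4.3 kcal./g.mol., close to the value of 4.40 quoted in the text.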





The value for the energy of activation agrees with that obtained by Vernon &
Taylor (1931), who found a value of approximately 4 kcal./g.mol. The positive
value of the activation energy appears to support the view that for gas-phase reactions
a correction due to the heat of adsorption ought to be made. This had been
done in the case of the polymerization of chloroprene (Bolland & Melville 1938) 
when the corrected value is of the same order as that given here. It will be shown in 
a later section that this energy of activation is wholly due to the propagation step 
of the reaction, and since the primary quantum efficiency is unity, i.e. as high as 
possible, it is probable that the rate of initiation is temperature independent, so that 
the energy of activation of the termination process is zero. 

Transfer reactions 

Although it has been stated in several papers that transfer is liable to occur in 
bulk polymerization, there is little in the way of concrete evidence to support the 
view. Although it has been shown that there is no detectable transfer in this reaction
it might be of interest to work out the formal kinetics of this type of reaction on the 
lines adopted by Melville & Gee (1944). 

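The reaction steps may be formulated as

$$\begin{aligned}
M + h\nu &= P_1 &&\text{initiation}\\
P_1 + M &= P_2 \quad (k_1) &&\\
P_n + M &= P_{n+1} \quad (k_n) &&\text{growth}\\
P_n + M &= M_n + P_1 \quad (k_{fn}) &&\text{transfer}\\
P_n + P_m &= M_n + M_m \quad (k_{tn}^{1/2}k_{tm}^{1/2}) &&\text{termination}
\end{aligned}$$

where the symbols have their usual significance. Since the reaction ceases when the
light is removed, it follows that the stationary state method of analysis can be used.
Therefore

$$\frac{d(P_1)}{dt} = f(I) + \sum_{s\ge 2} k_{fs}(P_s)(M) - k_1(P_1)(M) - k_{t1}^{1/2}(P_1)\sum_{s} k_{ts}^{1/2}(P_s) = 0,$$

or, generally,

$$\frac{d(P_n)}{dt} = k_{n-1}(P_{n-1})(M) - k_n(P_n)(M) - k_{fn}(P_n)(M) - k_{tn}^{1/2}(P_n)\sum_{s} k_{ts}^{1/2}(P_s) = 0.$$

By adding these equations to infinity then

$$f(I) = \Bigl\{\sum_{s} k_{ts}^{1/2}(P_s)\Bigr\}^{2}.$$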

The rate of reaction will be given by

Now, assuming, after the manner of Melville & Gee, that

where the two constants so defined are both independent of the molecular size, then

If, as is general, the chains are very long, then the first term may be omitted and
substitution of the value of the sum found above will lead to

This means that the rate of reaction will still be proportional to the square root of the
light intensity providing that the mutual termination mechanism is still operative for the
final breaking of the chains.

DETERMINATION OF THE LIFE TIME OF THE ACTIVE POLYMER

As was pointed out earlier in this paper, the overall rate for the bulk photo¬ 
polymerization of vinyl acetate is proportional to the square root of the intensity 
of the absorbed radiation. This feature of the reaction makes it possible to obtain 
experimentally the value of the life time of the growing chain. The fact that the 
reaction velocity is constant during the initial stages of the reaction makes this a 
comparatively straightforward matter. The use of intermittent illumination to
achieve the measurement of the life time of the active particle was suggested by
Chapman, Briers & Walters (1926), but their mathematical analysis made it necessary
to rely on trial and error methods for the ultimate determination. The mathematical
treatment was considerably simplified by Melville (1937) in an attempt to
apply the method to the vapour-phase polymerization of methyl methacrylate. The
choice of this monomer made it impossible, on account of the complexity of the
reaction, to obtain much more than a very rough idea of the life time.

The theory which will be elaborated here is a simplification of a somewhat more
complicated theory due to Dickinson (1941). The theory will be applicable to all
chain reactions in which the rate obeys the square-root law which has been mentioned.
In the case of polymers there is the added difficulty that the values of the
kinetic coefficients may depend on the molecular size of the
polymer molecules involved, so that in the simplified theory only an average value
for these quantities can be obtained. It is, however, easily practicable to change
the molecular size of the particles at will by varying the intensity of the light, since
the molecular weight of the product is inversely proportional to the square root of
the light intensity. By this method it is possible to investigate any change
in the values of the kinetic coefficients with molecular size. By measuring the life
time of the active particle at different temperatures it is also possible to separate





G. M. Burnett and H. W. Melville 


the energies of activation for the various stages of the reaction, and this will eventually
lead to the complete fundamental analysis of the reaction.

One of the main difficulties with this reaction is that the chain length cannot be
reduced much below 150. With this length of chain it is highly probable that the
kinetic coefficients are relatively insensitive to further changes in molecular size.
At much lower chain lengths, say of the order of 10, this is unlikely to be true, and an
investigation along those lines might be profitable.


THEORY OF INTERMITTENT ILLUMINATION

In general, it can be stated that if the rate, $R$, of a photochemical reaction is
related to the intensity of absorbed radiation by

$$R \propto I^{n},$$

where $n < 1$, then for intermittent illumination in which the periods of light and dark
are very short, i.e. very much less than the life time of the active species, and
are equal, the situation is the same as that for using half the intensity, so that

$$R_{\mathrm{short}} \propto (I/2)^{n}.$$

For long periods of light and dark the system is virtually illuminated for only half
the time, so that, with the same constant of proportionality,

$$R_{\mathrm{long}} \propto \tfrac{1}{2} I^{n}.$$

Hence follows the ratio

$$R_{\mathrm{short}}/R_{\mathrm{long}} = 2^{1-n},$$

and if $n = \tfrac{1}{2}$, then $R_{\mathrm{short}}/R_{\mathrm{long}} = \sqrt{2}$.
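The two limiting cases can be checked numerically. The sketch below is only an illustration: the function name is hypothetical, and it assumes a rate law of the form rate proportional to I^n with equal periods of light and dark, so that the fast-sector rate corresponds to steady light of half intensity and the slow-sector rate to full light for half the time.

```python
def fast_to_slow_ratio(n: float) -> float:
    """Ratio R_short / R_long for equal light and dark periods.

    Fast sector: equivalent to steady illumination at half intensity.
    Slow sector: full rate, but for only half of the time.
    """
    intensity = 1.0                       # arbitrary units; cancels in the ratio
    rate_short = (intensity / 2.0) ** n
    rate_long = 0.5 * intensity ** n
    return rate_short / rate_long


for n in (0.5, 1.0):
    print(n, fast_to_slow_ratio(n))
```

For n = 1/2 the ratio is the square root of 2, whereas for n = 1 it is unity, so a sector can give information on the life time only when the intensity exponent is less than one.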

In the polymerization reaction it will be assumed that the steps are represented by
the following sequence of reactions:

initiation     $f(I)$
propagation
termination    $k_t$

where $M$ is a monomer molecule, $P_r$ an active polymer molecule of $r$ units, and $M_r$
the corresponding deactivated polymer. When a steady state has been reached it
is obviously true that the stationary state equation will be

$$\frac{d(P)}{dt} = f(I) - k_t (P)_s^{2} = 0, \qquad (1)$$

where $(P)_s$ is the steady concentration of all the active particles under a steady
illumination of $f(I)$ and $k_t$ is the termination velocity coefficient. Hence, from
equation (1),

$$(P)_s = \{f(I)/k_t\}^{1/2}. \qquad (2)$$




For intermittent illumination the periods of light and dark have to be considered
separately. During the light period the variation in $(P)$ will be governed by the
equation

$$\frac{d(P)}{dt} = f(I) - k_t (P)^{2}, \qquad (3)$$

and during the dark period by

$$\frac{d(P)}{dt} = -k_t (P)^{2}. \qquad (4)$$

Integration of equation (3) leads to the instantaneous value of the concentration of
all the active particles at any time during the light flash, i.e.

$$(P) = \{f(I)/k_t\}^{1/2} \tanh\left[\{f(I)\,k_t\}^{1/2}\, t + c_1\right], \qquad (5)$$

which, by substitution of the expression given in equation (2), leads to the form

$$(P) = (P)_s \tanh\{(P)_s (k_t t + c_2)\}. \qquad (6)$$

Integration of equation (4) leads to the value of $(P)$ during the dark period. This will
be

$$\frac{1}{(P)} = k_t t + c_3, \qquad (7)$$

where $(P)_s$ is the value of $(P)$ under steady illumination, and $c_1$, $c_2$ and $c_3$ are constants
of integration.
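The light and dark solutions can be checked against their differential equations numerically. The following is a minimal sketch in reduced units; the function names and the choice of fixing the constants of integration from a starting concentration are assumptions made here for illustration. The light-period form is (P) = (P)_s tanh{(P)_s k_t t + c}, and the dark-period form is 1/(P) = k_t t + 1/(P)_start.

```python
import math


def p_light(t, f, k_t, p_start=0.0):
    """(P) after illumination for time t, starting from p_start < (P)_s."""
    p_s = math.sqrt(f / k_t)              # steady value, f(I) = k_t (P)_s**2
    c = math.atanh(p_start / p_s)         # fixes the constant of integration
    return p_s * math.tanh(p_s * k_t * t + c)


def p_dark(t, k_t, p_start):
    """(P) after a time t in the dark, starting from p_start."""
    return 1.0 / (k_t * t + 1.0 / p_start)


# Check d(P)/dt = f - k_t (P)**2 in the light by a central difference.
f, k_t, t, h = 1.0, 1.0, 0.7, 1e-5
slope = (p_light(t + h, f, k_t) - p_light(t - h, f, k_t)) / (2 * h)
print(slope, f - k_t * p_light(t, f, k_t) ** 2)
```

In these reduced units (f = k_t = 1) the steady concentration is unity and the life time under steady illumination is also unity, so t is measured directly in life times.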

The average concentration of the active particles during the light period will be
given by the expression

$$(\bar{P})_l = \frac{1}{t}\int_{\text{over light}} (P)\,dt
= \frac{(P)_s}{t}\int_{T_1}^{T_2} \tanh\{(P)_s (k_t t + c_2)\}\,dt$$

$$= \frac{1}{2k_t t}\,\ln\frac{1-\tanh^{2}\{(P)_s (k_t T_1 + c_2)\}}{1-\tanh^{2}\{(P)_s (k_t T_2 + c_2)\}}
= \frac{1}{2k_t t}\,\ln\frac{1-(P)_2^{2}/(P)_s^{2}}{1-(P)_1^{2}/(P)_s^{2}}. \qquad (8)$$

In the same way the average value of $(P)$ during the dark period is given by

$$(\bar{P})_d = \frac{1}{r k_t t}\,\ln\frac{(P)_1}{(P)_2}, \qquad (9)$$

where $(P)_1$ and $(P)_2$ denote the values of $(P)$ at the beginning and end of the dark
period, $t$ is the duration of the light period and $r$ is the ratio of the dark to the light
period.

The average value of $(P)$ over all, $(P)_0$, will be defined by

$$(r+1)(P)_0 = (\bar{P})_l + r(\bar{P})_d. \qquad (10)$$





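Substituting equations (8) and (9) in equation (10) leads to the expression

$$(r+1)(P)_0 = \frac{1}{2k_t t}\,\ln\frac{1-(P)_2^{2}/(P)_s^{2}}{1-(P)_1^{2}/(P)_s^{2}} + \frac{1}{k_t t}\,\ln\frac{(P)_1}{(P)_2}. \qquad (11)$$

From equation (6) is seen the relationship that

$$\tanh^{-1}\frac{(P)_1}{(P)_s} - \tanh^{-1}\frac{(P)_2}{(P)_s} = k_t t (P)_s = t\{f(I)\,k_t\}^{1/2} = m, \qquad (12)$$

and from equation (7)

$$\frac{1}{(P)_2} - \frac{1}{(P)_1} = r k_t t. \qquad (13)$$

Equation (12) may be rewritten in logarithmic form to give

$$m = \frac{1}{2}\,\ln\frac{\{1+(P)_1/(P)_s\}\{1-(P)_2/(P)_s\}}{\{1-(P)_1/(P)_s\}\{1+(P)_2/(P)_s\}}. \qquad (14)$$

Substituting this in equation (11) leads to

$$(r+1)(P)_0 = \frac{1}{k_t t}\left[m + \ln\frac{(P)_1\{1+(P)_2/(P)_s\}}{(P)_2\{1+(P)_1/(P)_s\}}\right].$$

Substituting further the value of $(P)_2$ given by equation (13), then

$$(r+1)(P)_0 = \frac{1}{k_t t}\left[m + \ln\left\{1 + \frac{r k_t t\,(P)_1}{1+(P)_1/(P)_s}\right\}\right]. \qquad (15)$$

Since from equation (12) $m = k_t t (P)_s$, equation (15) will reduce to

$$\frac{(r+1)(P)_0}{(P)_s} = 1 + \frac{1}{m}\,\ln\left\{1 + \frac{rm}{1+(P)_s/(P)_1}\right\}. \qquad (16)$$

For the steady state the life time, $\tau$, of the active particle will be defined by

$$\tau = \frac{\text{number of } P \text{ per unit volume}}{\text{number of } P \text{ disappearing per unit volume per second}}
= \frac{(P)_s}{k_t (P)_s^{2}} = \frac{1}{k_t (P)_s},$$

which, from the definition of $m$, gives

$$\tau = t/m.$$

Hence $m$ is the number of times that the duration of the light flash is greater than
the life time of the active polymer.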
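The dependence of the average radical concentration on sector speed can be evaluated numerically. The sketch below assumes the periodic conditions tanh⁻¹{(P)_1/(P)_s} − tanh⁻¹{(P)_2/(P)_s} = m for the light period and (P)_s/(P)_2 − (P)_s/(P)_1 = rm for the dark period, and the resulting average (r+1)(P)_0/(P)_s = 1 + (1/m) ln{1 + rm/(1 + (P)_s/(P)_1)}. The function name and the use of bisection are choices made here for illustration, not part of the original treatment.

```python
import math


def sector_average(m, r):
    """Return (P)_0/(P)_s for a flash lasting m life times and dark/light ratio r."""

    def residual(x1):
        # x1 = (P)_1/(P)_s at the end of the flash; x2 follows from the dark decay.
        x2 = 1.0 / (1.0 / x1 + r * m)
        return math.atanh(x1) - math.atanh(x2) - m

    lo, hi = 1e-15, 1.0 - 1e-15           # residual increases with x1 on (0, 1)
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if residual(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    x1 = 0.5 * (lo + hi)
    return (1.0 + math.log1p(r * m / (1.0 + 1.0 / x1)) / m) / (r + 1.0)


for m in (0.01, 0.1, 1.0, 10.0, 100.0):
    print(m, sector_average(m, 1.0))
```

For very small m the result tends to (r+1)^(-1/2), the value for steady light of intensity f(I)/(r+1), and for very large m it tends slowly to 1/(r+1), the system being then illuminated effectively for the fraction 1/(r+1) of the time.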

Figure 7 shows the value of $(P)/(P)_s$ plotted as a function of $m$, calculated from
equations (6) and (7), assuming that the constants of integration can be neglected.
It will be seen that for slow rotation of a sector, i.e. when the periods of light and dark
are long compared with the life time of the active polymer, the value of $(P)/(P)_s$
rises to almost unity during the light period and falls to almost zero during the
subsequent dark period. During the succeeding cycles this behaviour is simply
repeated. When the periods of light and dark are short in comparison with the life
time, i.e. when a fast-rotating sector is used, the value of $(P)/(P)_s$ rises rapidly
during the light period and falls less rapidly during the dark period until a state is
reached in which the rise during the light period is offset by the fall during the dark
period. This results in the rapid establishment of a steady state, which, in figure 7,
is when the value of $(P)/(P)_s$ is 0.5, since the ratio of light to dark periods is 1:3 in
this case.



Figure 7. Value of $(P)/(P)_s$ as a function of time. Curve I, high-sector speed.
Curve II, low-sector speed.

Figure 8 shows the values of $(P)_1/(P)_s$, $(P)_2/(P)_s$ and $(P)_0/(P)_s$ again as a function
of $m$ for the ratio of dark to light periods equal to 3. The important part of this
figure is the first curve, since it is from this and the others corresponding to it for
different values of $r$ that the curves shown in figure 9 are derived. These curves show
the variation in the overall concentration of the active species with sector speed for
various values of $r$. For convenience the ordinates have been multiplied by $(r+1)^{1/2}$
so that the curves are all asymptotic to unity. It will be seen that the higher the value
of $r$ the more rapid is the descent of the curve, but this advantage is offset by the
fact that the overall rate of the reaction is considerably reduced. Indeed, in many
cases there is little advantage in using a high ratio of light to dark, and equal periods
of light and dark will suffice since the overall change in velocity is of the order of
30 % for the transition from very fast to very slow sector speeds.

As was stated earlier this idea is based on a somewhat similar analysis due to
Dickinson (1941) and possesses several advantages over the original theory by
Chapman et al. (1926). As will be seen from the curves in figure 9, for $r = 1$, i.e. for
equal periods of light and dark, the value of $(P)_0/(P)_s$ begins to fall off when $m = 1$,
i.e. when the duration of the light flash is equal to the life time of the active particle
under steady illumination. Now the rate of the reaction will be proportional to the
concentration of the active particles, so that the relative values of the rate under
steady and intermittent illumination will follow the same course as the curves for
the relative concentrations under the different conditions. This means that the
value of the rate will also begin to fall when the value of $m$ is unity, so that the value
of the life time of the active species can be stated definitely without further calculation.
It should be noted that the value of the life time obtained is under steady
illumination. Alternatively, the rate of reaction may be plotted against $\log(t f(I)^{1/2})$,
and on the same scale the appropriate curve from figure 9. The separation of these
two curves will automatically give the value of $\log k_t^{1/2}$, so that the velocity coefficient
for the termination reaction may be determined directly.

Figure 8. Values of $(P)_1/(P)_s$, $(P)_2/(P)_s$, $(P)_0/(P)_s$ as functions of $m$.

Figure 9. Theoretical curves of reaction velocity as functions of $m$, for various values of $r$.
Curve I, $r = 1$; curve II, $r = 3$; curve III, $r = 8$; curve IV, $r = 16$; curve V, $r = 60$.




Another method of finding the value of $\tau$ eliminates the necessity of drawing the
full curves with any degree of accuracy. It is merely necessary to obtain the rate of
reaction with the sector rotating at a very high speed and then at some intermediate
speed for which the duration of the light flash is known. The ratio of the rates for
these two speeds is calculated, and the corresponding value of $m$ found from the
theoretical curve which is applicable. Hence the value of $\tau$ may be found since $m$
is, by definition, the number of times that the duration of the light flash is greater
than the life time. Such a simplification was not used here, since it was of greater
interest to see how the theory corresponded to practice, but the method was used
later in connexion with the value of the kinetic coefficients in precipitating media,
details of which will be published in a later paper. The term $f(I)$ which has been
used throughout will, in effect, be the absolute rate of initiation of the chains and,
in this case, fortunately, it is possible to deduce this from the benzoquinone inhibition
which was described earlier. This naturally limits the applicability of the method
to systems in which the primary quantum efficiency can be
determined.
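The two-speed method just described amounts to inverting the theoretical curve. The sketch below is an illustration under stated assumptions: it takes the theoretical curve to be (r+1)(P)_0/(P)_s = 1 + (1/m) ln{1 + rm/(1 + (P)_s/(P)_1)}, normalized by (r+1)^(1/2) so that it is unity at very high sector speeds, and it finds m by bisection. The function names and the numerical values in the example are hypothetical, not measurements.

```python
import math


def normalized_rate(m, r):
    """Rate with the sector, relative to the rate at very high sector speed."""

    def residual(x1):
        x2 = 1.0 / (1.0 / x1 + r * m)                 # decay during the dark period
        return math.atanh(x1) - math.atanh(x2) - m   # rise during the light period

    lo, hi = 1e-15, 1.0 - 1e-15
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if residual(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    x1 = 0.5 * (lo + hi)
    average = (1.0 + math.log1p(r * m / (1.0 + 1.0 / x1)) / m) / (r + 1.0)
    return average * math.sqrt(r + 1.0)


def life_time(rate_ratio, flash_sec, r):
    """Life time tau = t/m from the ratio (intermediate speed)/(very high speed)."""
    lo, hi = 1e-6, 1e6                    # search range for m, on a logarithmic scale
    for _ in range(200):
        mid = math.sqrt(lo * hi)
        if normalized_rate(mid, r) > rate_ratio:
            lo = mid                      # rate still too high: m must be larger
        else:
            hi = mid
    return flash_sec / math.sqrt(lo * hi)


# Hypothetical example: a 50 msec. flash with r = 3 giving the ratio expected for m = 2.
ratio = normalized_rate(2.0, 3.0)
print(ratio, life_time(ratio, 0.050, 3.0))
```

The search relies on the normalized rate falling steadily as m increases, which is the behaviour of the theoretical curves; a measured ratio outside the range between (r+1)^(-1/2) and unity has no solution.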


Experimental technique 

The simplest method of obtaining intermittent illumination is by the use of 
disks, cut with sectors, which are rotated at a constant speed in the path of the 
light. This practice has been followed here. The disks were made from galvanized
iron sheet, painted black to avoid reflexion effects, and having a diameter of about 
14 in. The sectors cut in the disks had a depth of about 2 in. and an opening of either 
\7T or The former was used when long periods of light and dark were required 
and the latter for short periods of illumination—down to about 7 msec. This was 
found to be the most satisfactory method, since it was not found to be possible to 
control the speed of the motor at excessively low speeds. The sector opening has to
be greater than the area of the reaction vessel in order to avoid merely scanning the 
system. 

The sector disk was screwed to a copper bush whose spindle was insulated and 
connected to the driving axle of a 35 W universal electric motor. The outside of the 
spindle of the bush was fitted with two pieces of inlaid insulation. Brushes, made 
from phosphor bronze strip, were constructed to run on the spindle; the system was 
connected through a 12 V supply to a Veeder Root magnetic counter having a
resolution greater than SOO/miu- The time required for a given number of revolutions 
was measured on a stop-watch reading to 0.1 sec., and from this the duration of the
light flash could be calculated. The speed of the motor could be varied by means of
a ‘Variac’ autotransformer whose output varied from 0 to 250 V. Checks on the
speed of the motor were frequently made in order to ensure constant running
conditions.

In carrying out a run the following procedure was adopted. Since, as has already 
been pointed out, the rate of reaction is constant during the initial stages, it is 
possible to carry out a complete determination of the life time in one run. The sector 



was allowed to run at a constant speed for 10 to 16 min., and the drop in the meniscus 
level measured by means of a cathetometer. The speed of the motor was altered and 
the rate of reaction again determined. This made it possible to cover a considerable 
number of sector speeds in one run, but obviously the technique can be used only if 
the reaction is a linear one. 

These experiments were carried out at temperatures of 15.9 and 31.4° C to find
the values of the energies of activation for the termination and propagation steps
of the reaction, since it is assumed that the rate of initiation is independent of
temperature. To check the possible effect of molecular size the intensity was reduced
by the insertion of a suitable screen, to 0.25 of its original value, which has the effect
of doubling the chain length.

Results 

The main results of the experiments are set out in table 7.

Table 7. Rate of reaction compared with duration of flash

            at 15.9° C                            at 31.4° C
time of flash    % rate for high      time of flash    % rate for high
   (msec.)        sector speeds          (msec.)        sector speeds
     10               100                  10               100
     16.5             100                  15               100
     20               100                  20               100
     22               100                  35                95.5
     35                96.4                90                88.5
     45                80.6               250                79.0
    180                76.7               350                74.2
    240                73.6



The values are plotted as a function of the time of flash in figure 10. From this
curve compared with the theoretical curve and using the method of interpretation
suggested, the life time of the active particle is $2.26 \times 10^{-2}$ sec. at 15.9° C.

Figure 10. Rate of reaction as a function of duration of flash.

As was explained earlier, the equations used in the calculation of the individual
constants have to be simplified in comparison with those used by Melville & Gee.
It is assumed that for the propagation step the actual value of the velocity coefficient
can be written as $k_p$, but there is no assumption that this coefficient is, indeed,
independent of molecular size. Similarly, the velocity coefficient for termination is
written as $k_t$. Thus it will be seen that the results quoted here will be average values
for some particular average degree of polymerization. Making these assumptions the
steady state equation will be

$$\frac{d(P)}{dt} = f(I) - k_t (P)^{2} = 0,$$

where $(P)$ denotes the concentration of all the active particles. The overall rate of
reaction will be given by the expression

$$-\frac{d(M)}{dt} = k_p (P)(M). \qquad (17)$$

Since $f(I)$ is the intensity of the light usefully absorbed it is, in effect, the rate of
starting of the chains. It was found that the length of the inhibition period for the
addition of 0.0024 g. of benzoquinone was 48 min. Assuming, as previously, that one
molecule of the quinone is effective in stopping a chain then the useful light input

$$f(I) = 5.0 \times 10^{15}\ h\nu/\text{sec.}$$

The final results are required to be in litre, g.mol. and second units, so that this has
to be converted to units corresponding to those. The measured capacity of the
reaction vessel was 13.0 c.c., and we consider that we are justified in saying that
this is the volume of the absorbing medium. Hence

$$f(I) = 6.35 \times 10^{-7}\ \text{einstein l.}^{-1}\,\text{sec.}^{-1}.$$
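The change of units can be followed in a short calculation. This is a sketch only: the useful light input is taken as 5.0 × 10^15 quanta per second (the exponent being inferred here from the converted value of 6.35 × 10^-7 einstein l.^-1 sec.^-1), the vessel volume as 13.0 c.c., and the modern value of the Avogadro number is used, which is an assumption and not the figure available to the authors. The function name is hypothetical.

```python
AVOGADRO = 6.022e23   # quanta per einstein; modern value, assumed here


def einstein_per_litre_sec(quanta_per_sec, volume_cc):
    """Convert an absorbed photon rate in a vessel to einstein l^-1 sec^-1."""
    volume_litre = volume_cc / 1000.0
    return quanta_per_sec / AVOGADRO / volume_litre


f_I = einstein_per_litre_sec(5.0e15, 13.0)
print(f"{f_I:.2e} einstein per litre per sec.")
```

The result, about 6.4 × 10^-7, agrees with the quoted 6.35 × 10^-7 to within one per cent, the small difference being attributable to the value adopted for the Avogadro number.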

The overall rate of polymerization under this steady illumination was found by
direct measurement on the dilatometer and gives that

$$-\frac{d(M)}{dt} = 1.1 \times 10^{-4}\ \text{g.mol. l.}^{-1}\,\text{sec.}^{-1},$$

taking the density of vinyl acetate as 0.86.

It is possible to show that under conditions of steady illumination, the following
equation is true provided that $\tau$ is small (Jones & Melville 1940):

$$\frac{d(P)}{dt} = \frac{(P)}{\tau}. \qquad (18)$$

In this case, the value of $d(P)/dt$ is given, which is the rate of starting chains, and $\tau$,
the life time of the active species, so that it is possible to calculate the concentration
of the active particles. This gives

$$(P) = 1.42 \times 10^{-8}\ \text{g.mol./l.}$$

In order to calculate the velocity coefficient for the growth process equation (17)
can be rewritten in the form

$$k_p = \frac{-d(M)/dt}{(P)(M)}.$$

Substituting the relevant values in this, then

$$k_p = 7.7 \times 10^{2}\ \text{l. g.mol.}^{-1}\,\text{sec.}^{-1}.$$

Also from equation (18) it is seen that

$$k_t = \frac{k_p^{2}(M)^{2} f(I)}{\{-d(M)/dt\}^{2}},$$

which, on substitution of the appropriate figures, leads to

$$k_t = 3.1 \times 10^{9}\ \text{l. g.mol.}^{-1}\,\text{sec.}^{-1}.$$
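The arithmetic leading to the radical concentration and the two velocity coefficients can be retraced as follows. This is a sketch under stated assumptions: the relations used are (P) = f(I)τ, k_p = (−d(M)/dt)/((P)(M)) and k_t = k_p²(M)²f(I)/(−d(M)/dt)², the input figures are f(I) = 6.35 × 10^-7, τ = 2.26 × 10^-2 sec. and −d(M)/dt = 1.1 × 10^-4, and the monomer concentration is computed from the density 0.86 with a molecular weight of 86.09 for vinyl acetate, the latter being a value supplied here and not stated in the text.

```python
f_I = 6.35e-7        # rate of initiation, einstein l^-1 sec^-1
tau = 2.26e-2        # life time of the active polymer, sec.
rate = 1.1e-4        # overall rate -d(M)/dt, g.mol. l^-1 sec^-1
density = 0.86       # g. per c.c., as taken in the text
MOL_WT = 86.09       # assumed molecular weight of vinyl acetate

M = density * 1000.0 / MOL_WT            # monomer concentration, g.mol./l.
P = f_I * tau                            # from d(P)/dt = (P)/tau
k_p = rate / (P * M)                     # from -d(M)/dt = k_p (P)(M)
k_t = k_p ** 2 * M ** 2 * f_I / rate ** 2

print(f"(M) = {M:.2f}  (P) = {P:.2e}  k_p = {k_p:.0f}  k_t = {k_t:.2e}")
```

The expression for k_t is equivalent to f(I)/(P)², or 1/(f(I)τ²), so that the termination coefficient depends only on the rate of initiation and the life time and not on the measured rate of polymerization.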

These experiments were repeated at 31.4° C when it was found that the value of
the life of the active polymer had not changed appreciably; the new value of the
life time was $2.30 \times 10^{-2}$ sec. It was considered that this was within the limits of
experimental error to be expected from this type of reaction, so it can be said that
the life time does not change. It is obvious from the analysis that has preceded
this that if the life time does not change then the value of the velocity coefficient for
the termination process also has not changed. This means that the energy of activation
for the termination process must be zero, and, therefore, the overall energy of
activation which has already been given must be the energy of activation of the
propagation step, since it is justifiable to assume the initiation rate to be temperature
independent. Since $k_p$ must be defined by

$$k_p = A_p e^{-E/RT}, \qquad (19)$$

where $E$ is the energy of activation of the propagation step, this gives a value of
$1.66 \times 10^{6}$ l. g.mol.$^{-1}$ sec.$^{-1}$ for $A_p$.

Assuming, as is most likely, that the growth reaction is as a result of a bimolecular
collision between the growing polymer and the monomer, the number of such
collisions taking place in a normal liquid per sec. is $10^{11}$ (Moelwyn-Hughes 1940), so
that the steric factor will be of the order of $10^{-5}$. The value of $10^{11}$ for the number of
collisions per sec. is taken, in this case, since it is believed that the number of
collisions between the monomeric molecules and the active end of the chain will be
of the same order as that between molecules in a pure liquid. In effect, the motion
of the macromolecule as a whole is not considered but only the active part of it,
whose motion will not be seriously restricted. In the same way it is obvious that the
$A$ factor for the termination reaction will be of the order of $10^{9}$ so that the steric
factor will be around $10^{-2}$.

From the constants which have been given it is possible to calculate the value of
the kinetic chain length. This is done most simply from the relationship

$$\nu_k = \frac{-d(M)/dt}{f(I)},$$

where $\nu_k$ is the kinetic chain length. The relevant figures give the value of the chain
length as 174. In order to check this conclusively osmometric measurements were
made on the polymer in order to obtain the number average molecular weight. This
was found to correspond to a chain length of 176.
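The kinetic chain length is simply the number of monomer molecules consumed per chain started. A minimal check, using the figures quoted for the higher intensity (the variable names are chosen here for illustration):

```python
rate = 1.1e-4     # -d(M)/dt, g.mol. l^-1 sec^-1
f_I = 6.35e-7     # rate of starting chains, einstein l^-1 sec^-1

chain_length = rate / f_I
print(round(chain_length))
```

The rounded inputs give a value close to 173, in agreement with the 174 quoted and with the osmometric figure of 176 to within the accuracy of the data.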

A similar series of experiments was carried out with a light intensity of
$1.6 \times 10^{-7}$ einstein l.$^{-1}$ sec.$^{-1}$. In this case the chain length was increased, but the
life time did not increase in quite the same ratio. Table 8 gives the results of both
series for comparative purposes.


Table 8. The kinetic coefficients at 15.9° C

f(I) (einstein l.^-1 sec.^-1)          1.60 × 10^-7      6.35 × 10^-7
τ (sec.)                               5.10 × 10^-2      2.26 × 10^-2
−d(M)/dt (g.mol. l.^-1 sec.^-1)        6.51 × 10^-5      1.10 × 10^-4
k_p (l. mol.^-1 sec.^-1)               6.7 × 10^2        7.7 × 10^2
k_t (l. mol.^-1 sec.^-1)               2.5 × 10^9        3.1 × 10^9
k_t/k_p                                3.8 × 10^6        4.0 × 10^6
ν_k (kinetic chain length)             339               174
n (osmometric chain length)            352               176
steric factor (propagation)            10^-5             10^-5
steric factor (termination)            10^-2             10^-2


Figure 11. Experimental and theoretical curves, plotted against log m. Curve I, theoretical.
Curve II, experimental.

There appears to be some reason to expect a slight variation in the values of the 
kinetic coefficients, as the degree of polymerization varies. This is compatible with 
the idea that the macromolecules are extended in solution. If the original molecular 
model of a spherical molecule with an active patch had been correct, it is easy to 
see that the order of the change would have been considerably greater. Thus an 
extended molecule is seen whose active end is subjected to only slightly restricted 
motion. 

Figure 11 shows a normalized experimental curve (curve II) with the theoretical
curve (curve I). It will be seen that except in the lower regions there is very good
agreement. It may be that the discrepancy is due to the fact that the theory of
polymer reactions has had to be very much simplified in order to allow of the formulation
of the theory of intermittent illumination. In the lower region, i.e. with
slower sector speeds, if the degree of polymerization increased it is possible that there
is a drop in the value of the termination coefficient; this would account for the
discrepancy.

The most interesting of the results appears to be the comparatively high value of 
the steric factor for termination as compared with that for the propagation step. 
Evans (1946) has attempted to explain this on the basis of the transition state
theory. For this he assumes that the monomeric unit in the free state is capable of 
oscillating more freely than one incorporated in the polymer molecule. From this 
premise he has been able to correlate these results with the theory and derive results 
which are of a reasonable order of magnitude considering the nature of the reaction. 

The authors have to thank Mr C. R. Masson, B.Sc., for the osmometric results 
quoted, and one of us (G.M.B.) is indebted to Distillers’ Co. Ltd. for financial 
assistance (1946-6). 


REFERENCES

Bolland, J. L. & Melville, H. W. 1938 Proc. Rubber Tech. Conf. Preprint no. 90.
Breitenbach, J. W. & Raff, R. 1939 Ber. dtsch. chem. Ges. 69, 1107.
Chapman, D. L., Briers, F. & Walters, E. 1926 J. Chem. Soc. p. 662.
Cruickshank, J., Masson, C. R., Melville, H. W. & Menzies, R. F. 1946 Nature, 157, 74.
Cuthbertson, A. C., Gee, G. & Rideal, E. K. 1937 Nature, 140, 889.
Dickinson, R. G. 1941 Photochemistry of gases, p. 200. Leighton and Noyes (Reinhold).
Evans, M. G. 1946 (In the Press.)
Foord, S. G. 1940 J. Chem. Soc. p. 48.
Fuoss, R. M. & Mead, E. J. 1943 J. Phys. Chem. 47, 59.
Goldfinger, G., Skeist, I. & Mark, H. 1943 J. Phys. Chem. 47, 578.
Jones, T. T. & Melville, H. W. 1940 Proc. Roy. Soc. A, 175, 392.
Marchlewski, L. & Moroz, A. 1924 Bull. Soc. chim. Paris, 35, 473.
Melville, H. W. 1937 Proc. Roy. Soc. A, 163, 611.
Melville, H. W. & Gee, G. 1944 Trans. Faraday Soc. 40, 240.
Moelwyn-Hughes, E. A. 1940 Physical chemistry, p. 72. Camb. Univ. Press.
Norrish, R. G. W. & Brookman, E. F. 1937 Proc. Roy. Soc. A, 163, 205.
Schulz, G. V. 1941 Z. Elektrochem. 47, 266.
Starkweather, H. W. & Taylor, G. B. 1930 J. Amer. Chem. Soc. 52, 4708.
Vernon, A. A. & Taylor, H. S. 1931 J. Amer. Chem. Soc. 53, 2627.



Determination of the velocity coefficients for 
polymerization processes 

II. The photosensitized polymerization of vinyl acetate 

by benzoyl peroxide 

By G. M. Burnett and H. W. Melville, F.R.S.

Chemistry Department, University of Aberdeen 

{Received 7 November 1946 —Read 6 February 1947) 

The photopolymerization of vinyl acetate has been found to be very much accelerated by the
presence of small quantities of benzoyl peroxide. This is considered to be the result of a
photochemical decomposition of the peroxide, to give free radicals such as phenyl and
benzoate. The dependence of the rate of polymerization on the peroxide concentration
suggests that there is a considerable inhibition of the reaction due to the presence of the
peroxide. The value of the intensity exponent approaches unity as the concentration of the
peroxide is increased, and the formal analysis of the reaction suggests that the inhibition is
due to the peroxide molecules themselves. The analysis of the induction period and the dark
period shows that these are identical and therefore real, so that it can be concluded that they
depend essentially on the nature of the initiation step. The value of the initiation velocity
coefficient is calculated, and the energy of activation for this step is found to be 8.4 kcal./g.mol.


Introduction

The results of the direct photopolymerization of vinyl acetate suggest, although no
concrete proof is available, that the chain carrier is a free radical. By adding benzoyl
peroxide to the monomer and illuminating at temperatures at which the thermal
catalyzed reaction is negligible, it was found that a photosensitized reaction took
place. If a kinetic similarity between a radical reaction and the photo-excited
reaction can be established, then the proof of the radical nature of the chain carrier
in the latter case is without doubt. It is evident, on account of this sensitized
reaction, that such a method is not straightforward. The thermal breakdown of
benzoyl peroxide gives rise to free radicals probably by the scheme (Hey & Waters
1935)

C6H5CO.O.O.OC.C6H5 → C6H5— + C6H5.COO— + CO2.

One or other of these free radicals is then capable of initiating a polymerization
chain. If a similarity in kinetics can be established between the thermal catalyzed
and the sensitized reaction, it is evident that the latter is also a free radical reaction.
Establishment of a like similarity between the direct photochemical and the sensitized
reaction will again show an identity of mechanism for these processes. It is
not possible to make such a comparison directly, since the catalyzed reaction has
a pronounced induction period.

The only work on the photodecomposition of benzoyl peroxide suggests that this
reaction may have as a first step the formation of similar radicals to the thermal
case, but since the investigation was carried out in the solid phase no conclusive
evidence of this is available (Kohter & Schnider 1930). Diacetyl peroxide has also
been investigated, and the results obtained in solution and with the pure liquid
suggest the formation of methyl and acetyl radicals, as the main products of the
decomposition are ethane and methane, approximately in the ratio 1:1, in the case
of the pure liquid (Walker & Wild 1937). By analogy a similar reaction might be
possible for benzoyl peroxide.


Experimental 

Benzoyl peroxide, which had been purified in the usual way, was made up as a 
standard solution in benzene. A known volume of this solution was introduced into 
a silica reaction tube and the solvent was removed in the vacuum from a water 
pump. The tubes were then sealed to a length of silica tube provided with a ground 
joint and attached to a suitable manifold. Vinyl acetate monomer, purified by 
fractional distillation and thoroughly freed from air, was distilled into the tubes
under high vacuum and the tubes sealed off. 

The rate of polymerization was followed dilatometrically, using a cathetometer 
reading to 0.01 mm.


The photosensitized reaction

With low concentrations of peroxide and temperatures of the order of 30° C the
rate of the thermal reaction is negligible. The rates of polymerization using peroxide
concentrations from $6 \times 10^{-4}$ to $6 \times 10^{-2}$ g.mol./l. were determined. At higher
concentrations the thermal catalyzed reaction ceased to be negligible, so that the
investigation of the sensitized reaction beyond this point was not made. Figure 1
shows a typical run in which it will be noted that there is a clearly defined induction
period, after which the reaction is apparently of zero order with respect to the
monomer concentration. When the source of illumination is removed the reaction
continues for a short period. The fact that the rate curve for the dark period is the 
inverse of that of the induction period makes it certain that both of these observa¬ 
tions are real. Had the reaction been non-isothermal, then during the induction 
period the contraction due to the polymerization would have been offset to some 
extent by the expansion of the liquid itself. This would have given the appearance 
of an induction period. The reverse would be true during the dark period, so that 
the contraction during this period would be due to the polymerization and the 
contraction of the solution cooling down. The fact that the two stages are super- 
posable ensures that there is no non-isothermal effect. For comparison purposes the 
direct photopolymerization, which was measured under the same conditions, is 
also plotted in figure 1 curve II and is obviously negligible. 

At very low peroxide concentrations the rate of polymerization is proportional, 
for a constant light intensity, to the square root of the concentration. This is clearly 
seen from the figures in table 1. 
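The square-root dependence can be tested directly on the figures of table 1 by forming R/C and R/C^(1/2) for each run. In the sketch below the concentrations and maximum rates are those of table 1; the exponent of the lowest concentration is illegible in the source and is taken here as 10^-4, which is the value consistent with the tabulated R/C, so that it should be regarded as an assumption.

```python
import math

# (peroxide concentration C in g.mol./l., maximum rate R in % per sec.)
runs = [
    (6.0e-4, 0.00113),   # exponent assumed; see the note above
    (1.2e-3, 0.00160),
    (2.0e-3, 0.00206),
    (4.8e-3, 0.00350),
]

for conc, rate in runs:
    first_order = rate / conc                 # falls steadily if R is not proportional to C
    half_order = rate / math.sqrt(conc)       # constant if R is proportional to C**0.5
    print(f"C = {conc:.1e}  R/C = {first_order:.2f}  R/C^0.5 x 100 = {100 * half_order:.2f}")
```

R/C falls by a factor of more than two over the range, while R/C^(1/2) is constant to within about one per cent for the three lowest concentrations and begins to drift only at the highest.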



Velocity coefficients for polymerization processes. II 483 

Figure 1. The photosensitized polymerization of vinyl acetate,
with benzoyl peroxide, at 30° C.

Table 1. Effect of low peroxide concentrations on the rate of reaction

peroxide concentration      max. rate R
     C (g.mol./l.)          (% per sec.)        R/C       R/C^(1/2) × 10^2
     6.0 × 10^-4              0.00113           1.87           4.64
     1.2 × 10^-3              0.00160           1.33           4.66
     2.0 × 10^-3              0.00206           1.03           4.62
     4.8 × 10^-3              0.00350           0.73           5.04

At higher values of the peroxide concentration this simple dependence disappears,
and the rate of polymerization becomes less dependent on the concentration of
peroxide. After a certain concentration is reached the rate of polymerization begins
to fall slightly as the peroxide concentration is increased. Table 2 shows the results
obtained with the higher catalyst concentrations, in which this behaviour is
demonstrated. The falling off in the rate at this stage must mean that the peroxide
molecules or one or other of the radicals formed must be capable of stopping the
chains. The removal of the radicals or molecules in this way would lower the velocity
of polymerization, first, by the premature removal of potential chain initiators and,
secondly, by the shortening of the reaction chain. There is also the possibility of the
removal of possible initiators by the formation of diphenyl or phenyl benzoate.





A somewhat similar situation was observed by Schulz (1941) in the case of the 
polymerization of both styrene and methyl methacrylate using free radicals formed
by the thermal decomposition of tetraphenylsuccinodinitrile. In this case there is 
also a type of inhibition at high radical concentrations which he attributes to the 
addition of the dinitrile molecule to the double bond of the monomer. This will 
again result in the removal of radicals which have the ability to initiate chains. 

The intensity of radiation was varied by means of suitable perforated screens, of 
known transmission. The rates of reaction under the full intensity and under the 
reduced intensity were measured separately in the usual way. The dependence of 
the rate of reaction on the intensity of illumination is a sensitive test of the occurrence 
of inhibition, so that the intensity exponent was measured over a considerable range 
of peroxide concentration, in order that more information on the nature of the 
inhibition could be obtained. Table 3 shows the values of the intensity exponent 
obtained in these experiments. 

Table 2. Effect of high peroxide concentrations on the rate of reaction

peroxide concentration      rate
(g.mol./l.) × 10^2          (% per sec.) × 10^2
 9.5                        1.69
19.0                        2.15
29.0                        2.52
38.0                        2.46
46.0                        2.31
58.0                        2.23


Table 3. The intensity exponent
Screen transmission = 0.18.

peroxide concentration      log (R_r/R_f)     intensity exponent
(g.mol./l.) (× 10^2)
 4.76                       -0.337            0.48
 9.52                       -0.421            0.56
19.0                        -0.440            0.59
28.6                        -0.462            0.62
38.1                        -0.650            0.88
57.6                        -0.663            0.89

R_f is the rate of polymerization under full intensity, and R_r that under the reduced intensity.
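The intensity exponent in table 3 follows from the logarithm of the rate ratio divided by the logarithm of the screen transmission. A minimal sketch of that arithmetic is given below; the function name `intensity_exponent` is hypothetical, and the logarithms are taken as base 10 (an assumption, though the ratio of two logarithms does not depend on the base provided both use the same one).

```python
import math

SCREEN_TRANSMISSION = 0.18  # stated in the heading of table 3


def intensity_exponent(log10_rate_ratio, transmission=SCREEN_TRANSMISSION):
    """Exponent n in rate ~ intensity**n, from log10(R_r/R_f)."""
    return log10_rate_ratio / math.log10(transmission)


# log (R_r/R_f) column of table 3, in order of increasing peroxide concentration
log_ratios = [-0.337, -0.421, -0.440, -0.462, -0.650, -0.663]
exponents = [intensity_exponent(x) for x in log_ratios]

for log_ratio, n in zip(log_ratios, exponents):
    print(f"log(R_r/R_f) = {log_ratio:+.3f}  ->  exponent = {n:.2f}")
```

An exponent near 0.5 corresponds to mutual termination of the chains; values approaching unity indicate that a first-order chain-stopping step has become important.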


From these results it is evident that at the higher peroxide concentrations there is
a side reaction which has a marked effect on the value of the intensity exponent.
The trend of this effect may be somewhat surprising, but this point will be discussed
in detail later. Under the conditions of reduced intensity the length of the induction
period remains constant within the limits of experimental error. This is to be
expected, since it depends solely on the value of the velocity coefficient for the
initiation reaction, which would be unaffected by the change in intensity.




It is not possible to vary the temperature over a very wide range, since at the 
higher temperatures the thermal catalyzed reaction will begin to have a considerable 
effect. In order to achieve as high temperatures as possible the concentration of 
the peroxide was maintained throughout at 4*8 x 10“^g.mol./l. This also reduces 
the inhibition reaction to negligible proportions. The results of these experiments 
have been plotted in the usual way in figure 2, from which the overall energy of
activation is found to be 8.7 kcal./g.mol. If it is assumed, not unjustifiably, that the
energy of activation for the propagation process is unchanged from the bulk
polymerization, i.e. 4.40 kcal./g.mol., and that the energy of activation for the
termination step is zero, then the energy of activation for the initiation process can be
found as follows. The overall activation energy for a reaction of this type, i.e. with
mutual termination, is given by

$$E_0 = \tfrac{1}{2}E_i + E_p,$$

where $E_i$ is the energy of activation for the initiation process and $E_p$ that for propagation.
Hence, substituting the appropriate values gives 8.6 kcal./g.mol. for $E_i$.
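The substitution can be written out explicitly. The sketch below simply rearranges the relation E_0 = E_i/2 + E_p for E_i, using the two values quoted in the text; the function name is hypothetical.

```python
def initiation_energy(overall, propagation):
    """Solve E_0 = E_i / 2 + E_p for E_i (termination energy taken as zero)."""
    return 2.0 * (overall - propagation)


E_OVERALL = 8.7       # kcal./g.mol., from figure 2
E_PROPAGATION = 4.40  # kcal./g.mol., assumed unchanged from the bulk reaction

e_initiation = initiation_energy(E_OVERALL, E_PROPAGATION)
print(f"E_i = {e_initiation:.1f} kcal./g.mol.")
```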



Figure 2. Overall energy of activation for sensitized reaction.

The velocity coefficient for initiation

Since the reaction shows a well-defined induction period it is possible to compute
the value of the velocity coefficient for the initiation reaction directly. For the
initial period the following equation must govern the concentration of the free
radicals:

$$\frac{d(R)}{dt} = f(I) - k_i(M)(R), \qquad (1)$$

where $(R)$ is the radical concentration, $k_i$ the velocity coefficient for the initiation
reaction, $(M)$ the concentration of the monomer, and $f(I)$ a function of the light
intensity. Integration of this leads to

$$-\frac{\ln\{f(I) - k_i(M)(R)\}}{k_i(M)} = t + c, \qquad (2)$$

and the introduction of the condition that $(R) = 0$ when $t = 0$ gives the value of
the constant of integration, $c$,

$$c = -\frac{\ln f(I)}{k_i(M)}.$$

Substitution of this in equation (2) and simplifying gives

$$\frac{f(I) - k_i(M)(R)}{f(I)} = e^{-k_i(M)t}, \qquad (3)$$

which can be written in the form

$$k_i(M)(R) = f(I)\{1 - e^{-k_i(M)t}\}. \qquad (4)$$

The rate of reaction, $r$, will be proportional to the concentration of the radicals,
$(R)$, so that

$$r \propto \frac{f(I)}{k_i(M)}\{1 - e^{-k_i(M)t}\}.$$

The maximum rate of reaction, $r_0$, will be defined by setting $t$ very large so that the
exponential term disappears, and

$$r_0 \propto \frac{f(I)}{k_i(M)}.$$

Hence

$$\frac{r}{r_0} = 1 - e^{-k_i(M)t}$$

or

$$\frac{r_0 - r}{r_0} = e^{-k_i(M)t}.$$

Hence if $\log\{(r_0 - r)/r_0\}$ is plotted against time the result ought to be a straight line
whose slope will give the value of $k_i(M)$ and hence the value of $k_i$. Table 4 gives the
rate of reaction with the time, and from the figures it is evident that the linear
relationship will hold. The half-life of the induction period is 201 sec., so that the
value of $k_i(M)$ will be 4.97 × 10^-3 sec.^-1, from which, since $(M)$ is 10 g.mol./l.,

$k_i$ = 4.97 × 10^-4 l. g.mol.^-1 sec.^-1.


Table 4. Rate of reaction over the induction period

time       rate, r           r/r_0     log {(1 - r/r_0) × 10}
(sec.)     (% per sec.)
 40        0.0020            0.141     0.93
 90        0.0039            0.278     0.85
140        0.0052            0.366     0.79
190        0.0065            0.457     0.73
240        0.0078            0.557     0.64
340        0.0100            0.715     0.44

r_0 = 0.0140 % per sec.
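The linear relationship can be tested numerically from the time and rate columns of table 4. The sketch below fits ln(1 - r/r_0) = -k_i(M) t by least squares through the origin; it is an illustration under stated assumptions, not the authors' graphical procedure. The last time entry is printed unclearly in the source and is read here as 340 sec. (an assumption), and the helper name `slope_through_origin` is hypothetical. For comparison the sketch also prints ln 2/201 and 1/201, since the figure of 4.97 × 10^-3 sec.^-1 quoted in the text coincides with the reciprocal of 201 sec.

```python
import math

R0 = 0.0140  # limiting rate r_0, % per sec. (table 4)

# (time in sec., rate in % per sec.). The final time is ASSUMED to be 340 sec.;
# it is not clearly legible in the source.
DATA = [(40, 0.0020), (90, 0.0039), (140, 0.0052),
        (190, 0.0065), (240, 0.0078), (340, 0.0100)]


def slope_through_origin(points):
    """Least-squares k for the model y = -k * t (no intercept)."""
    numerator = sum(t * y for t, y in points)
    denominator = sum(t * t for t, _ in points)
    return -numerator / denominator


log_points = [(t, math.log(1.0 - r / R0)) for t, r in DATA]
k_times_m = slope_through_origin(log_points)   # k_i(M), sec.^-1
half_life = math.log(2.0) / k_times_m          # sec.

print(f"fitted k_i(M) = {k_times_m:.2e} per sec., half-life = {half_life:.0f} sec.")
print(f"ln 2 / 201 = {math.log(2.0) / 201:.2e},  1 / 201 = {1.0 / 201:.2e}")
```

Natural logarithms are used so that the slope is k_i(M) directly; with common logarithms, as tabulated, the slope would be smaller by the factor 2.303.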


Table 5. Dependence of k_i on temperature

temperature (° C)                           26.4     33.3
k_i (l. g.mol.^-1 sec.^-1 × 10^4)           3.54     4.97




Using a similar method the value of $k_i$ was found at different temperatures and
the values are given in table 5. Using the usual logarithmic plot this leads directly
to the value of the energy of activation for the initiation process, which is 8.6 kcal./
g.mol., in good agreement with the result obtained by the indirect method.
Since $k_i$ may be defined by

$$k_i = A_i e^{-E_i/RT},$$

it is possible to calculate the A factor making use of the results which have been
obtained. This gives

$A_i$ = 5.9 × 10^2 l. g.mol.^-1 sec.^-1,

and hence it is possible to compute the steric factor. If the liquid is taken as being 
normal then the number of binary collisions per sec. will be of the order of 10^^, so 
that the steric factor is in the neighbourhood of 10~®. 
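The Arrhenius arithmetic of this section can be sketched as follows. The code makes a two-point estimate of the activation energy from the two entries of table 5 and then evaluates the A factor from the quoted energy of 8.6 kcal./g.mol. It is only a rough check: two temperatures give a crude slope, the scale of table 5 is taken as 10^-4 (an assumption, since the exponent is garbled in the source), and the function names are hypothetical.

```python
import math

R_GAS = 1.987e-3  # gas constant, kcal. per g.mol. per degree


def two_point_activation_energy(t1_c, k1, t2_c, k2):
    """Activation energy (kcal./g.mol.) from two (temperature in deg C, k) pairs."""
    t1 = t1_c + 273.15
    t2 = t2_c + 273.15
    return R_GAS * math.log(k2 / k1) / (1.0 / t1 - 1.0 / t2)


def frequency_factor(k, energy, t_c):
    """A factor from k = A * exp(-E / RT)."""
    return k * math.exp(energy / (R_GAS * (t_c + 273.15)))


# Table 5, with the k_i scale ASSUMED to be 1e-4 l. g.mol.^-1 sec.^-1
e_two_point = two_point_activation_energy(26.4, 3.54e-4, 33.3, 4.97e-4)
a_factor = frequency_factor(4.97e-4, 8.6, 33.3)

print(f"two-point E_i = {e_two_point:.1f} kcal./g.mol.")
print(f"A_i (using E_i = 8.6) = {a_factor:.1e} l. per g.mol. per sec.")
```

Both results should come out of the same order as the values quoted in the text; exact agreement is not to be expected from two points alone.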


The dark reaction

After the source of radiation is removed the reaction carries on for some time. As
has been stressed this is not due to any non-isothermal effect. The cause of this effect
appears to be the inherent inactivity of the radicals produced towards the
monomeric molecules. During the dark period the concentration of the radicals will be
given by

$$\frac{d(R)}{dt} = -k_i(M)(R).$$

An analysis similar to that shown in the last section will give that the rate, $r$, at any
time, $t$, is related to the rate, $r_0$, at zero time by the equation

$$\ln\frac{r_0}{r} = k_i(M)t.$$

Table 6 shows the results obtained, and again it will be seen that the plot of $\log (r_0/r)$
against time is linear. The value of the half-life of the reaction is 200 sec., so that

$k_i$ = 5 × 10^-4 l. g.mol.^-1 sec.^-1.

This is in good agreement with the previous result, from the half-life of the induction
period.

Table 6. The dark reaction

time       rate, r           r/r_0     log {(r/r_0) × 10}
(sec.)     (% per sec.)
 20        0.0129            0.92      0.96
 40        0.0122            0.87      0.94
 60        0.0112            0.80      0.90
 80        0.0106            0.76      0.88
120        0.0095            0.68      0.83
160        0.0078            0.56      0.75
220        0.0067            0.48      0.68
260        0.0057            0.41      0.61
300        0.0049            0.35      0.54
360        0.0038            0.27      0.43
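The decay of the rate in the dark can be examined without assuming the initial rate, by regressing ln r on time and reading r_0 from the intercept. The sketch below does this with an ordinary least-squares line; it is an illustrative check on the time and rate columns of table 6, and the helper name `linear_fit` is hypothetical.

```python
import math

# (time in sec., rate in % per sec.) from table 6
DATA = [(20, 0.0129), (40, 0.0122), (60, 0.0112), (80, 0.0106),
        (120, 0.0095), (160, 0.0078), (220, 0.0067), (260, 0.0057),
        (300, 0.0049), (360, 0.0038)]


def linear_fit(points):
    """Ordinary least-squares (slope, intercept) for y = slope * x + intercept."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Model: ln r = ln r_0 - k_i(M) * t
slope, intercept = linear_fit([(t, math.log(r)) for t, r in DATA])
k_dark = -slope                        # k_i(M), sec.^-1
r0_estimate = math.exp(intercept)      # extrapolated rate at t = 0
half_life_dark = math.log(2.0) / k_dark

print(f"k_i(M) = {k_dark:.2e} per sec., r_0 = {r0_estimate:.4f} % per sec., "
      f"half-life = {half_life_dark:.0f} sec.")
```

The extrapolated r_0 should lie close to the limiting rate of 0.0140 % per sec. given with table 4, and the half-life close to the 200 sec. quoted in the text.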





The molecular weight of the polymer

Since there is no real interest in the absolute value of the molecular weight the
relative degrees of polymerization were measured viscosimetrically to ascertain
whether there is appreciable inhibition. Table 7 gives the results. The increase in
the concentration of the catalyst would lower the molecular weight of the product
in the normal course of events, since the chain length is inversely proportional to
the square root of the concentration. Hence had the reaction been normal the third
column of table 7 ought to be constant, but this is evidently not so. Since the value
of the product decreases throughout, the value of the chain length must be less than
the rate of starting would indicate, so that it can be concluded that there is an
appreciable inhibition effect. Similar results were obtained by Schulz for the free
radical initiated polymerization of styrene.


Table 7. Molecular weight determination

peroxide concentration     mol. wt.     M × C^(1/2)     M × C
C                          M
5.0 × 10^-4                76,500       1780            38.25
1.5 × 10^-3                37,000       1425            55.50
6.0 × 10^-3                11,000        855            66.00
2.0 × 10^-2                 5,200        728           104.0


Formal analysis of the reaction

For the sake of completeness two possible mechanisms will be analyzed; first,
considering the radical and, secondly, the peroxide molecule as being responsible
for the inhibition.

I. Assuming that the radical inhibits the reaction:

The reaction sequence for this mechanism would be

B + hν → 2R                        f(I)
R + M → P_1                        k_i
P_1 + M → P_2                      k_{p_1}
P_n + M → P_{n+1}                  k_{p_n}
P_n + P_s → inactive products      k_{t_n} k_{t_s}
P_n + R → inactive products        k_{f_n}

Only the reaction after the induction period is completed will be considered,
since it is then possible to apply stationary state equations for the solution of the
problem.





(1) When the inhibition by the radicals is negligible the terms involving $k_f$ may
be omitted and it is then easy to show that

(a) rate ∝ (peroxide conc.)^(1/2),

(b) rate ∝ (f(I))^(1/2).

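(2) When the inhibition by the radicals is comparable with the termination by
the mutual reaction, then the stationary state equations take the form

$$\frac{d(P_1)}{dt} = k_i(M)(R) - k_{p_1}(P_1)(M) - k_{f_1}(P_1)(R) - k_{t_1}(P_1)\sum k_{t_s}(P_s) = 0,$$

and, generally,

$$\frac{d(P_n)}{dt} = k_{p_{n-1}}(P_{n-1})(M) - k_{p_n}(P_n)(M) - k_{f_n}(P_n)(R) - k_{t_n}(P_n)\sum k_{t_s}(P_s) = 0.$$

Summing these equations to infinity gives the result

$$k_i(M)(R) - (R)\sum k_{f_s}(P_s) - \Big\{\sum k_{t_s}(P_s)\Big\}^2 = 0.$$

Now, as before, write $\sum k_{t_s}(P_s) = k_t\sum(P_s)$, and further postulate the relationship
$k_f = \beta k_p$, so that

$$\Big\{k_t\sum(P_s)\Big\}^2 + \beta k_p(R)\sum(P_s) - k_i(M)(R) = 0,$$

from which

$$\sum(P_s) = \frac{-\beta k_p(R) + \{\beta^2 k_p^2(R)^2 + 4k_t^2 k_i(M)(R)\}^{1/2}}{2k_t^2}.$$

If, as is likely, $\beta^2 k_p^2(R)^2 \ll 4k_t^2 k_i(M)(R)$, this will reduce to

$$\sum(P_s) = \frac{\{k_i(M)(R)\}^{1/2}}{k_t} - \frac{\beta k_p(R)}{2k_t^2}.$$

For very long chains the rate of polymerization will be given by

$$\text{rate} = k_p(M)\sum(P_s) = k_p(M)\left[\frac{\{k_i(M)(R)\}^{1/2}}{k_t} - \frac{\beta k_p(R)}{2k_t^2}\right].$$

Since the value of $(R)$ must be proportional to some power of $(B)$, the concentration
of the peroxide, and also to $f(I)$, it is obviously true that the rate is no longer
dependent on the square root of either of these two quantities.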

(3) To investigate the extreme case in which the termination by the radicals is
very great, one can eliminate from the steady state equations the terms involving
$k_t$ to give

$$\frac{d(P_1)}{dt} = k_i(M)(R) - k_{p_1}(P_1)(M) - k_{f_1}(P_1)(R) = 0,$$

$$\frac{d(P_2)}{dt} = k_{p_1}(P_1)(M) - k_{p_2}(P_2)(M) - k_{f_2}(P_2)(R) = 0,$$

and, in general,

$$\frac{d(P_n)}{dt} = k_{p_{n-1}}(P_{n-1})(M) - k_{p_n}(P_n)(M) - k_{f_n}(P_n)(R) = 0,$$

from which it is easy to show that, with $\beta = k_f/k_p$ as before,

$$k_{p_s}(P_s) = \frac{k_i(R)}{\{1 + \beta(R)/(M)\}^{s}}.$$

Hence the rate of reaction will be

$$\text{rate} = (M)\sum_{s=1}^{\infty} k_{p_s}(P_s) = k_i(M)(R)\left[\frac{1}{1 + \beta(R)/(M)} + \frac{1}{\{1 + \beta(R)/(M)\}^{2}} + \dots\right].$$

Since the part in the bracket is a geometrical progression it can be summed to give

$$\text{rate} = k_i(M)(R)\,\frac{(M)}{\beta(R)} = \frac{k_i(M)^2}{\beta},$$

which is obviously independent of both the concentration and the value of $f(I)$,
i.e. the intensity exponent would tend to zero.
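The summation of the geometrical progression, and the resulting independence of the rate from the radical concentration, can be illustrated numerically. The sketch below sums the series term by term and compares it with the closed form k_i(M)^2/β. All numerical values of k_i, β, (M) and (R) are arbitrary illustrative assumptions, not data from the paper, and the function name is hypothetical.

```python
def rate_by_direct_summation(k_i, beta, monomer, radical, terms=200_000):
    """Sum k_i (M)(R) * sum_{s>=1} 1 / (1 + beta (R)/(M))**s term by term."""
    ratio = 1.0 / (1.0 + beta * radical / monomer)
    total = 0.0
    term = ratio
    for _ in range(terms):
        total += term
        term *= ratio
    return k_i * monomer * radical * total


# Illustrative (hypothetical) values only
K_I = 5.0e-4     # l. g.mol.^-1 sec.^-1
BETA = 50.0      # ratio k_f / k_p
MONOMER = 10.0   # g.mol./l.

closed_form = K_I * MONOMER ** 2 / BETA
rates = [rate_by_direct_summation(K_I, BETA, MONOMER, radical)
         for radical in (1.0e-4, 1.0e-3)]

for radical, rate in zip((1.0e-4, 1.0e-3), rates):
    print(f"(R) = {radical:.0e}: summed rate = {rate:.6e}, "
          f"closed form = {closed_form:.6e}")
```

A tenfold change in (R) leaves the summed rate unchanged, which is the sense in which the intensity exponent tends to zero in this limiting case.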

II. Assuming that the peroxide molecule acts as an inhibitor:

(i) As before, for low values of the peroxide concentration it is easy to show that
the rate of polymerization is proportional to the square root of the concentration
and of the light intensity.

(ii) When the inhibition by the peroxide is of the same order as the chain breaking
by the termination process, a set of equations similar to those of section I (2)
above will be obtained, except that $(R)$ will be replaced by $(B)$, the benzoyl peroxide
concentration. The solution of these equations in the usual way leads to
an expression of the same form as was obtained previously, except that the
intensity dependence will remain virtually unchanged.

(iii) When the inhibition by the peroxide is very great, there is a marked difference
from the result given in section I (3), and the analysis will therefore be given in
detail.

The stationary state equations will have the form

$$\frac{d(P_1)}{dt} = k_i(M)(R) - k_{p_1}(P_1)(M) - k_{f_1}(P_1)(B) = 0,$$

$$\frac{d(P_n)}{dt} = k_{p_{n-1}}(P_{n-1})(M) - k_{p_n}(P_n)(M) - k_{f_n}(P_n)(B) = 0,$$

so that, as before,

$$k_{p_s}(P_s) = \frac{k_i(R)}{\{1 + \beta(B)/(M)\}^{s}},$$

and hence the rate will be given by the expression

$$\text{rate} = \frac{k_i(M)^2(R)}{\beta(B)}.$$

From this it must be inferred that the rate of reaction becomes proportional to the 
first power of the light intensity and also inversely proportional to some power of 
the peroxide concentration which has a maximum of It would appear, therefore, 
that the second case fits the facts which have been presented so that one may 
conclude that in this reaction the first step is the decomposition of the peroxide 
into free radicals, which are capable of initiating chains of polymerization and that
the peroxide molecules at high concentrations are responsible for the inhibition of 
the reaction. 

From the similarity of its kinetics to those of both the thermal catalyzed
and the photochemical reactions, the sensitized reaction can be taken as forming a
link between the two. From the recent work on the free radical polymerization of styrene,
using free radicals such as diphenyl methyl nitrile, (C6H5)2.C.CN, it has been shown
that the rates of polymerization and molecular weights obtained are of the same
order as with benzoyl peroxide catalysis, so that there is substantial evidence that
the latter is free radical in character. The sensitized reaction presents features which
are similar to the thermal reaction in that there is a well-defined induction period,
and is, at the same time, similar to the photoreaction in that the order of the main
reaction is zero. Hence it may be deduced that the fundamental characteristics of
the three reactions are the same, and that, in each case, the chain carrier is a free
radical.

One matter that has still to be decided is whether the initial photochemical 
activation of the vinyl acetate gives rise to the production of a diradical or whether 
the molecule is split into two radicals. In paper I it is shown that one molecule of 
polymer is produced for each molecule of vinyl acetate activated. This result can 
most readily be explained by supposing that a diradical is in fact produced and that 
termination at each end involves a disproportionization reaction. This result 
would, however, also be obtained if the vinyl acetate were dissociated and each 
radical grew separately, the polymer radicals finally disappearing by combination. 
There is no evidence to suppose that fission of vinyl acetate does in fact occur. For 
example in the gas-phase photopolymerization of vinyl acetate (Tuckett & 
Melville 1947) it can be shown that the molecule is simply activated, for the 
energy of excitation may readily be removed by collision with argon, helium or 
hydrogen. This would appear to indicate that absorption of a quantum does not lead 
to decomposition into free radicals. 

A more certain method of deciding the issue is as follows: Figure 3 shows the 
effect of benzoquinone on the course of the polymerization catalyzed by thermally
decomposing benzoyl peroxide (curves II and IV). Curve I shows the course of the
direct photo-reaction of such a velocity as to be equal to that of the catalyzed 
reaction at the end of the induction period. Curve III shows the calculated effect 
of adding precisely the same weight of benzoquinone. It will be seen that whereas 
the direct photo-reaction is completely inhibited by benzoquinone the benzoyl 
peroxide reaction is simply retarded although the quinone is removed and the 
reaction velocity approaches that in absence of added quinone. It is certain that 
the peroxide catalyzed reaction involves monoradicals only. It would appear 
therefore that the direct reaction must involve the production of diradicals, since 
it has already been shown that both reactions are of the free radical type. Although 
it is comparatively easy to formulate a structure representing this interaction of 
a diradical with benzoquinone it is not clear precisely how a monoradical will 
interact with the quinone. It may be that quinone adds on to the end of the mono¬ 
radical and the radical produced is inactive towards vinyl acetate, the radical 
finally disappearing by reaction with another of its kind. Alternatively the quinone 
might be incorporated in this chain thus forming an interpolymer with the vinyl 
acetate. 



Figure 3. Effect of benzoquinone on (a) benzoyl peroxide catalyzed reaction at 51.7° C
(curve IV), (b) direct photopolymerization (curve III). Abscissa: time (min.).


This difference in the action of benzoquinone, inhibiting in one case and retarding
in the other, is not confined to vinyl acetate. Similar experiments with styrene
and with methyl methacrylate (M. H. Mackay & W. F. Watson, unpublished) show
the same discrimination. That is, in those reactions involving a monoradical there
is retardation, while in those reactions, photochemical and thermal, believed to involve
diradicals there is inhibition. Consequently it would seem that the use of
benzoquinone may provide a useful method for deciding whether monoradicals or
diradicals are involved in polymerization reactions.






The Gelva cycle 

By analogy with ordinary esters it would be expected that the hydrolysis of 
polyvinyl acetate would result in the formation of the corresponding alcohol, i.e. 



The reacetylation of the alcohol should then give the original ester. An investigation
of the cycle showed that, in the case of catalytically produced polyvinyl acetates,
hydrolysis brought about a decrease in the chain length, so that the reacetylated
polymer had a degree of polymerization about half that of the original (McDowell &
Kenyon 1940). Subsequent hydrolysis and reacetylation cycles had no further
effect on the chain length. It appears, therefore, that the chain rupture takes place
at only one point, thus suggesting that there is in the chain a less stable link than the
normal carbon-carbon link. McDowell & Kenyon suggest that oxygen from the
peroxidic catalyst used or acetaldehyde from the monomer is introduced into the
polymer chain at a limited number of points. In photochemically produced polymer
formed in vacuo the first of these possibilities is avoided, so that a Gelva cycle was
performed on a sample of polymer produced in this way.

The hydrolysis and acetylation procedure was the same as that used by the above
authors. After the regenerated polymer had been thoroughly dried the molecular
weight was found viscosimetrically. The results are shown in table 8. These results
show quite conclusively that there is no change in the degree of polymerization in
the case of the photochemically produced polymer. This would tend to show that
the catalytically produced polymer has a hydrolysable link due to the catalyst used
in the production of the polymer.


Table 8. Gelva cycle on photopolymer

                      intrinsic      chain
                      viscosity      length      mol. wt.
original polymer      0.92           354         30,500
after cycle           0.94           362         31,000


One of us (G.M.B.) is indebted to Distillers' Co. Ltd. for financial assistance
(1945-6).

References

Fichter, F. & Schnider, A. 1930 Helv. Chim. Acta, 13, 1428.

Hey, D. H. & Waters, W. A. 1937 Chem. Rev. 21, 169.

McDowell, W. H. & Kenyon, W. O. 1940 J. Amer. Chem. Soc. 62, 415.

Schulz, G. V. 1941 Z. Elektrochem. 47, 265.

Tuckett, E. & Melville, H. W. 1947 J. Chem. Soc. (in the Press).

Walker, O. J. & Wild, G. L. E. 1937 J. Chem. Soc. p. 1132.





Determination of the velocity coefficients for 
polymerization processes 

III. Effect of environment on polymerization processes 

By G. M. Burnett and H. W. Melville, F.R.S.

Chemistry Department, University of Aberdeen

(Received 7 November 1946; read 6 February 1947)


The kinetics of the photopolymerization of vinyl acetate in ethyl acetate and n-hexane
solutions were investigated with a view to studying the effect of environment on the course
of the reaction. The ethyl acetate solutions allow the polymer to remain extended so that the
course of the reaction is similar to that of the pure monomer. With n-hexane, precipitation of
the polymer takes place with the result that a considerable 'gel' effect is observed. The
reaction rate in both cases is proportional to the monomer concentration and to the square
root of the light intensity. In ethyl acetate solutions the activation energy is 4.42 kcal./g.mol.,
which is virtually the same as for the bulk reaction, while in normal hexane the value found is
-1.30 kcal./g.mol. Reasons for this have been suggested.

The life time of the active polymer in n-hexane solution both before and after the inception
of the gel stage has been measured using the sector technique, and the values of the
coefficients for the propagation and termination reactions have been computed. It is found that
the value of the former is little changed by the change in medium, while that of the
termination step falls sharply, the fall being in agreement with the increase in rate. These results
confirm present theories on the cause of the gel stage.


Introduction

The introduction of solvents into polymerization processes leads to a number of
effects which depend, to a large extent, on the nature of the solvent used. The range 
of solvents which can be used in photochemical reactions is naturally limited to 
those whose ultra-violet absorption is considerably different from that of the 
compound under investigation. 

The effect which is of most interest here is the effect of the solvent on the course
of the polymerization reaction. The case of the catalysed polymerization of methyl
methacrylate in a variety of solvents has been fully examined (Norrish & Smith
1942). It is found that after a definite amount of conversion the rate of polymerization
increases sharply, even although special care is taken to ensure isothermal
conditions. This effect occurs at an earlier stage and to a greater extent, the poorer
the solvent is with respect to the polymer, giving a maximum effect in the case of
precipitants. The cause of this increase in rate is the main interest in this paper. The
following suggestions have already been made without the necessary proof. The
increase in the rate of reaction may be due to (a) increase in the rate of initiation,
(b) increase in the rate of propagation, (c) decrease in the rate of termination. The
first idea is easily discounted in view of the fact that the increase in rate is
accompanied by an increase in the molecular weight of the polymer formed, which is
obviously incompatible with an increase in the rate of starting the
chains. (The increase in the rate has been termed the 'explosive-like' stage of the
reaction by Schulz & Blaschke (1941), but we consider that this term is misleading,
since the reaction does not involve the branching of chains, so that throughout the
phenomenon will be referred to as the 'gel effect', since its appearance is bound up
with the increasing viscosity of a solution, or the actual formation of a gel.) It is also
very difficult to explain satisfactorily how an increase in the rate of the propagation
step may take place. The decrease in the rate of termination may be readily accounted
for, in the case of pure monomer, by the increase in the viscosity, putting a great
enough restriction on the motion of the growing molecules to prevent the active
ends from making contact and so preventing chain breaking. This would account
for the increase both in the rate and the degree of polymerization. With solutions
the phenomenon appears to depend on the coiling of the macromolecule in poor
solvents, again preventing the contact of the active ends to terminate the growth of
the chain. From this theory it is obvious that the rate and degree of polymerization
will not increase if termination occurs by the anomalous addition of a monomer
molecule. The photopolymerization of vinyl acetate in two solvents, ethyl acetate
and n-hexane, has been studied with a view to clarifying the position.

The second effect, which is probably not of very great importance here, is that of
transfer, whereby the reaction of a growing polymer molecule with a solvent molecule
results in the production of a dead polymer and a solvent fragment which is capable
of initiating a fresh chain. This may be shown by the following reaction sequence:

initiation      M → P_1                       I
propagation     P_n + M → P_{n+1}             II
transfer        P_n + XS → M_n + S-           IIIa
                S- + M → P_1                  IIIb
                2S- → reaction products       IV

in which M is a monomer and XS a solvent molecule, P_n is the active polymer
molecule containing n monomeric units and M_n is the corresponding dead polymer.
For chain transfer to occur without a decrease in the velocity of polymerization
reactions IIIa and IIIb will be fast compared to IV. If this is not the case then the
velocity of reaction will fall, and in the extreme case where IV is very much faster
than IIIb the solvent will act as a retarder. In any of these cases the degree of
polymerization will fall.

For free radical mechanisms it is obvious that the easier the formation of free
radicals by the solvent the more likely is transfer to occur. This question has been
investigated by Mayo (1943) on the basis of results by Suess, Pilch & Rudorfer (1937)
and Schulz & Husemann (1939).





Experimental

The n-hexane used was of spectroscopic purity. Ethyl acetate was purified by
fractional distillation, and the fraction boiling, under atmospheric pressure, at
77 to 78° C was retained. The taps of the vacuum apparatus were greased with silicone,
thickened with talc, in the case of hexane. Both the solvents and the monomer were
thoroughly freed from air by freezing in liquid air and evacuating, as was described
in an earlier paper. The experiments were carried out under vacuum in sealed
tubes.

The monomer, which was thoroughly outgassed, was distilled under high vacuum
into a graduated tube from which a known volume could be distilled to the reaction
tube. The solvent was then distilled in to make a known volume of solution. The
reaction was followed dilatometrically using a cathetometer to measure the fall in
the level of the meniscus.


Ethyl acetate solutions 

Solutions of the monomer in ethyl acetate were made up, as has been described,
having concentrations from 1.5 to 4.5 g.mol./l. of vinyl acetate. Irradiation of the
solution with ultra-violet light produced a ready polymerization: the course of the
reaction is shown in figure 1. It will be seen that, as in the case of the bulk
photopolymerization, the rate of reaction is initially constant. After some time the rate
begins to fall off, as might be expected as the monomer is used up. It will be noticed
also that there is no sign of the inception of the gel stage in the polymerization, so
that in this case it may be readily assumed that the macromolecule remains extended
in solution, thus placing no very great restriction on the motion of the active ends
of the polymer molecules. The steady state concentration of the active molecules
must also be very rapidly established and the life time of the active species short,
since the reaction ceases immediately on the removal of the source of radiation.

Melville & Gee (1944) have shown that in reactions in which the termination is
due to the interaction of two of the growing polymer molecules, the rate of
polymerization is proportional to the square root of the light intensity and also to the first
power of the monomer concentration. Since the former may also depend on the
monomer concentration to some extent, the order of the reaction with respect to
the monomer may lie between 1 and 1.5. To check this a variety of solutions were
used and the rates of the initial polymerization measured, the results being set out
in table 1. From these figures it is evident that the rate of polymerization is
proportional to the first power of the monomer concentration. This fact supports the
assumption which was tacitly made in the calculation of the kinetic coefficients of
the bulk reaction, and which was reported in a previous communication.

Table 1. Effect of concentration of vinyl acetate

rate (R)             concentration of      R/C        R/C^(1/2)     R/C^(3/2)
(g.mol./l./min.)     monomer (C)
                     (g.mol./l.)
0.0410               4.5                   0.0091     0.0193        0.0043
0.0279               3.0                   0.0093     0.0161        0.0053
0.0142               1.5                   0.0095     0.0116        0.0077
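The choice between orders 1/2, 1 and 3/2 in the monomer can be made by asking which ratio R/C^n is most nearly constant over the three solutions of table 1. The sketch below computes the spread (largest value divided by smallest) for each trial order; the function name `spread` is hypothetical.

```python
rates = [0.0410, 0.0279, 0.0142]   # R, g.mol./l./min. (table 1)
concs = [4.5, 3.0, 1.5]            # C, g.mol./l.


def spread(order):
    """Max/min of R / C**order over the rows; 1.0 would mean exactly constant."""
    values = [r / c ** order for r, c in zip(rates, concs)]
    return max(values) / min(values)


spreads = {order: spread(order) for order in (0.5, 1.0, 1.5)}
best_order = min(spreads, key=spreads.get)

for order, value in spreads.items():
    print(f"order {order}: spread of R/C^n = {value:.2f}")
print(f"most nearly constant for order {best_order}")
```

The first-order ratio varies by only a few per cent over a threefold change of concentration, while the other two vary by well over half, in line with the conclusion drawn in the text.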

The intensity of the radiation was varied by means of perforated metal screens, the
transmission of which was known, and the rate of polymerization was determined,
using, in all cases, solutions containing 1.5 g.mol. of vinyl acetate per litre. The
results, which are tabulated in table 2, gave a value of 0.51 as a mean for the intensity
exponent. Thus the termination process is the same as before, i.e. by the interaction
of two of the active polymer molecules.



Figure 1. Course of polymerization of vinyl acetate in ethyl acetate solution.

Table 2. Intensity dependence

relative        relative      intensity
intensity       rate          exponent
0.42            0.70          0.50
0.25            0.48          0.52
0.16            0.39          0.52
0.06            0.25          0.49


Earlier it was mentioned that in solution there is the possibility of transfer. Even
if this does occur it will not invalidate our conclusions, as the following analysis
shows. The sequence of reactions will be:


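M → P_1                       f(I)                I
P_n + M → P_{n+1}             k_{p_n}             II
P_n + XS → M_n + S-           k_{x_n}             IIIa
S- + M → P_1                  k_f                 IIIb
2S- → ?                       k_s                 IV
P_n + P_s → M_n + M_s         k_{t_n} k_{t_s}

The stationary state equations will be

$$\frac{d(P_1)}{dt} = f(I) + k_f(M)(S^-) - k_{p_1}(P_1)(M) - k_{x_1}(P_1)(XS) - k_{t_1}(P_1)\sum k_{t_s}(P_s) = 0,$$

$$\frac{d(P_2)}{dt} = k_{p_1}(P_1)(M) - k_{p_2}(P_2)(M) - k_{x_2}(P_2)(XS) - k_{t_2}(P_2)\sum k_{t_s}(P_s) = 0,$$

and, in general,

$$\frac{d(P_n)}{dt} = k_{p_{n-1}}(P_{n-1})(M) - k_{p_n}(P_n)(M) - k_{x_n}(P_n)(XS) - k_{t_n}(P_n)\sum k_{t_s}(P_s) = 0,$$

and also the equation which will govern the concentration of the free radicals formed
from the solvent in the transfer reaction will be

$$\frac{d(S^-)}{dt} = (XS)\sum k_{x_s}(P_s) - k_f(M)(S^-) - k_s(S^-)^2 = 0.$$

Now if transfer is to occur readily then it is obvious that reaction IIIb is to be very
much faster than reaction IV, so that the last term of this equation can be omitted.
Hence the concentration of the free radicals will be given by

$$(S^-) = \frac{(XS)\sum k_{x_s}(P_s)}{k_f(M)}.$$

Adding to infinity the stationary state equations given above, then

$$f(I) + k_f(M)(S^-) - (XS)\sum k_{x_s}(P_s) - \Big\{\sum k_{t_s}(P_s)\Big\}^2 = 0.$$

Substituting the value of $(S^-)$ reduces this expression to

$$f(I) - \Big\{\sum k_{t_s}(P_s)\Big\}^2 = 0.$$

Now the rate of reaction will be given by

$$-\frac{d(M)}{dt} = f(I) + k_f(M)(S^-) + (M)\sum k_{p_s}(P_s).$$

Substituting the value of $(S^-)$ and considering the chains to be long, when it is
possible to neglect the first term, this will give

$$-\frac{d(M)}{dt} = (XS)\sum k_{x_s}(P_s) + (M)\sum k_{p_s}(P_s).$$

If the usual assumption that the ratio of the coefficients is independent of the
molecular size is made, then

$$\sum k_{x_s}(P_s) = \frac{k_x}{k_t}\sum k_{t_s}(P_s)$$

and

$$\sum k_{p_s}(P_s) = \frac{k_p}{k_t}\sum k_{t_s}(P_s),$$

which will give the rate of reaction as

$$-\frac{d(M)}{dt} = \frac{k_x(XS) + k_p(M)}{k_t}\sum k_{t_s}(P_s),$$

and substituting the value of $\sum k_{t_s}(P_s)$ in this gives the final result

$$-\frac{d(M)}{dt} = \frac{k_x(XS) + k_p(M)}{k_t}\,\{f(I)\}^{1/2}.$$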

It will, therefore, be seen that, provided the chains are eventually broken by the 
mutual reaction, the overall rate of reaction is still proportional to the square root 
of the light intensity. 

The rate of polymerization was determined at temperatures ranging from 31 to 
63° C, and gave the results shown in table 3. 


Table 3. The energy of activation

temperature      rate            E
(° K)            (arbitrary)     (kcal./g.mol.)
304.6            0.84
313.6            1.06            4.45
326.0            1.44            4.55
336.3            1.79            4.28

E = 4.42 kcal./g.mol.


These results indicate that the photopolymerization of vinyl acetate in ethyl
acetate solution follows a course which is identical with that obtained in the case
of the bulk polymerization of the monomer under the same conditions. In view of
the fact that the structures of the solute and solvent are so similar this is not
altogether surprising. The results which have been given here tend to indicate that the
light responsible for the polymerization is absorbed completely within the reaction
system, since otherwise the rate of reaction would have been proportional to some
power of the monomer concentration greater than unity, as has been pointed out in
a previous paper.

The rates obtained in this case are of the same order as those obtained in the
polymerization of the pure monomer, so that it is unlikely that transfer can occur
to any great extent. This is not wholly surprising, for the only method whereby
transfer could occur would be by the removal of a hydrogen atom from the ethyl
acetate, which would be a relatively difficult matter. It is well established that
compounds containing only carbon, hydrogen and oxygen are not efficient transferring
agents. Thus it may be said that transfer, if it occurs at all, does not do so to any
appreciable extent.



G. M. Burnett and H. W. Melville

Polymerization in n-hexane solution

The course of the photopolymerization of n-hexane solutions of vinyl acetate is
shown in figure 2. Almost immediately the light is switched on an opalescence is
seen in the reaction tube and, as the time of radiation increases, a definite amount of
polymer is formed on the walls and does not remain in suspension indefinitely.
From figure 2 it will be seen that the rate of polymerization remains constant until
a certain amount of polymer has been formed, when the rate increases abruptly and
continues to increase over a considerable period. The reason for this phenomenon
will be discussed in detail later. After a further period the rate begins to fall off as
would be expected from the decrease in the concentration of the monomer.



Figure 2. Course of polymerization of vinyl acetate in n-hexane solution (abscissa: time in hours).


Table 4. Dependence of rate on concentration (n-hexane)

concentration of
monomer (C)         rate
(g.mol./l.)         (R)        R/C        R/C^(1/2)    R/C^(3/2) × 10²
    6.0            0.0143     0.0023      0.0059         0.098
    4.5            0.0102     0.0023      0.0048         0.107
    3.0            0.0073     0.0024      0.0042         0.140
    1.5            0.0036     0.0024      0.0029         0.196


Concentrations of monomer from 1.5 to 6 g.mol./l. were used, and for purposes of
comparison only the initial stages of the reaction were considered, i.e. up to about
5 % conversion of monomer. This removes any ambiguity which might be attributed
to the abnormal stage of the reaction. The results of the experiments are set out in
table 4. From these figures it is obvious that the reaction is again of first order with
respect to the monomer concentration, at least during the initial stages of the
reaction.
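
The first-order conclusion can be checked numerically from the figures of table 4. The following is a minimal sketch in Python, not part of the original work: the function name `reaction_order` is illustrative, and the tabulated rates are assumed to be in consistent arbitrary units. It fits log R against log C by least squares, the slope being the apparent order, and also forms the ratio R/C, which should be roughly constant for a first-order reaction.

```python
import math

# Table 4: monomer concentration C (g.mol./l.) and initial rate R (arbitrary units).
concentrations = [6.0, 4.5, 3.0, 1.5]
rates = [0.0143, 0.0102, 0.0073, 0.0036]


def reaction_order(conc, rate):
    """Least-squares slope of ln(rate) against ln(conc), i.e. the apparent order."""
    xs = [math.log(c) for c in conc]
    ys = [math.log(r) for r in rate]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    return sxy / sxx


order = reaction_order(concentrations, rates)
ratios = [r / c for r, c in zip(rates, concentrations)]

print(f"apparent order with respect to monomer = {order:.2f}")
print("R/C =", ", ".join(f"{q:.4f}" for q in ratios))
```

A slope near unity, together with a nearly constant R/C, is what a first-order dependence requires; the other two ratios of table 4 vary by a factor of about two over the same range.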

As before, the variation of the rate with the light intensity was measured by
changing the intensity by means of perforated screens of known transmission, since
the inception and extent of the gel stage of the reaction depend on the light intensity,
as will be seen from figure 3, in which the upper curve is for high intensity. Here it
is evident that the lowering of the molecular weight due to the high rate of initiation
has affected the inception of the gel stage to some extent. This will be discussed in
greater detail later. The results of the experiments are recorded in table 5. Again it
will be seen that the rate of polymerization is proportional to the square root of
the intensity of the absorbed radiation, since the intensity exponent is 0.50. This is
not particularly surprising in view of the inert nature of the solvent.
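
The intensity exponent can be recomputed from the relative intensities and rates of table 5. The sketch below is an illustration only and makes one assumption not stated in the text: that both quantities are referred to unity at full illumination, so that the exponent x in rate = intensity^x is simply log(rate)/log(intensity). The function name `intensity_exponent` is hypothetical.

```python
import math

# Table 5: relative intensity and relative rate
# (assumed to be referred to unit intensity and unit rate at full illumination).
relative_intensity = [0.42, 0.25, 0.16, 0.06]
relative_rate = [0.69, 0.51, 0.41, 0.25]


def intensity_exponent(intensity, rate):
    """Exponent x in rate = intensity**x, referred to unit intensity and unit rate."""
    return math.log(rate) / math.log(intensity)


exponents = [
    intensity_exponent(i, r) for i, r in zip(relative_intensity, relative_rate)
]

for i, x in zip(relative_intensity, exponents):
    print(f"relative intensity {i:.2f}: exponent = {x:.2f}")
```

Under this assumption the exponents all lie close to one half, as the square-root law requires.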



Figure 3. The effect of light intensity on the gel stage. Curve I, very high intensity.
Curve II, low intensity.


Table 5. Intensity dependence in n-hexane solution

relative intensity     relative rate     intensity exponent
      0.42                 0.69                0.48
      0.25                 0.51                0.50
      0.16                 0.41                0.49
      0.06                 0.25                0.49




The variation of the rate of reaction with temperature was measured over a
considerable range of temperature and again only in the initial stages of the reaction.
This is again absolutely necessary, since it is certain, as will be shown, that the
abnormal stage in the reaction does depend on the temperature. The results are
shown in table 6, while the usual logarithmic plot of the results gives the energy of
activation as −1.30 kcal./g.mol.


Table 6. The energy of activation in n-hexane solution

temperature (°K)     rate (arbitrary units)     log (relative rate)
     302.6                  1.61                      0.086
     309.0                  1.53                      0.066
     320.0                  1.43                      0.033
     333.1                  1.32                      0.000

E = −1.30 kcal./g.mol.
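
The logarithmic plot of table 6 can be reproduced numerically. The following is a minimal sketch in Python, assuming the simple Arrhenius form rate = A exp(−E/RT); the function name `activation_energy` and the least-squares treatment are illustrative choices, not necessarily the procedure used for the value quoted above.

```python
import math

R_GAS = 1.987e-3  # gas constant in kcal. per g.mol. per degree

# Table 6: temperature (deg. K) and initial rate (arbitrary units).
temperatures = [302.6, 309.0, 320.0, 333.1]
rates = [1.61, 1.53, 1.43, 1.32]


def activation_energy(temps, rate):
    """Fit ln(rate) against 1/T by least squares and return E in kcal./g.mol."""
    xs = [1.0 / t for t in temps]
    ys = [math.log(r) for r in rate]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    slope = sxy / sxx  # equals -E/R for an Arrhenius dependence
    return -R_GAS * slope


energy = activation_energy(temperatures, rates)
print(f"E = {energy:.2f} kcal./g.mol.")
```

Because the rate falls as the temperature rises, the fitted slope is positive and the energy of activation comes out negative, close to the value given with table 6.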


The negative value of the energy of activation is of considerable interest. In
gas-phase polymerizations it is almost universally true that the overall energy of
activation is negative. This case forms a parallel to the vapour-phase case in that
the polymer, when formed, is precipitated, and hence the reaction must to a certain
extent cease to be occurring wholly in the one phase. The solution case is relatively
understandable, since the overall energy of activation must be made up of a large
number of exp (E/RT) terms to cover the precipitation of the polymer, the
adsorption of the monomer on the polymer, etc., as well as the usual terms for
initiation, propagation and termination.

The molecular weights of the polymer formed in the initial stages of the reaction
(up to 5 % polymerization) and for the almost completed reaction (90 %) were
found viscosimetrically using the value of the constant in the Staudinger equation
given by McDowell & Kenyon (1940). These results are shown in table 7.

Table 7. Molecular weight of polymer from n-hexane solution

                                   intrinsic viscosity
stage of reaction                     (in benzene)        molecular weight
before gel effect                        0.297                11,400
after completion of reaction             0.405                15,600

This method of determination of the molecular weights was adopted, since there was
no need to know the value of the number average molecular weight, but merely to
check that the effect was of the same nature as that found by Norrish & Smith. Since
the low molecular weight polymer is still present in the final polymer, it is obvious
that there must have been a considerable increase in the molecular weight of the
polymer formed in the later stages of the reaction. If the Staudinger equation holds
then it is easy to show that



where $\eta_1$ and $\eta_2$ are the viscosities of the two fractions of concentrations $c_1$ and
$c_2$ respectively. If the concentration of the low molecular weight material is taken as
5 %, it is possible to obtain a somewhat crude estimate of the molecular weight of
the final polymer formed. The value which is obtained by substituting the appropriate
figures in the above equation is about 20,000. This shows a considerable
increase over the value obtained in the earlier stages of the reaction. Hence it may
be concluded that this is a real gel polymerization.


Life time of the active particle in n-hexane

To obtain the life time of the active polymer in n-hexane the sector method,
described in a previous paper, was used. The procedure had to be simplified somewhat
on account of the complex nature of the reaction. It is not possible, as previously,
to vary the speed of the sector throughout a single run, but, as was pointed out, this
is not necessary, nor, in this case, is it particularly desirable. It will obviously be
necessary to conduct complete runs with the sector running at a constant speed
throughout, in order to arrive at results which will explain the curves obtained
in the gel stage of the polymerization. The sector was run at a speed to give a flash
of light of 7.56 msec. duration (a value which is considerably less than the life time
of the active particle) and at a speed corresponding to 140 msec., when a rate
lower than the former was obtained. By interpolation on the graph for the
theoretical function, a value for the absolute life time of the particle can be found.



Figure 4. Sector experiments. Curve I, flash duration = 7.56 msec.
Curve II, flash duration = 140 msec. (Abscissa: time in min.)

Figure 4 shows the shape of the curves obtained. On account of the low light
intensity the inception of the gel stage was naturally very rapid, but the curves are
clearly enough defined to make the necessary calculations possible. It will be seen that
in curve II, in which the sector was rotating slowly, the initial rate of polymerization
is much less than in curve I, but eventually the two curves run parallel to one
another. This obviously means that the value of the life time of the active polymer
changes from somewhat less than 140 msec. to a value greater than this.

From the analysis given in the previous paper it is possible from these results to
determine the life time of the active species and hence to compute the values of the
kinetic coefficients of the reaction. The change in the initial rates of polymerization
corresponds to a value of 81 msec. for the life time, and we estimate the life time at
some stage in the gel reaction to be 170 msec. This is reasonable, since it is known
that the life time will increase throughout this stage from the fact that it appears
from the results to do so continuously. This means that the rates in the initial and
final stages of the reaction should be in a ratio of the order of 1:4. This qualitatively
fits the facts, but it still remains to calculate the coefficients of the propagation and
termination reactions, $k_p$ and $k_t$ respectively. The rate of polymerization which can
be deduced from the contraction of the monomer during polymerization is
$5.0 \times 10^{-6}$ g.mol. l.$^{-1}$ sec.$^{-1}$, and the rate of initiation is $6.0 \times 10^{-8}$ g.mol. l.$^{-1}$ sec.$^{-1}$.
Applying Melville & Jones's (1940) approximate equation

$$-\frac{d(P)}{dt} = \frac{(P)}{\tau},$$

where (P) is the concentration of the active particles and $\tau$ is the life time, then

$$(P) = 4.8 \times 10^{-9}\ \text{g.mol./l.}$$

The overall rate of polymerization will be given by the equation

$$-\frac{d(M)}{dt} = k_p(P)(M),$$

which by substitution of the appropriate numbers gives a value of $7.0 \times 10^{2}$ for $k_p$.
In the same manner the value of $k_t$ is found to be $2.6 \times 10^{9}$ l. g.mol.$^{-1}$ sec.$^{-1}$.
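
The arithmetic of this paragraph can be retraced in a few lines. The sketch below is an illustration under two stated assumptions: that in the stationary state the rate of removal of active particles, (P)/τ, equals the rate of initiation, and that termination is a mutual (second-order) reaction, so that k_t(P)² = (P)/τ. The function name `stationary_state` is hypothetical, and the rounding differs slightly from that of the text.

```python
def stationary_state(initiation_rate, life_time):
    """Return ((P), k_t) from the initiation rate and the life time.

    Assumes (P)/tau = rate of initiation (stationary state) and mutual
    termination, k_t * (P)**2 = (P)/tau; both are labelled assumptions.
    """
    radical_conc = initiation_rate * life_time   # g.mol./l.
    k_t = 1.0 / (life_time * radical_conc)       # l. g.mol.^-1 sec.^-1
    return radical_conc, k_t


INITIATION_RATE = 6.0e-8  # g.mol. l.^-1 sec.^-1, as quoted in the text

# Initial (normal) stage: life time 81 msec.
p_normal, kt_normal = stationary_state(INITIATION_RATE, 81e-3)
# Gel (abnormal) stage: life time 170 msec.
p_gel, kt_gel = stationary_state(INITIATION_RATE, 170e-3)

print(f"normal stage: (P) = {p_normal:.2e} g.mol./l., k_t = {kt_normal:.2e}")
print(f"gel stage:    (P) = {p_gel:.2e} g.mol./l., k_t = {kt_gel:.2e}")
```

The normal-stage figures agree with the quoted concentration of active particles and termination coefficient to within rounding, and the gel-stage life time gives a termination coefficient several times smaller.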

Proceeding in the same way the values of the coefficients after the inception of the
gel stage can be calculated. These results, together with those already published, are
shown in table 8.


Table 8. Kinetic coefficients for vinyl acetate polymerization

                                       bulk            solution        solution
                                       reaction        (normal)        (abnormal*)
overall rate (g.mol./l./sec.)          5.5 × 10⁻⁶      5.0 × 10⁻⁶      1.1 × 10⁻⁶
initiation rate (g.mol./l./sec.)       1.6 × 10⁻⁷      6.0 × 10⁻⁸      6.0 × 10⁻⁸
k_p (l. g.mol.⁻¹ sec.⁻¹)               6.7 × 10²       7.0 × 10²       6.8 × 10²
k_t (l. g.mol.⁻¹ sec.⁻¹)               2.5 × 10⁹       2.6 × 10⁹       5.0 × 10⁸
τ (msec.)                              61              81              170

* After 5 % polymerization.


Gel polymerization processes



In the catalytic polymerization of methyl methacrylate it was found that, even
if the reaction was strictly isothermal, after a certain amount of polymerization the
rate of conversion of the monomer increased rapidly. At the same time there was
an increase in the degree of polymerization. As was explained before, because of the
formal similarity between the reaction and the usual type of explosion reaction this
has been termed the explosion stage, but to avoid unnecessary confusion in ideas,
we have adopted the term gel polymerization. The increase in the degree of
polymerization indicated quite clearly that the increase in the rate could not be due to
an increase in the rate of initiation, since it is known that this is accompanied by a
decrease in the molecular weight. Also it is not easy to conceive how it is possible
for the rate of the propagation to increase. Hence it appears to be possible that
the reaction step which is responsible for the increased rate is the termination
process.

The gel stage can be brought about very readily by the introduction of solvents.
Norrish & Smith (1942) investigated this effect using a range of solvents which were
both good and bad solvents for the polymer. They found that the inception of the
gel stage was earlier the poorer the solvent, and was most marked in the case in
which the solvent acted as a precipitant for the polymer. Thus in solvents such as
heptane the increase in rate set in very early, while in more efficient solvents more than
50 % conversion was required. The investigation was carried a stage further by
Trommsdorff (1944), who dissolved in pure methyl methacrylate quantities of
cellulose tripropionate. This had the effect of increasing the viscosity of the monomer
and caused a much earlier appearance of the gel stage of the polymerization.

The inception and extent of the gel stage in the polymerization was found to
depend on (a) the concentration of the catalyst and hence the rate of starting of
chains and the degree of polymerization, and (b) the temperature. In the case of
vinyl acetate photopolymerization in n-hexane solution, runs at different intensities
showed that the higher the light intensity the more delayed was the inception of the
gel stage, as will be seen from figure 3. The raising of the temperature had the same
effect, and at sufficiently high temperatures the increased rate stage of the reaction
could be eliminated altogether (figure 5). The increasing of the intensity of radiation
(or of the catalyst concentration) will lead to a decrease in the degree of
polymerization, and so, in the case of the polymerization of the pure monomer, lead to a decrease
in the viscosity of the reaction medium, as would an increase in the temperature.
Hence it is obvious that the viscosity of the medium must play an important part
in the course of the reaction. In the case of solvents it is evident that the formation
of a gel or the coiling of the molecule is the feature which will be of most importance.

The current theory due to Trommsdorff, Schulz and others is that the increase in
the viscosity of the medium either in solution or in pure monomer will put
considerable restriction on the motion of the growing polymer molecules. The result of this
restriction is that the active ends are unable to make contact and so break the
reaction chains. The rate of the diffusion of the monomer molecules to the active
ends will not be seriously affected by the change in the medium, so that the number
of collisions between the monomer and the active end will remain virtually
unchanged. Thus the propagation reaction is not seriously affected by the change in
the medium. On the other hand, the lowering of the number of collisions between the
active ends will reduce the termination rate, so that both the increase in the rate and
the degree of polymerization can be accounted for.

The degree to which the gel polymerization will be shown by monomers will
depend on how sensitive the polymer is to the restriction in its motion. If the polymer
is not abnormally kinked the active end will be subjected to severe restriction in
motion even if the actual restriction is applied at a relatively remote point. With
kinked polymers the effect will not be so great. This probably accounts for the fact
that the phenomenon is so clearly marked in the case of the methyl methacrylate,
in which it is known that the polymer is not severely kinked.



Figure 5. Effect of temperature on the gel stage. Temperature = 65° C.

In poor polymer solvents the polymer must be severely coiled so that it is most
likely that a similar state of affairs will exist. The coiling of the molecule will prevent
the active ends from coming into contact with the normal frequency, so that the
rate of chain breaking will be reduced. From the results which were given in the
previous section it is evident that the above theory is tenable. The value of the
velocity coefficient for the propagation step is not changed to any measurable
extent during the abnormal part of the reaction and is not very much different from
the value obtained in the case of the bulk polymerization. On the other hand, the
value of the termination coefficient falls considerably, in our case by a factor of
about 5. This means that the rate of polymerization should have increased by a
factor of the order of 2.3, since the rate is inversely proportional to the square root
of the termination velocity coefficient. This is also verified from the results which
have been obtained. Trommsdorff, in his paper, applies this theory to explain the
reason for the very high rates and degrees of polymerization encountered in the case
of emulsion polymerization processes.
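
The factor quoted above follows directly from the inverse square-root dependence of the rate on the termination coefficient. As a brief illustrative sketch, assuming that the rate of initiation and the propagation coefficient are the same in the two stages (the variable names are hypothetical, and the coefficients are those quoted for the solution reaction):

```python
import math

# Termination coefficients for the solution reaction (l. g.mol.^-1 sec.^-1).
k_t_normal = 2.6e9   # before the gel stage
k_t_gel = 5.0e8      # during the gel stage

# With initiation and propagation unchanged, rate is proportional to k_t**(-1/2),
# so the rate increases by the square root of the ratio of the two coefficients.
rate_factor = math.sqrt(k_t_normal / k_t_gel)

print(f"k_t falls by a factor of {k_t_normal / k_t_gel:.1f}")
print(f"expected increase in rate = {rate_factor:.2f}")
```

A fall of about fivefold in the termination coefficient thus corresponds to an increase in rate of a little over twofold.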




The value of the temperature coefficient in the reaction can be explained by the
fact that as the temperature is increased the velocity of the reaction may also
increase, but this is offset by the fact that the stiffness of the gel or the viscosity of
the solution is lowered, thereby decreasing the velocity, which will be controlled
primarily by the stiffness of the gel. This will mean that the reaction will not be so
temperature dependent as in the case of the bulk polymerization, and may, indeed,
lead to a negative value of the temperature coefficient. The idea of the formation
of a new phase having an effect on the course of the reaction is not new. As has been
suggested, this must definitely happen in all vapour-phase polymerization processes,
so that it is not wholly surprising that in these reactions the value of the overall
energy of activation is negative.

One of us (G.M.B.) is indebted to Distillers Co. Ltd. for financial assistance 
(1945-6). 


References 

Jones, T. T. & Melville, H. W. 1940 Proc. Roy. Soc. A, 175, 392.

McDowell, W. H. & Kenyon, W. O. 1940 J. Amer. Chem. Soc. 62, 415.

Mayo, F. R. 1943 J. Amer. Chem. Soc. 65, 2324.

Melville, H. W. & Gee, G. 1944 Trans. Faraday Soc. 40, 240.

Norrish, R. G. W. & Smith, R. R. 1942 Nature, 150, 336.

Schulz, G. V. & Blaschke, F. 1941 Z. phys. Chem. 50, 305.

Schulz, G. V. & Husemann, E. 1939 Z. phys. Chem. B, 43, 385.

Suess, H., Pilch, K. & Rudorfer, H. 1937 Z. phys. Chem. A, 179, 361.

Trommsdorff, E. 1944 Colloquium on high polymers, Freiburg. See B.I.O.S. Rep. no. 363,
item no. 22.





The dehydration of ammonium, potassium and 
some mixed alums 


By G. P. Acock, W. E. Garner, F.R.S., J. Milsted and H. J. Willavoys

(Received 2 August 1946)

[Plates 12 to 14] 

The shapes of the dehydration nuclei and their rates of growth have been investigated in
hard vacuum and in the presence of water vapour. The activation energies and B-factors for
the interface reaction in hard vacuum are those normally expected for solid reactions
occurring at room temperature. The thermodynamic heat of dissociation is within ± 0.5 kcal. of
the heat of activation, indicating a very close coupling between the process of evaporation
of water and the growth of the new solid phases. The main process occurs layer by layer along
the 100 planes of the hydrate. There is evidence of the production of a lattice defect of water
in the hydrates before nuclei are formed.

Water vapour slightly accelerates the rate of reaction at very low water pressures, but the 
main effect at higher pressures is a retardation which is probably due to the blocking of 
active areas in the interface by adsorbed molecules of water. 

In mixed potassium and chromium alums, the layer growth is inhibited and dehydration 
occurs ahead of the normal interface, as if the hydrates were behaving as zeolites. The 
reaction can occur in several stages and very complex phenomena are observed. 

Explanations are given of the abnormal reactions occurring with some hydrates, in which
there is a breakdown of the close coupling between the dehydration process and the
formation of the new solid phases.


Introduction

During the dehydration of salt hydrates in a vacuum or over a dehydrating agent,
nuclei are formed of a new solid phase on the surfaces of the crystals. The majority
of recent investigations have been concerned with the study of the rate and of the
mechanisms of the reactions occurring at the interface between the new solid phase
and the original hydrate. The reaction is followed by measuring the change in
weight of the crystal during dehydration, or by observation of the rate of movement
of the interface. The activation energy for the evaporation of water from the
interface and the B-factor in the equation $k = Be^{-E/RT}$ have been obtained for a number
of hydrates.

In some of the examples the values of E and B are of the order normally expected
for solid reactions occurring at room temperature. For copper sulphate pentahydrate
(Topley 1932a) and nickel sulphate pentahydrate (Garner & Southon 1935), the
B-factor lies in the region 10^^ and 10^^ mol./cm.²/sec. In the case of the former,
E = 18.25 kcal., and is of a similar order of magnitude to the thermodynamic heat
of dissociation, viz. 12.6 kcal. Carbonates dissociate similarly to hydrates, and in a
number of examples the heat of dissociation is approximately the same as the heat
of activation for the loss of carbon dioxide (Zawadzki & Bretsznajder 1938). For
silver carbonate the same is approximately true (Spencer & Topley 1929) and the
B-factor is normal. There would appear, therefore, to be one class of dissociations
for which the B-factor is of normal magnitude and the heat of dissociation is
approximately the same as the heat of activation.

On the other hand, calcium sulphate hexahydrate (Topley & Hume 1928) gives a
high activation energy, E = 38 to 40 kcal. and a B-factor of 10 ^«, and for chrome alum
(Cooper & Garner 1940) E = 31 kcal. as compared with 11 kcal. for the heat of
dissociation and a B-factor of 10^®. Potassium hydrogen oxalate (Hume & Colvin
1929) also gives an abnormally high value for the activation energy, E = 40 kcal.
No satisfactory explanation has yet been given of this abnormal behaviour.

If the crystals have been carefully grown and the surfaces are free from scratches,
there is normally an induction period before nuclei make their appearance. In
extreme cases, hydrates have been kept for several days in a high vacuum in the
neighbourhood of room temperature without visible change occurring. This indicates
that the activation energy for the removal of single water molecules from the surface
of the crystal must be very high, but so far there has been no accurate estimate of
its magnitude for any hydrate. This is in marked contrast to the removal of water
from the interface, where the activation energy may be as low as the heat of
dissociation.

The nuclei frequently adopt striking geometrical patterns with sharply defined
contours. An extended investigation into the shapes of nuclei on the known faces of
copper sulphate pentahydrate (Garner & Pike 1937) showed that the shapes on
different crystal faces are often markedly different, and the work indicated that growth
along certain crystal axes was preferred. For nickel sulphate heptahydrate linear
nuclei were obtained on the surface, which spread into the interior of the crystal
along a preferred plane. On the other hand, in the case of chrome alum (Cooper &
Garner 1940) hemispherical nuclei are obtained, where the rate of growth of the
nuclei is the same for all crystal directions.

A factor which complicates the study of the rates of growth of nuclei is that of
the impedance to the escape of water, due to a fine state of division of the solid
product. This may lead to a decrease in the rate of growth as the thickness of the
layer of product increases. Impedance is marked in the case of copper sulphate
pentahydrate (Topley 1932a), where the product has been shown to be amorphous
when examined by X-rays (Kohlschütter & Nitschmann 1931). Manganous sulphate
bihydrate also gives an amorphous product (Smith & Topley 1935; Volmer &
Seidel 1937), which crystallizes in the presence of water vapour, and in this case a
very abnormal rate of growth curve is obtained in the presence of water vapour, as
shown when the rate of growth is plotted against pressure of water vapour. This is
not, however, the general rule, for in other cases, notably the alums, there is no
appreciable impedance to the escape of water vapour due to increasing thickness
of product. The phenomena which are observed must depend on the compactness of
the layer of solid product. If this is free from cracks or capillary passages of large
size, then some impedance might be expected.

In the present investigation a number of alums has been examined, with a view
to a study of the interrelationship of the various factors, e.g. B-factor, activation
energy, impedance and shape of nuclei. It was hoped thereby to fill in certain gaps
in the body of information on nuclei formation and growth, and to attempt a broad
generalization of the nature of the mechanism of the dehydration processes. The
dehydration of potassium and ammonium alums and solid solutions of potassium
aluminium and potassium chromium alums have been studied, using the technique
developed by Cooper & Garner (1940). Measurements have been made of the rate
of growth of nuclei, and of the rate of penetration of the interface into the interior
of the solid for a range of temperatures. The thermodynamic heat of dissociation
has been measured for ammonium alum. The shape of the nuclei has been
investigated on the different faces of the alum crystals, and in this connexion some work
has been done on rubidium and caesium alums, and on mixed alums containing
common alum and chrome alum in solid solution. The effect of variation of water
pressure on the rate of reaction has also been examined.


Experimental

Measurements of the rates of growth and observations on the shapes of nuclei
have been made in the apparatus previously described in the paper on chrome alum
(Cooper & Garner 1940). The crystals were grown in a dust-free crystallization
chamber, dehydration of the solutions being effected over solid sodium hydroxide.
They were grown slowly and only crystals with perfect faces were used. They were
drained from excess of solution and dried by the method previously described and
stored in desiccators over the powdered alum. Crystals of 1 to 2 cm. across were
employed.

Shapes of nuclei formed in vacuum 

The nuclei formed on the 111 face of the crystals are roughly hexagonal in shape,
with marked diagonal cracks. Typical nuclei for the series ammonium, potassium,
rubidium and caesium alums are shown in figures 1 and 3, plates 12 and 13. For
ammonium, potassium and rubidium alum, the alternate sides of the hexagon are
concave and the other three sides give very nearly a straight edge, but with an
occasional tendency to concavity and with an overhanging haze. The haze, under
higher power, gives the appearance of being composed of very fine lines. The nuclei
for caesium alum are somewhat more complex in structure, since three of the faces
of the hexagon are poorly developed (figure 3b). The nuclei all possess trigonal
symmetry and are aligned with their straight edges parallel to the edges of the 111 face;
hence they form an approximate reproduction of the octahedral face of the crystal.

On the cubic face, 100, the nuclei are square with diagonal cracks, giving the
appearance of a square pyramid (figure 1c, d). The sides of the square are parallel
to the intersection of the cubic face with the octahedral face. On the dodecahedral
face, 011, the nuclei are approximately rhombic with two of the corners rounded
(figure 1e, f). The shapes of the nuclei in relation to the geometry of the crystal are
shown in figure 2.



The cracks in the nuclei possess some significance, for their formation indicates
a regular shrinkage of the product, due to recrystallization. This can be readily
interpreted if the microcrystals of the product were regularly arranged with respect
to the edges of the nuclei. This is what would be expected to occur if the dehydration
takes place preferentially along one set of planes (100) of the hydrate, and the product
was built up along these planes. A mass of similarly oriented crystals would be
produced which, on recrystallization, would contract in a regular manner.

For ammonium alum, slow-growing spherical nuclei were also formed, especially
over the lower temperature range. In this type of nucleus there are no cracks. The
microcrystals of the product are therefore less regularly arranged and when
recrystallization occurs there is no shrinkage of the mass as a whole, the changes being
accommodated between the grains. It is significant that the spherical type of growth
might be expected to yield microcrystals arranged with their axes pointing in all
directions.



Effect of water vapour on shapes of nuclei 

As the water vapour pressure over the crystal increases, the nuclei gradually lose
their hexagonal shape and finally become very irregular on the periphery (figure
3d, e and f, plate 13). At the same time the cracks disappear and the product becomes
visibly crystalline in character, indicating that water vapour facilitates the growth
of the microcrystals, linking them together irregularly and thereby preventing
shrinkage. In the presence of water vapour there is no longer a marked preference
for growth along the 100 planes. In figure 3c, plate 13, a photograph is shown of the
nuclei on iron ammonium alum, produced in hard vacuum, which resemble those
of other alums when grown in the presence of water vapour.

Nuclei on the surfaces of mixed alums 

A study was made of the nuclei formed on five solid solutions of potassium alum
and chrome alum, and typical photographs are given in figure 4, plate 14. It was
observed that as the percentage of chromium increases, nucleation becomes more
difficult, especially over the lower temperature range (16 to 30° C), and that at the
same time there is a marked increase in the temperature coefficient for nuclei
formation. The hexagonal structure and the radial cracks of the nuclei on the 111
face are absent and spherical nuclei with concentric cracks are obtained. The
growth of these nuclei is discontinuous, wisps of material being developed ahead of
the interface at certain stages during the growth. These isolated regions grow both
forwards and backwards but never quite join up with the original interface, and they
give rise ultimately to what is apparently a concentric crack. Typical nuclei for
30, 40 and 50 % chromium are given in figure 4a, b and c, plate 14, and it will be seen
that they all have the same character. At 68 % chromium, the nuclei become very
complex and their character depends on the temperature of dehydration. At
35 to 40° C spherical nuclei are formed which resemble those described above in their
method of growth, but a mosaic structure is developed in the nuclei (figure 4d).
At 30° C and below a new type of nuclei is obtained, whether the nucleation is
started above 30° C or not. This is a 'cluster' type of nucleus which is sometimes set
in a paler matrix. There are apparently two separate processes of dehydration, the
interrelationship of which has not been fully worked out; there is a rapid spread of
dehydration process with a very faint outline and in this whiter nuclei are
subsequently formed (figure 4e). Sometimes the faint dehydration is not seen and
nuclei similar to those in figure 4f are obtained. In this series are also obtained
nuclei which show evidence of Liesegang phenomena occurring in the solid product
(figure 4g).

When the percentage of chromium is still greater (86 %), the nuclei resemble those 
obtained with chrome alum, giving slow and fast growing spherical nuclei. There 
is, however, evidence of the presence of a mosaic structure in the faster growing 
nuclei (figure 4h).

It would appear that the replacement of aluminium by chromium leads to the
disappearance of the processes of dehydration occurring along crystal planes and
to the production of spherical nuclei in which growth has certain discontinuous
features. The nuclei show concentric cracks or, in certain cases, a mosaic structure.
Dehydration occurs ahead of the normal interface, possibly by the removal of water
without lattice collapse, as in zeolites, and then subsequently nucleation occurs in
the partially dehydrated material.



Dehydration of ammonium, potassium and some mixed alums 513

Rates of growth 

The rates of growth were measured in vacuum and also in the presence of water
vapour at pressures lower than the dissociation pressure. In vacuum there
is an initial induction period before nuclei appear, which decreases with increase in
temperature, and for ammonium alum the length of the induction period is 100 min.
at 15° C and 3 to 4 min. at 40° C. For mixtures of aluminium and chromium alums
(Cr = 68 %) there are no nuclei produced at 25° C in 24 hr., but at 35° C nuclei are
formed within a few minutes. Photographs of the nuclei were taken with a Winkel-Zeiss
microscope on a Kodak super X-X panchromatic film at intervals during an
experiment. When the nuclei were hexagonal or square in shape, they were measured
on the film along the diagonals, the average lengths of the diagonals being taken.
The plot of the diameter of a nucleus against time practically always showed an
accelerating rate of growth when the nuclei were small (figure 5), the curve becoming
linear and remaining so during the remainder of the growth. The initial rate of
growth is approximately double the final value, and the difference is too great to
be accounted for by the cooling of the crystal which occurs as the dehydration
proceeds. Also, the abruptness of the change in rate is not in accord with the
hypothesis that it is due to the self-cooling of the crystal. It was observed in the case of
the alums that nucleation ceased after a certain number of nuclei had been formed.

The temperatures of dehydration ranged from 15 to 40° C, and experiments with
several crystals from a number of batches were made at each temperature. The
average values for the rates of growth of nuclei for ammonium alum on the 111, 100
and 011 faces are summarized in table 1 and for potassium and rubidium alums in
table 2. The temperatures are uncorrected for the self-cooling of the crystal during
dehydration, but the experimentally measured self-cooling is given (see later).


Table 1. Rate of nuclei growth for ammonium alum in vacuum

face of    shape of     temp.    self-cooling    rate of growth
crystal    nuclei       (° C)    (° C)           (cm./min. × 10⁻⁴)

111        hexagonal    20       1.10            1.42
                        25       1.80            1.95
                        30       2.75            2.83
                        35       4.15            4.11
                        40       6.20            5.36
111        spherical    15       0.66            0.68
                        20       1.10            0.97
                        25       1.80            1.62
                        35       4.15            2.96
                        40       6.20            3.90
011        rhombs       25       1.80            1.39
100        square       40       6.20            6.10

Table 2. Rate of nuclei growth for potassium and rubidium alums in vacuum

face of    shape of     temp.    self-cooling    rate of growth
crystal    nuclei       (° C)    (° C)           (cm./min. × 10⁻⁴)

potassium alum
111        hexagonal    15       —               0.714
                        20       1.20            1.121
                        25       2.60            1.75
                        30       3.66            2.44
                        35       4.60            3.69
                        40       5.15            5.03
100        squares      40       5.15            4.98

rubidium alum
111        hexagonal    25       —               1.37
                        28.1     —               1.66


It will be noted that there is a decline in the rate of growth of the nuclei on the
111 face in the order NH₄, K and Rb. From the photographs in figure 1, plate 12 it
will be seen that the decrease is more marked in the direction of the concave edges
of the nuclei than along the diagonals. The atomic weight of the monovalent ion is
obviously a factor influencing the rate of dehydration. For both ammonium and
potassium alums the rate of growth along the diagonal of the 111 face is practically
identical with that along the diagonal on the 100 face. The growth for the 011 face
is definitely slower than the rate on the 111 face.




The results in water vapour were obtained by first forming nuclei in hard vacuum
and then continuing the growth in a constant pressure of water vapour. The change-over
in rate of growth on introducing water vapour was very rapid. The measured
rates of growth are given in tables 3 and 4, and the results, after correction for
self-cooling, are plotted in figure 6, together with the earlier results for chrome alum. It
will be seen from the maximum on the curve that there is some catalysis of the
rate of dehydration by water vapour at low pressures for potassium alum at 30° C,
but that the effect is less than for chrome alum. There is a doubtful trace of catalytic
effect for ammonium alum at 30° C, but there is no appreciable influence with this
hydrate at 40° C. The self-cooling of the crystals was measured and the rates of
growth have been corrected on the assumption that the temperature coefficient of
the rate of growth is not affected by water vapour.


Table 3. Rate of nuclei growth for the 111 face of ammonium alum in water vapour

temp. ° C    vapour      rate of growth       temp. ° C    vapour      rate of growth
uncorr.      pressure    (cm./min. × 10⁻⁴)    uncorr.      pressure    (cm./min. × 10⁻⁴)
             (mm.)                                         (mm.)

40           0.00        5.36                 30           0.00        2.83
             0.035       4.67                              0.014       2.80
             0.313       2.85                              0.030       2.19
             0.351       2.41                              0.054       2.14
             0.740       1.75                              0.168       1.36
             2.130       0.76                              0.476       0.66
                                                           1.950       no growth


Table 4. Rate of nuclei growth on the 111 face of potassium alum in the presence of water vapour

temp.    vapour pressure    rate of growth
(° C)    (mm.)              (cm./min. × 10⁻⁴)

30       0.0                2.44
         0.0296             2.47
         0.0966             2.17
         0.286              1.11
         0.476              0.72
         0.776              0.142



Table 5. Rate of growth on 111 face

percentage          rate of growth on 111 face (cm./min. × 10⁻⁴)
of chrome
alum          15° C    20° C      25° C        30° C        35° C     40° C

0.0           0.714    1.121      1.75         2.44         3.69      5.03
30.23         —        0.352      0.56         —            —         —
39.75         —        —          0.437        0.690        1.12      —
50.40         —        —          —            —            0.63      1.20
67.98         —        —          (3.30)       (2.40)       (4.93)    (8.20)
85.8          —        (0.927)    (2.2-4.5)    (6.2-7.7)    —         —
100.0         1.10     2.03       3.68         5.27         —         —



Rate of growth in mixed chromium-aluminium alums. The rate of growth of the
nuclei is slightly depressed as the percentage of chromium increases up to 50 %.
At higher percentages of chromium the formation of nuclei is complex in character,
and variable results for the rates are obtained due to the complex interrelationships
between two processes of dehydration. In this region 'mosaic' and 'cluster' nuclei
are obtained.

Figure 6. Rates of growth of nuclei in water vapour. (Abscissae: water-vapour
pressure in mm. for chromium alum; water-vapour pressure in mm. for potassium
and ammonium alums.)





Effect of water on the rate of dehydration 

In a hard vacuum, the quantity of water adsorbed in the interface will be small, 
especially where the interface intersects the surface of the crystal. It is shown later 
that there is no change in the impedance to the escape of water as the thickness of 
product increases, so that the quantity of water adsorbed in the interface is probably 
uniform over the whole area. Assuming that the adsorption in a hard vacuum is 
practically zero and that water retards the dehydration process by occupying 
places in the interface into which water molecules are released from the hydrate 
lattice, then it might be expected that the rate of dehydration in the presence of 
water vapour would be proportional to the places unoccupied by adsorbed water 
molecules. The fraction of places unoccupied can be calculated from the Langmuir
adsorption isotherm, for which $P_{\mathrm{H_2O}}/\theta = k'\,P_{\mathrm{H_2O}} + \text{const.}$, where $\theta$ is the fraction
of the places in the interface occupied by water. When $\theta = 1$, then the rate = 0,
and when $\theta = 0$ then $r_p = r_0$, and for intermediate values of $\theta$, $r_0 - r_p = k\theta$. A plot
of $P_{\mathrm{H_2O}}/(r_0 - r_p)$ against $P_{\mathrm{H_2O}}$ gives a straight line for the two series of experiments
with ammonium alum (figure 7), which is in agreement with the proposed mechanism.
A similar treatment has been given by Topley (1932b) and Smith & Topley (1931)
to the results obtained by them for the dehydration of copper sulphate pentahydrate 
in the presence of water vapour, and they find that there is a linear relationship 
between the rate of dehydration and the amount of water adsorbed in the interface 
(see figure 7). 
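
The linear test described above can be tried numerically. The following is a minimal sketch and not the authors' own calculation: it takes the 40° C series for ammonium alum as transcribed from table 3 (uncorrected for self-cooling, whereas the published plot uses corrected rates), forms P/(r0 - rp) and fits a straight line by ordinary least squares. The function name `linear_fit` and the variable names are illustrative assumptions.

```python
# Langmuir check: if r0 - rp is proportional to the covered fraction theta,
# then P/(r0 - rp) plotted against P should be close to a straight line.

def linear_fit(xs, ys):
    """Ordinary least-squares line y = a*x + b; returns (a, b, r_squared)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = my - slope * mx
    return slope, intercept, sxy * sxy / (sxx * syy)

# Ammonium alum, 111 face, 40 deg C (table 3, uncorrected for self-cooling).
r0 = 5.36                                        # rate in vacuum, table 3 units
pressure = [0.035, 0.313, 0.351, 0.740, 2.130]   # mm. of water vapour
rate = [4.67, 2.85, 2.41, 1.75, 0.76]            # same units as r0

y = [p / (r0 - r) for p, r in zip(pressure, rate)]
slope, intercept, r2 = linear_fit(pressure, y)
print(f"slope = {slope:.3f}, intercept = {intercept:.3f}, r^2 = {r2:.3f}")
```

A coefficient of determination close to unity is what the proposed mechanism requires; a marked curvature, such as a maximum in the rate curve would produce, would show up as a poor fit.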

Unfortunately, this treatment cannot be applied to the results obtained with 
potassium and chrome alums on account of the maxima on the rate curves. The 
maxima are due to the acceleration of the rate of dehydration at very low water 
vapour pressures. It is well known that water catalyzes the crystallization of the 
solid products formed in dehydration processes so that the maxima on the curves 
for potassium and chrome alums might be ascribed to impedance of the flow of water 
vapour caused by a transition layer of amorphous solid, which crystallizes in the 
presence of water vapour. 

Rate of penetration of the interface into the crystal

This was measured on a McBain-Bakr quartz spring balance with partially
dehydrated crystals which were covered with the product of the reaction over the
whole surface. The technique has been described previously. For chrome alum,
which gives hemispherical nuclei, it has been shown that the rate of growth along
the surface was equal to that of penetration into the crystal, and that the rate of
penetration was independent of the thickness of the solid products. From this it
was deduced that there was no change in the impedance to the escape of water as
the layer of product became thicker and that if any impedance was present it was
limited to a transition layer. In the present experiments, the rate of penetration
was also found to be independent of the thickness until the reaction was nearly
complete, which shows that the type of impedance found with copper sulphate
pentahydrate was absent. Twenty out of the original twenty-four molecules of water
in K₂SO₄·Al₂(SO₄)₃·24H₂O are lost, the final product holding on firmly to the
residue of four molecules. There was a tendency for slightly greater losses at the
higher temperatures (see tables). The rates of penetration are given in tables 6
and 7 and compared with the radial growth of the nuclei over the surface.



The rate of penetration into the crystal is definitely less than the rate of growth
along the surface, but the ratio of the rates is practically constant at all temperatures
and is almost identical for potassium and ammonium alums. It is unlikely
that this constancy of ratio would have been obtained if there were any impedance
to the escape of water vapour, since this varies with temperature. It has been shown
that the rate of reaction is different in different directions in the crystal, and the
slower rate of growth perpendicular to the surface is probably very largely due to
this effect. Although no quantitative calculation can be made on the rate of growth
into the interior of the crystal from the available data, it will be noted that on the
111 faces of ammonium and potassium alums the ratio of the minimum to the
maximum rate of growth is approximately 0.7, so that ratios of the order given in
tables 6 and 7 are not unexpected. The activation energy for the dehydration
reactions is derived later, after correction for self-cooling of the crystal.



Table 6. Rate of penetration for ammonium alum

                         number of    experimental         rate from
temp.    self-cooling    molecules    rate R₁              nuclei R₂
(° C)    (° C)           lost         (cm./min. × 10⁻⁴)    (cm./min. × 10⁻⁴)    R₁/R₂

20       1.1             19.8         0.35                 0.71                 0.50
25       1.8             19.4         0.54                 0.975                0.55
30       2.75            20.0         0.76                 1.415                0.53
35       4.15            20.4         1.13                 2.125                0.53
40       6.20            20.6         1.42                 2.68                 0.53
                                                                        av.     0.53


Table 7. Rate of penetration for potassium alum

                         number of    experimental         rate from
temp.    self-cooling    molecules    rate R₁              nuclei R₂
(° C)                    lost         (cm./min. × 10⁻⁴)    (cm./min. × 10⁻⁴)    R₁/R₂

15       —               —            0.210                0.357                0.59
20       —               20.0         0.300                0.565                0.54
30       —               20.2         0.715                1.22                 0.59
40       —               20.7         1.374                2.52                 0.55
                                                                        av.     0.57
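
The constancy of the ratio of penetration rate to radial growth rate can be checked directly from the tabulated figures. The sketch below is illustrative only; it uses the ammonium alum values as transcribed from table 6, and the variable names are assumptions.

```python
# Ratio of penetration rate (R1) to radial nuclei growth rate (R2),
# ammonium alum, values as transcribed from table 6.
temps_c = [20, 25, 30, 35, 40]
r1 = [0.35, 0.54, 0.76, 1.13, 1.42]      # experimental rate of penetration
r2 = [0.71, 0.975, 1.415, 2.125, 2.68]   # rate from nuclei (radial growth)

ratios = [a / b for a, b in zip(r1, r2)]
mean_ratio = sum(ratios) / len(ratios)
spread = max(ratios) - min(ratios)

for t, q in zip(temps_c, ratios):
    print(f"{t} deg C: R1/R2 = {q:.2f}")
print(f"mean = {mean_ratio:.2f}, spread = {spread:.2f}")
```

A small spread over a 20° range, during which both rates change about fourfold, is the point of the argument that no temperature-dependent impedance is present.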


One run was carried out at 35° C for the mixed alum containing 50 % chromium
and 20.87 mol. of water were lost, showing that the reaction follows a similar course
to that in potassium alum: the rate of penetration was 0.565 × 10⁻⁴ cm./min., which
was in good agreement with the value for the rate of growth of the spherical nuclei
as measured on the surface, viz. 0.556 × 10⁻⁴ cm./min. The solid solution containing
68 % chrome alum gave a loss of 15.8 to 15.9 water molecules, which indicates a change
in the nature of the process of dehydration at the higher chromium content. The
rate of penetration at 35° was 0.917 × 10⁻⁴ cm./min., which is much less than that
for the radial nuclei growth. If the dehydration of the crystal was stopped half-way,
it was found that the actual penetration, as observed on a broken crystal, was much
greater than the value calculated from the loss of weight as measured on the spring
balance. There were seen to be two layers, the inner layer purple in colour and the
outer layer greenish blue. It was not possible accurately to ascertain the number
of water molecules lost in the two stages, but it was deduced that approximately
8 to 12 mol. were lost in the first stage. The rate of penetration for 85.8 % chrome alum
could also not be accounted for except by the assumption of two successive dehydration
processes. The rates of penetration as measured on the spring balance were
1.06 × 10⁻⁴ cm./min. at 25° C and 1.59 × 10⁻⁴ cm./min. at 30° C, and when measured
from a broken crystal, higher values were obtained, indicating the occurrence of
two processes, the first occurring with a loss of 12 water molecules and the second
with a loss of 4 mol. Two stages in the process of dehydration have been previously
found for copper sulphate pentahydrate (Garner & Tanner 1930).

Heats of dissociation 

The dissociation pressures were measured for ammonium alum by the method
previously described. Large single crystals, 2 cm. across, were employed with the
interface at different depths below the surface of the crystal. The dissociation
pressures were measured from 20.0 to 42° C (table 8). Log p is plotted against 1/T
in figure 8, from which the heat of dissociation is deduced to be 15.66 kcal./mol. H₂O.
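
For reference, the relation presumably underlying this plot is the integrated Clausius-Clapeyron equation. The text does not write it out, so the form below, with the heat of dissociation ΔH taken as constant over the range and R the gas constant, is stated here as an assumption:

```latex
\log_{10} p = -\frac{\Delta H}{2.303\,R\,T} + \text{const.},
\qquad
\Delta H = -2.303\,R\,\frac{d(\log_{10} p)}{d(1/T)} .
```

With R = 1.987 cal./deg. mol., a heat of 15.66 kcal. per mol. of water corresponds to a slope of about -3.42 × 10³ deg. on the plot of log₁₀ p against 1/T.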


Table 8. Vapour-pressure measurements on ammonium alum

condition                  temp. (° C)    pressure (cm.)

thick penetration          42.6           0.538
                           40.1           0.484
                           37.5           0.402
                           35.0           0.347
                           32.5           0.287
thick penetration          42.6           0.653
                           40.1           0.512
                           37.6           0.410
                           35.1           0.350
                           32.5           0.264
                           30.0           0.211
                           27.6           0.156
                           25.5           0.128
                           22.6           0.100
                           42.8           0.664
                           21.2           0.077
thick penetration          42.6           0.570
                           30.2           0.214
                           22.0           0.080
new crystals, surface      42.4           0.550
nucleation only            36.2           0.375
                           20.4           0.073
                           30.1           0.199
                           25.5           0.136
                           20.0           0.087


Figure 8. Dissociation pressures of ammonium alum. O deep penetration, * light penetration.

The self-cooling of the crystal was measured by means of a thermocouple placed
between two crystals carefully cemented together. When the reaction occurs at an
interface completely surrounding the two crystals, the thermocouple will be in a
constant temperature enclosure. The values for self-cooling recorded in the tables
are probably somewhat too low on account of the use of too thick copper wire in the
copper-constantan thermocouple, so that the self-cooling has also been calculated
by the method due to Smith & Topley from the equation

$$K(T_2^4 - T_1^4) = NH,$$

where $T_2$ is the temperature of the thermostat, $T_1$ that of the crystal, $K = 1.37 \times 10^{-12}$,
N the number of water molecules evaporated per sq.cm. and H the heat of dissociation
of the hydrate. This assumes that the dehydrated product behaves as a black
body, and these calculations may therefore give values for the self-cooling which
are too high. Both the experimental results and those calculated assuming black
body conditions were used to correct the rates of growth of nuclei and the rates of
penetration, and in the calculation of the activation energies for dehydration. The
values are given in table 9.
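
A rough numerical illustration of the black-body estimate follows. It is a sketch under several assumptions that are not taken from the paper: the radiation balance is given the Stefan-Boltzmann form K(T₂⁴ - T₁⁴) = NH with K = 1.37 × 10⁻¹² cal. cm.⁻² sec.⁻¹ deg.⁻⁴; the density (1.64 g./c.c.) and formula weight (453.3) of NH₄Al(SO₄)₂·12H₂O are handbook values; ten of the twelve water molecules of that formula unit are lost; and the penetration rate at 40° C is read from table 6 as 1.42 × 10⁻⁴ cm./min. The function name `self_cooling` is hypothetical.

```python
# Black-body estimate of self-cooling from K*(T2**4 - T1**4) = N*H.
# Every numerical input below is an assumption made for illustration.

K = 1.37e-12        # Stefan constant in cal cm^-2 s^-1 K^-4 (assumed units)
H = 15660.0         # heat of dissociation, cal per mol of H2O
DENSITY = 1.64      # g/cm^3 for ammonium alum (handbook value, not from the paper)
MOLAR_MASS = 453.3  # g/mol for NH4Al(SO4)2.12H2O
WATER_LOST = 10.0   # mol H2O lost per mol of this formula unit (20 of 24)

def self_cooling(thermostat_c, penetration_cm_per_min):
    """Return T2 - T1 in degrees for a thermostat temperature and penetration rate."""
    t2 = thermostat_c + 273.15
    water_per_cm3 = DENSITY / MOLAR_MASS * WATER_LOST    # mol H2O per cm^3 of hydrate
    n = water_per_cm3 * penetration_cm_per_min / 60.0    # mol H2O per cm^2 per second
    t1 = (t2 ** 4 - n * H / K) ** 0.25                   # crystal temperature, K
    return t2 - t1

delta = self_cooling(40.0, 1.42e-4)
print(f"estimated self-cooling at 40 deg C: {delta:.1f} deg")
```

With these inputs the estimate comes out above the measured self-cooling of 6.20° at 40° C, which is the direction expected if the product is not a perfect black body and the thermocouple reads low.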

The agreement between the values for the activation energies for the nuclei growth
and the penetration of the interface is satisfactory. The values for the rate of penetration
would be expected to be the more reliable, since the rates were determined
under conditions where the self-cooling had become constant, which is not necessarily
the case for the measurements with nuclei. For ammonium alum the activation
energy is about 1 kcal. higher than the energy of dissociation. The activation
energy is 16.5 ± 0.6 and the heat of dissociation is 15.7 kcal. The close agreement
between the two energies indicates that the coupling between the evaporation
process and the reorganization of the lattice to give the final products is very close.
The reformation of the alum lattice in the reverse process by the accretion of water
vapour thus takes place without appreciable activation energy. The process of
hydration is thus analogous to that of condensation of water vapour on a water
surface, and to the reformation of carbonates from the oxides.


Table 9. Activation energies of ammonium and potassium alums

                                   activation energies based on self-cooling
                                   experimental       calculated from
                                   self-cooling       Planck's constant

ammonium alum
hexagonal nuclei on 111 face       16.3               17.1
spherical nuclei on 111 face       16.0               17.1
penetration                        16.4               17.2
heat of dissociation               15.7               —

potassium alum
hexagonal nuclei on 111 face       16.2               —
penetration                        16.6               —
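
The way a self-cooling correction enters the activation energy can be illustrated with the penetration data for ammonium alum. This is a minimal sketch, not the authors' procedure: it takes the rates and measured self-cooling as transcribed from table 6, subtracts the self-cooling from the thermostat temperature, and fits ln(rate) against 1/T by least squares. The function name `activation_energy` is an assumption.

```python
import math

R_GAS = 1.987e-3   # gas constant, kcal mol^-1 K^-1

thermostat_c = [20.0, 25.0, 30.0, 35.0, 40.0]
self_cooling = [1.1, 1.8, 2.75, 4.15, 6.20]   # measured self-cooling, table 6
rate = [0.35, 0.54, 0.76, 1.13, 1.42]         # penetration rate, table 6 units

def activation_energy(temps_c, rates):
    """Least-squares slope of ln(rate) against 1/T, returned as E in kcal/mol."""
    xs = [1.0 / (t + 273.15) for t in temps_c]
    ys = [math.log(r) for r in rates]
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    return -R_GAS * sxy / sxx

crystal_c = [t - dt for t, dt in zip(thermostat_c, self_cooling)]
e_uncorrected = activation_energy(thermostat_c, rate)
e_corrected = activation_energy(crystal_c, rate)
print(f"E from thermostat temperatures  = {e_uncorrected:.1f} kcal/mol")
print(f"E corrected for self-cooling    = {e_corrected:.1f} kcal/mol")
```

Because the self-cooling increases with temperature, the true temperature range is narrower than the nominal one, and the corrected activation energy is necessarily the larger of the two.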


Table 10

% chrome alum    activation energy (kcal.)

0.0              16.2
30.23            16.1
39.75            17.0
50.40            24.4
67.98            21-43
85.8             17-33
100.0            31


The B-factor in the equation $r_0 = Be^{-E/RT}$ for ammonium alum for the rate of
reaction in molecules of water per sq.cm. of surface is 8.2 x 10^^ at 35° and 8.6 x 10^^
at 25° C, and is 2.2 x 10^^ for potassium alum. These values are a little lower than
those obtained in exothermic solid reactions but are in fair agreement with those
expected from the Polanyi-Wigner equation, in which 10^®.

Activation energies for mixed alums. The temperature coefficients of the rates of
growth for mixed alums were only determined roughly; insufficient points were
taken for accurate values to be derived. The change in the character of the nuclei
on the addition of chromium did not at first affect appreciably the activation energy
(table 10), but when the chrome alum content exceeded 50 % the activation energy
depended on the type of nuclei and on the temperature. The cluster type of nuclei
had high temperature coefficients.


Discussion 

Nucleation. Water molecules adsorbed by van der Waals forces on the surfaces of
hydrates will be removed very rapidly in vacuum, but those molecules embedded 
in the ionic lattice will require high energies of activation for their release. The length 
of the induction period before nuclei are formed and the high temperature coefficient 
of the induction period afford evidence of the latter. The release of this water must 
be accompanied by movements of the ions surrounding the water molecules, and 
the energy of activation is probably high because of the necessity for imparting a 
sufficiently large amplitude of vibration or rotation to these ions to allow the water 
molecule to escape. Certain of the water molecules in the alums are structural water 
(Lipson & Beevers 1935) and will be more mobile than the rest. The loss of a water 
molecule will produce a hole which will be partly filled up by some reorganization of
the ions and other water molecules in its immediate neighbourhood. This hole could 
be completely filled by the migration of a molecule of water from within the crystal 
if the activation energy for migration were sufficiently low. The activation energy 
for migration would be expected to be of the same order as that for evaporation, so 
that an appreciable defect of water within the crystal lattice should be produced. 
There is, however, no case known where nuclei of a new solid phase are produced 
within the lattice. If, however, a nucleus were formed within the lattice, it would 
grow very slowly because there would be no rapid means for the removal of water. 
Also, there is more mobility for forming new solid phases at the surface than in the 
interior of the crystal, and therefore the absence of nuclei formation in the interior 
cannot be taken as evidence for the absence of lattice defects. 

It is considered that the possibility of such a defect occurring within the crystal 
should always be borne in mind in the interpretation of the phenomena of nucleation. 
Certain of the phenomena found in this investigation may be interpreted in terms 
of a lattice defect. Thus, the length of the induction period which elapses before 
nuclei are formed may, in part, be due to the time needed for building up a sufficiently
large water defect within the lattice. The formation of nuclei would be
expected to be dependent approximately on the nth power of the concentration of
the holes, where n is the number of water molecules which must be removed in one
localized area in order to start the initial dislocation from which a nucleus commences
to grow. Where n is large, nuclei will form when the concentration of defects
approaches a critical value and the time at which nucleation commences will be
reasonably sharp, as is, in fact, found. From the counts of the numbers of nuclei
of different sizes present at any time, it was deduced that the rate at which the
nuclei were formed varies according to a linear or a second order law (Bright &
Garner 1934; Garner & Southon 1935; Cooper & Garner 1936). This is what would
be expected if, during the period in which nuclei were forming, the water defect
within the crystal was being maintained at a constant or slightly increasing figure.
The defect within the lattice would be increased by the loss of water from the crystal
surface and decreased by migration of the holes to the interface of the nuclei. When
the area of the interface becomes large, evaporation from the bare crystal surface
may be insufficient to maintain the lattice defect above the critical value at which
nuclei are readily formed. Therefore, after an initial batch of nuclei has been formed,
it would be expected that there would be a complete cessation of further nucleation.
This is actually observed in the cases of common and ammonium alums. On the
basis of this reconstruction, the initial rapid rate of growth of the nuclei (figure 5) 
can be accounted for as due to the migration of lattice defects to the growing 
interface, which will increase temporarily the rate of loss of water from this 
interface. 
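
The sharpness of the onset of nucleation for large n can be made concrete with a short calculation. This is an illustration of the power-law argument only, with hypothetical values of n and of the defect concentration; the function name `relative_nucleation_rate` is an assumption, and nothing here is fitted to the measurements.

```python
def relative_nucleation_rate(fraction_of_critical, n):
    """Rate relative to its value at the critical defect concentration, assuming rate ~ c**n."""
    return fraction_of_critical ** n

# Defect concentrations expressed as fractions of the critical value.
fractions = (0.5, 0.8, 0.9, 1.0)
for n in (1, 2, 10, 20):
    row = [relative_nucleation_rate(c, n) for c in fractions]
    print(f"n = {n:2d}: " + "  ".join(f"{v:.3f}" for v in row))
```

For small n the rate rises gradually with the defect concentration, whereas for large n it remains negligible until the concentration is within a few per cent of the critical value, which corresponds to a well-defined induction period.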

Interface reaction and shapes of nuclei. The striking fact about interface reactions 
is that the removal of water is much easier from the interface than from the crystal 
surface. This is probably due to the coupling of the process of evaporation with an 
exothermic process, the reorganization of the inorganic constituents to give new 
solid phases. There appear to be two types of interface reaction, possibly linked by 
intermediate cases. In the first class, of which copper sulphate pentahydrate, 
potassium and ammonium alum are examples, the activation energies are those 
normal for processes occurring at room temperatures and the B-factor in the
equation $r_0 = Be^{-E/RT}$ is of the order of 10^^ to 10^^. The heat of activation is of the
same order as the thermodynamic heat of dissociation. In the second class, of which
chrome alum and calcium carbonate hexahydrate are examples, the activation
energy is considerably higher than the heat of dissociation and the B-factor many
powers of 10 higher than the normal value. The nature of the coupling between 
the evaporation of water and the reorganization of the lattice may be different in 
the two cases. 

Normal interface reactions. For these the rate of removal of water from the interface
is markedly dependent on crystal direction, and the nuclei possess characteristic
geometrical shapes which reproduce some of the symmetry of the crystal. The 
processes which occur may be regarded as a reversal of those occurring during the 
growth of a crystal from vapour or solution, in which the crystal is built up along 
selected planes (Volmer 1939). By analogy with the growth and dissolution of 
crystals, the activation energy for the removal of water from any point on the 
interface would be expected to depend on the number of undecomposed units 
surrounding that point. Thus the loss of water from the middle of a plane surface 
will be less probable than that of a molecule placed at a corner or along an edge. 
The interface is not, however, completely analogous to a surface, for there is the 
effect of the second solid phase to be taken into account, which might modify the 
activation energies. The shape of the nucleus will, however, depend on the relative
values of the activation energies of the various types of process that are permissible.
If the activation energies are all approximately of the same order, then no crystal 
direction will be specially preferred and a spherical nucleus will be obtained. In 
the case of potassium and ammonium alums the bounding of the surfaces of the 
nuclei by edges which lie in the 100 planes of the hydrate indicates that the process 
of dehydration is fastest along the 100 planes and that the process occurs layer by 
layer. It would appear that phenomena observed in the dehydration of these two 
alums are normal from a kinetic point of view because of this fact, since layer 
decomposition would facilitate a close coupling between the dehydration process 
and the reorganization of the ions to give a new lattice. 

For the normal mechanism of coupling, the rate is retarded by water vapour, 
and the probable mechanism of the retardation is that the adsorption of water 
molecules on active areas in the interface blocks the dehydration. The rate of 
dehydration becomes proportional to the number of such places not covered by 
adsorbed water. In the presence of water vapour, the nuclei lose their characteristic 
shapes and become approximately spherical, indicating that layer growth is 
hindered, and that the reaction takes other paths which possess equal probability in 
all directions. When the water vapour pressure is just below the dissociation pressure, 
the nuclei possess an irregular outline and become coarsely crystalline. They are 
similar to the nuclei obtained with iron ammonium alum in a hard vacuum and 
resemble the spherulitic growths sometimes formed from solution, where there is 
a very rapid growth of the crystals along one axis. Layer growth has obviously
broken down into a process which is determined more by the properties of the 
crystalline products than by the structure of the original hydrate. 

The addition of foreign ions to the hydrate lattice also causes a breakdown of the 
layer growth, and produces spherical nuclei. The effects are, however, radically 
different from those due to the presence of water vapour. The foreign ions appear to 
stabilize the lattice, conferring on it some of the properties of a zeolitic structure. 
There is evidence that water is removed from the region ahead of the normal interface,
which indicates that the movement of holes within the lattice is facilitated.
The occurrence of such a process would obviously cause a breakdown of dehydration 
layer by layer. 

There is a diffuse region ahead of the interface in the case of ammonium and
potassium alums (figure 1a, b, plate 12), which appears to consist of fine cracks. This
phenomenon only leads to second order effects in the case of the two alums, and
the layer growth controls the phenomena which occur. The formation of cracks
ahead of the interface might, under certain conditions, have serious effects, for
if the cracks isolated portions of the crystals, these would have to be nucleated
afresh, and the rate of dehydration would be controlled by nucleation in the
cracked area.

Abnormal interface reactions. These are characterized by high values of E and
high B-factors. A high activation energy would be expected if the process of evaporation
were not closely coupled with the formation of crystals of the solid products.
Thus it would be high if the products were produced in an amorphous form. Chrome
alum gives E = 31 kcal. and calcium carbonate hexahydrate E = 38 to 40 kcal., and
these values are so high that it would not be expected that the reactions would occur
at an appreciable rate at room temperatures. For chrome alum r_0 = 10^^ e^{-31,000/RT}
mol./cm.², and this rate could be accounted for if the excitation of a molecule with
31 kcal. sufficed for the release of 10^^ other molecules. This would correspond with
the nucleation of a block of a crystal 10~^ cm. cube. It may well be that for chrome
alum, cracking occurs ahead of the interface, producing isolated blocks of crystal
which have to be nucleated afresh. If this were the rate-determining step, the
high activation energy would be accounted for.

It has been suggested previously (Garner 1939) that the activation was a co-operative
process, proceeding at a number of points in the lattice simultaneously,
thereby producing a dislocation over a wider area than with the normal coupling. 
This would mean breaking down the activation energy into a number of smaller 
values, applied at a number of points in the interface. This might be the correct 
explanation. 

There is a third possibility, that an amorphous product is formed in a transition 
layer which hinders the flow of water molecules from the interface, and that the 
thickness of this layer is dependent on temperature. This would mean that the high 
activation energy was fictitious, and that there should be subtracted from it the 
energy corresponding to the temperature coefficient of the impedance through the 
transition layer. This would also give a reasonable explanation. 

It does not appear to be possible at the present stage to decide which of these 
explanations is correct. 


References 

Bright & Garner 1934 J. Chem. Soc. p. 1872. 
Cooper & Garner 1936 Trans. Faraday Soc. 32, 1739. 
Cooper & Garner 1940 Proc. Roy. Soc. A, 174, 487. 
Garner 1939 Nature, 144, 287. 
Garner & Pike 1937 J. Chem. Soc. p. 1565. 
Garner & Southon 1935 J. Chem. Soc. p. 1705. 
Garner & Tanner 1930 J. Chem. Soc. p. 47. 
Hume & Colvin 1929 Proc. Roy. Soc. A, 125, 635. 
Kohlschütter & Nitschmann 1931 Z. phys. Chem., Bodenstein Festband, p. 494. 
Lipson & Beevers 1935 Proc. Roy. Soc. A, 148, 680. 
Smith & Topley 1931 Proc. Roy. Soc. A, 134, 224. 
Smith & Topley 1935 J. Chem. Soc. p. 321. 
Spencer & Topley 1929 J. Chem. Soc. p. 2645. 
Topley 1932a Proc. Roy. Soc. A, 136, 413. 
Topley 1932b Phil. Mag. (VII), 14, 1080. 
Topley & Hume 1928 Proc. Roy. Soc. A, 120, 211. 
Volmer & Seidel 1937 Z. phys. Chem. 179, 153. 
Volmer 1939 Kinetik der Phasenbildung. Steinkopff. 
Zawadzki & Bretsznajder 1938 Trans. Faraday Soc. 34, 951. 



[Plates 12 to 14. Acock et al., Proc. Roy. Soc. A, volume 189. Figure 4 appears on plate 14.] 



Dehydration of ammonium, potassium and some mixed alums 

Description of Plates 12 to 14 

Plate 12. Figure 1. Nuclei on potassium and ammonium alums. 

potassium alum      ammonium alum 
a 111 face          b 111 face 
c 001 face          d 001 face 
e 011 face          f 011 face 

Plate 13. Figure 3. Nuclei on alums. 
a Rubidium alum, 111 face. 
b Caesium alum, 111 face. 
c Iron ammonium alum, 111 face. 
d Ammonium alum, 111 face. Effect of 0.776 mm. water vapour during growth at 40° C. 
e Ammonium alum, 111 face. Effect of 2.05 mm. water vapour during growth at 40° C. 
f Potassium alum, 111 face. Effect of 0.776 mm. water vapour during growth at 30° C. 

Plate 14. Figure 4. Nuclei on K2SO4[Al,Cr]2(SO4)3.24H2O. 


Slow neutron-induced activity in gold 

By K. Siegbahn, Research Institute for Physics, 

Academy of Sciences, Stockholm 

(Communicated by Sir Owen Richardson, F.R.S. Received 20 March 1946) 

The 2.7 day activity in ¹⁹⁸Au has been studied by means of a β-spectrometer of the lens type 
in conjunction with a special technique for studying the γ-radiation. The β-spectrum has an 
upper limit = 0.92 MeV. An absorption measurement on the γ-radiation in Pb shows that the 
high-energy γ-radiation previously reported does not exist. Only one γ-ray of 
medium energy (0.401 MeV) follows the β-disintegration. 

If gold is irradiated with slow neutrons (Fermi, Amaldi, d'Agostino, Rasetti & 
Segrè 1934), a strong activity appears with a half-life period of 2.7 days. This must 
be due to the gold isotope 198, as gold has only one stable isotope with the mass 197. 
The same active isotope can be produced in several other ways, for example, 
by deuteron-irradiation of gold (Cork & Thornton 1937) and platinum (Lawson & 
Cork 1940). At the same time, however, other active gold isotopes are produced, and 
hence the methods mentioned are not suitable if it is desired to study ¹⁹⁸Au alone. 
Fast neutrons, on the other hand, give rise, besides the above-mentioned activity, 
to two others, though considerably weaker, with the half-life periods 13 hr. and 
3.3 days (MacMillan, Kamen & Ruben 1937). 




Several investigations of the disintegration of ¹⁹⁸Au have already been performed, 
different methods being used: absorption measurements, coincidence measurements, 
investigations in the Wilson chamber and the β-spectrograph. The results have 
hitherto shown little mutual agreement, which, in some cases, may be explained by 
the presence of active isotopes other than ¹⁹⁸Au. A survey of earlier measurements 
has been published by Feather & Dainty (1944). For the sake of completeness some 
of the essential results will be repeated here. 

¹⁹⁸Au emits both β- and γ-radiation. It is transformed by β-emission into ¹⁹⁸Hg. 
Sizoo & Ejkman (1936) have, however, found, in absorption measurements on the 
γ-radiation, a weak γ-energy at 2.6 MeV. This is explained by assuming that ¹⁹⁸Au 
may also be transformed into ¹⁹⁸Pt by K-capture. 

The β-spectrum has been studied by, among others, Richardson (1939) in a 
Wilson chamber. The sample consisted of a gold foil that had been irradiated with 
slow neutrons. The spectrum proved to consist of a highly asymmetrical, continuous 
distribution, with its upper limit at 0.83 MeV. Besides, a strong β-line 
was found, corresponding to an internally converted γ-energy of 440 keV. The 
conversion coefficient was as high as 0.1. The upper limit of the β-spectrum has 
been further studied by Sizoo & Ejkman, using the absorption method. The 
result E_max = … MeV was obtained. Using the same method, Cork & Halpern 
(1940), among others, obtained the result E_max = 0.74 MeV and Clark (1942) 
E_max = 0.78 MeV (corrected by Feather & Dainty (1944) to 0.87 MeV). Using a 
particularly exact absorption method (Feather 1938), Feather & Dainty determined 
E_max = 0.985 ± 0.010 MeV. 

The γ-radiation accompanying the disintegration is of special interest. In several 
different cases, two γ-lines were discovered, with energies of about 260 and 460 keV 
respectively. At the same time, a weak radiation of about 70 keV was obtained. 
The latter radiation has been supposed to be X-radiation from ¹⁹⁸Hg (K-radiation). 
This explanation may be considered plausible, in view of the existence of the strong 
β-line. The following investigations on the γ-radiation may be given brief mention. 
Richardson (1939) determined, in a Wilson chamber, the distribution of the photo-electrons 
ejected by the γ-radiation from a lead foil. He obtained three photo-lines, 
corresponding to the γ-energies 70, 280 and 440 keV. A slight trace of yet another 
γ-energy of about 0.7 MeV was also obtained. The ratio between the intensities for 
γ₂₈₀ and γ₄₄₀ was estimated as 1.0 : 1.2. Using the absorption method, Sizoo & 
Ejkman (1936) obtained the γ-energies 73, 260 and 410 keV. Besides, they obtained 
a weak γ-radiation of about 2.6 MeV (about 2 quanta per 100 disintegrations). 
The two medium soft γ-components were also found by Cork & Halpern (1940) 
(240 and 800 keV) and Krishnan (1941) (210 and 600 keV). Plesset (1942) studied 
the γ-radiation from gold, irradiated with deuterons of 11 MeV, lead being the 
secondary radiator. He obtained a number of photo-lines, three of which were 
attributed to ¹⁹⁸Au. The photo-lines had the energies 63.9, 323 and 389 keV. As the 
difference between the two latter corresponds to the difference between the K- and 
L-absorption energies of lead, the origin of both lines may be one γ-line with the 




energy 402 keV. On the other hand, the intensities of the two lines were nearly 
equal, which, were our supposition right, would not be the case. (The ratio should 
be about 7 : 1.) 

A coincidence investigation by Clark (1942) showed that γ-γ as well as β-γ 
coincidences were obtained. By the use of Norling's (1941) efficiency curve for lead 
γ-tubes, the coincidence measurements could be accounted for on the basis of 
Richardson's disintegration scheme (figure 1), which assumes that the two γ-rays are 
emitted in cascade and that each β-particle is accompanied by two γ-quanta 
(disregarding the suggested possibility of an extremely weak γ-transition direct to the 
ground state). Clark's coincidence measurements seem quantitatively to agree very 
well with the corresponding investigations of Feather & Dainty (1944). Norling (1941), 
on the other hand, could not obtain any γ-γ coincidence effect. He did, however, 
obtain a very strong β-γ coincidence effect, which undoubtedly indicates that, at 
low energies, the γ-efficiency curve passes through a maximum (see p. 56 in Norling's 
paper). 

Figure 1. Previously proposed level scheme for the disintegration of ¹⁹⁸Au 
(¹⁹⁸Au to ¹⁹⁸Hg; β, 0.78 MeV; γ, 0.28, 0.44 and 0.7 MeV). 


The β-spectrum 

Since at present there is no method admitting of a substantial concentration of 
active in relation to inactive gold, gold in the form of a thin foil (0.02 mm. thick) 
was irradiated with slow neutrons from the Stockholm cyclotron. A sample of such 
a thickness cannot, of course, be expected to give a β-spectrum other than with 
considerable distortion. The foil, whose diameter was 8 mm., was placed in the 





β-lens spectrograph (Siegbahn 1946), the resolving power of which was set at a value 
of ΔHρ/Hρ = 5 %. At this adjustment of the spectrograph, the diaphragm system 
transmits 2 % of the radiation. With the spectrograph so adjusted it has proved 
possible, with preparations of sufficiently long half-life period, to measure the 
β-spectrum at intensities as small as some hundredths of a μC. 






Figure 2 shows the β-spectrum of ¹⁹⁸Au. It is simple in character, as seen from 
the relatively symmetrical form. At low energies, however, it can be observed that 
the absorption in the gold foil has become perceptible, the spectrum displaying a 
comparatively small number of low-energy electrons. The spectrum shows only 




one β-line, with Hρ = 2200, corresponding to an energy for the γ-quantum = 
0.401 MeV. The corresponding L-line, Hρ = 2470, was also observable but has been 
omitted, as the statistics employed hardly permitted any definite conclusion 
regarding its magnitude. The size of the β-line relative to the remaining continuous 
spectrum is smaller than that previously observed, which may in some part be due 
to the absorption in the gold foil. According to Richardson's determination, the 
β-line should correspond to an internal conversion coefficient of 10 %, whereas 
Plesset finds α_K = 4 %. The uncorrected value of α_K is found from figure 2 to be 
about 1 %. 

The upper limit of the spectrum is most accurately obtained by finding the 
Fermi curve for the spectrum. Figure 3 shows the end of this curve. The upper limit 
is determined from this as 0.92 ± 0.01 MeV. 
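
The idea of reading the upper limit from a Fermi curve can be sketched as follows. For an allowed spectrum N(p) proportional to F p² (E₀ − E)², the quantity sqrt(N / (F p²)) is linear in the kinetic energy E and meets the axis at E₀. The code below is only an illustration under stated assumptions: the Coulomb factor F is set to 1, the data are synthetic (generated with an end point of 0.92 MeV), and the function names are hypothetical; it is not the evaluation procedure used in the paper.

```python
import math

ME = 0.511  # electron rest energy in MeV


def momentum(kinetic_mev):
    """Electron momentum in units of m_e c for a given kinetic energy."""
    w = 1.0 + kinetic_mev / ME
    return math.sqrt(w * w - 1.0)


def endpoint_from_fermi_curve(energies, counts):
    """Fit sqrt(N / p^2) against E by least squares and return the intercept on the E axis."""
    ys = [math.sqrt(n / momentum(e) ** 2) for e, n in zip(energies, counts)]
    m = len(energies)
    sx, sy = sum(energies), sum(ys)
    sxx = sum(e * e for e in energies)
    sxy = sum(e * y for e, y in zip(energies, ys))
    slope = (m * sxy - sx * sy) / (m * sxx - sx * sx)
    intercept = (sy - slope * sx) / m
    return -intercept / slope


# Synthetic allowed-shape data near the end of the spectrum (assumed E0 = 0.92 MeV, F = 1).
E0_ASSUMED = 0.92
energies = [0.50 + 0.05 * i for i in range(9)]
counts = [momentum(e) ** 2 * (E0_ASSUMED - e) ** 2 for e in energies]

upper_limit = endpoint_from_fermi_curve(energies, counts)
print(f"extrapolated upper limit: {upper_limit:.3f} MeV")
```

With real data the points would scatter about the line, and the Coulomb factor for the daughter nucleus would have to be included before the extrapolation is trusted.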

Even if the β-spectrum reveals only one γ-line, there are possibilities of more 
than one such line occurring through disintegration. The high value of α_K for the 
β-line at 0.401 MeV is probably due to this radiation having a quadrupole character. 
If there are other γ-lines of a dipole nature, these are not certain to bring about 
internal conversion to an extent large enough to give detectable β-lines. It is therefore 
necessary to study the γ-radiation directly. 

γ-absorption in lead 

In the first place, the question arose of ascertaining, by an absorption measurement, 
the presence of the weak, hard γ-component of 2.6 MeV, reported by Sizoo 
& Ejkman (1936). For this purpose a γ-tube was used, the cathode being a 1 mm. 
thick brass cylinder. The absorption process was observed for absorption thicknesses 
up to 28 mm. of lead. Figure 4 shows, in a logarithmic scale, the curve obtained. The 
intensity, at the last point measured, is only about 4 % of the intensity without 
absorption screens. Nevertheless, even these last points lie entirely within the 
limits of error on the straight line that may be drawn through the other points, and 
hence the determination does not support the presence of any hard component. This 
is in agreement with the conclusion of Feather & Dainty (1944) based on less extensive 
evidence. Since it was necessary to extend the absorption measurements to 
such large absorption thicknesses in order to test for the presence of the hard 
component, the sample could not be placed, owing to difficulties of intensity, as far 
from the GM tube as would be desirable for a good quantitative determination of 
the energy of the γ-radiation. The crude γ-ray absorption coefficient corresponds 
to an energy = 0.46 MeV. This value is considerably higher than the more accurately 
determined Eγ = 0.401 MeV from the β-spectrum. 
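
The reasoning behind this test can be sketched numerically: a single γ-line gives a straight line on a logarithmic scale, whereas a weak but more penetrating component would flatten the curve at large thicknesses. In the sketch below every number is an assumption chosen for illustration. The coefficient 1.15 cm⁻¹ is simply picked so that about 4 % survives 28 mm, the coefficient 0.5 cm⁻¹ stands in for a hypothetical hard component, and the weight 0.02 ignores any difference in counter efficiency.

```python
import math


def transmitted(thickness_cm, components):
    """Relative intensity for a list of (weight, mu_per_cm) components."""
    return sum(w * math.exp(-mu * thickness_cm) for w, mu in components)


def log_slope(thickness_cm, components, step=0.01):
    """Numerical slope of ln(intensity) against absorber thickness."""
    lower = math.log(transmitted(thickness_cm - step, components))
    upper = math.log(transmitted(thickness_cm + step, components))
    return (upper - lower) / (2.0 * step)


MU_SOFT = 1.15  # cm^-1, assumed: gives about 4 % transmission at 2.8 cm
MU_HARD = 0.50  # cm^-1, assumed value for a hypothetical hard component

single_line = [(1.0, MU_SOFT)]
with_hard = [(0.98, MU_SOFT), (0.02, MU_HARD)]

for label, comps in (("single line", single_line), ("with hard component", with_hard)):
    print(label, round(log_slope(0.2, comps), 3), round(log_slope(2.8, comps), 3))
```

Under these assumptions the slope of the single-line curve is the same at 2 mm and at 28 mm, while the mixed curve is perceptibly flatter at 28 mm; whether such a departure exceeds the limits of error depends on the counting statistics.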

One cannot expect to establish from the absorption curve, considering the 
thickness of the wall of the γ-tube, the presence of the γ-ray of 70 keV reported by 
several authors. It is remarkable, however, that the γ-component of about 260 keV, 
also often reported, cannot be detected by any procedure. With regard to the geometry 
not being ideal, it is, however, difficult to draw any definite conclusion from 




the absorption curve as to the non-existence of this γ-energy. It is certainly known 
that it is sometimes possible to deduce a purely exponential absorption process, 
even on occasions when several γ-lines occur (Roberts, Downing & Deutsch 1941). 



Figure 4. Absorption curve for the γ-radiation of ¹⁹⁸Au in lead. 

The study of the γ-rays in the spectrograph 

In order to investigate this question more closely, it is suitable to apply the more accurate 
photo-Compton method with the aid of the β-spectrograph. Gold in the form of 
0.02 mm. foils was irradiated in the cyclotron with slow neutrons for a few days, so 
that the total activity was equivalent to 10 μC Ra. The gold was then packed into 
a small cylindrical Cu radiator (φ = 8 mm.), the wall thickness of which was sufficiently 
large, according to Feather's formula, to absorb entirely the electrons of 
the continuous β-spectrum. The energy distribution of the secondary electrons 
released in the Cu radiator was studied in the β-spectrograph. A 0.1 mm. thin lead 
foil was then placed in front of the Cu radiator, and a new energy distribution curve 
was recorded. In the first case, it is fairly certain that the Compton effect in the 
copper provides the main component in the secondary spectrum and in the latter 
case both the Compton effect and the photo-effect in the lead foil will be seen. 

Figure 5 shows the former registration. In order to give an idea of the sensitivity 
of the arrangement, I have not plotted along the axis, in this figure and figure 6, 
the intensity, i.e. the number of impulses counted per min. P divided by the corresponding 
Hρ-value, but P alone. Figure 5 suggests at first sight the existence of 
two γ-components, the lower energy corresponding to that first obtained from the 



Figure 5. Secondary electron spectrum expelled from a Cu 
radiator by the γ-radiation of ¹⁹⁸Au. 

Figure 6. Secondary electron spectrum expelled from a Cu 
radiator covered with a 0.1 mm. lead foil. 




β-spectrum. The next figure, however, makes it clear that the second energy peak is 
due to the photo-effect in the copper, which, though certainly weak, is quite perceptible 
at these comparatively low energies of the γ-radiation. The present author 
has previously found, in several cases, that the most accurate possible value is 
obtained for the γ-energy, if the upper energy limit of the Compton distribution 
(i.e. the limit corresponding to the impact angle φ = 0 in the Compton effect formula) 
is placed at the inflexion point of the energy distribution. Using this procedure, 
the value Eγ = 0.401 MeV is obtained from figure 5. The energy of the γ-radiation 
may also be determined from the peak of the photo-line. This has the Hρ-value 
2484, i.e. 0.387 MeV. If the work of liberation for the K shell in Cu, 9 keV, is added, 
then Eγ = 0.396 MeV. 

Figure 6 shows the distribution of the secondary electrons, when the lead foil was 
placed before the radiator. This curve is composed of three different components: 
the Compton electrons (the broken line represents the curve of figure 5 for comparison), 
the K photo-line in Pb and the L photo-line in Pb. It is evident that the 
higher energy maximum in figure 5 cannot be a Compton distribution from a second 
γ-component, as the photo-line that would correspond to this γ-energy in figure 6 
is missing. The peak of the K photo-line occurs at Hρ = 2161, i.e. 0.316 MeV. Hence 
Eγ is obtained by adding the K absorption energy of the lead = 0.088 MeV, i.e. 
Eγ = 0.403 MeV. Eγ is obtained in the same way from the L photo-line, the Hρ-value 
of which is 2461, as 0.401 MeV (E_L = 0.016 MeV). It is satisfactory that the energy 
values deduced from the internal and external conversion effects agree so closely, 
and it should further be noted that the final value of the γ-ray energy deduced 
from the present experiments is in excellent accord with the value obtained by 
Plesset (1942). It is further satisfactory that the strange relation between the 
intensities of the two photo-lines reported by Plesset does not appear in the present 
investigation. 
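
The conversion from an Hρ-value to a γ-energy used above can be sketched as follows: the electron momentum follows from Hρ, the kinetic energy from the relativistic energy relation, and the γ-energy by adding the absorption energy of the shell concerned. This is a minimal sketch, assuming Hρ in gauss cm and present-day values of the constants (which may differ slightly from those used in the paper); the function names are hypothetical.

```python
import math

ME = 0.51099895            # electron rest energy in MeV
MEV_PER_GAUSS_CM = 2.99792458e-4  # p*c in MeV for H*rho = 1 gauss cm (unit charge)


def kinetic_energy_mev(h_rho):
    """Kinetic energy of an electron with the given H*rho (gauss cm)."""
    pc = MEV_PER_GAUSS_CM * h_rho
    return math.sqrt(pc * pc + ME * ME) - ME


def gamma_energy_mev(h_rho, binding_mev):
    """Photo-line energy plus the absorption energy of the shell."""
    return kinetic_energy_mev(h_rho) + binding_mev


# Values quoted in the text for the lead foil.
e_from_k_line = gamma_energy_mev(2161, 0.088)  # K photo-line in Pb
e_from_l_line = gamma_energy_mev(2461, 0.016)  # L photo-line in Pb
print(f"from K line: {e_from_k_line:.3f} MeV, from L line: {e_from_l_line:.3f} MeV")
```

Both results lie within a few keV of the 0.401 to 0.403 MeV quoted in the text, which is as close as the rounded constants allow.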

Figure 6 illustrates well the sensitivity of the spectrograph. It proves, among other 
things, that an activity of only 1 μC Ra γ-equivalence would still have given about 
100 kicks/min. at the peak of the K photo-line. Since the zero effect of the GM tube 
gives about 6 kicks/min., it appears possible to use, within this energy range, still 
weaker samples for recording γ-spectra. 

The conclusion drawn from figures 5 and 6 is that there is only one γ-line, and, 
accordingly, that the γ-line at about 260 keV, earlier reported by several authors, 
must be regarded as non-existent. The present investigation is not concerned with 
the very low energy range at about 70 keV, in which a γ-line has earlier been reported. 
As pointed out earlier this was considered to be X-radiation from mercury, being 
the result of internal conversion from the γ-energy at 401 keV. Feather & Dainty 
(1944), however, have made a closer study of the absorption process in this energy 
range and seem to have succeeded in proving the presence of a nuclear γ-line, the 
energy of which is somewhat lower than that of the X-ray, i.e. about 66 keV. If 
the existence of a nuclear γ-radiation of 65 keV is accepted, there are two possibilities 
for its emission. It may be emitted after the whole β-spectrum or it may be 




responsible for a complexity in the β-spectrum, the latter being composed of two 
components differing by 66 keV in energy. In the first case the two γ-lines must 
be of equal intensity. Very carefully performed coincidence experiments might 
give some further information on this point. 


References 


Clark, A. 1942 Phys. Rev. 61, 242. 
Cork, J. M. & Halpern, J. 1940 Phys. Rev. 58, 201a. 
Cork, J. M. & Thornton, R. L. 1937 Phys. Rev. 51, 59. 
Feather, N. 1938 Proc. Camb. Phil. Soc. 34, 599. 
Feather, N. & Dainty, J. 1944 Proc. Camb. Phil. Soc. 40, 57. 
Fermi, E., Amaldi, E., d'Agostino, O., Rasetti, F. & Segrè, E. 1934 Proc. Roy. Soc. A, 146, 483. 
Krishnan, R. S. 1941 Proc. Camb. Phil. Soc. 37, 186. 
Lawson, J. L. & Cork, J. M. 1940 Phys. Rev. 58, 580. 
MacMillan, E., Kamen, M. & Ruben, S. 1937 Phys. Rev. 52, 375. 
Norling, F. 1941 Ark. Mat. Astr. Fys. A, 27, no. 27. 
Plesset, E. H. 1942 Phys. Rev. 62, 181. 
Richardson, J. 1939 Phys. Rev. 55, 609. 
Roberts, A., Downing, J. R. & Deutsch, M. 1941 Phys. Rev. 60, 544. 
Siegbahn, K. 1946 Phil. Mag. 37, 162. 
Sizoo, G. J. & Ejkman, G. 1936 Physica, Eindhoven, 6, 332. 


Mixed boundary conditions in the relaxational treatment of 
biharmonic problems (plane strain or stress) 

By L. Fox, D.Phil. 

(Communicated by R. V. Southwell, F.R.S. Received 6 July 1946) 


Relaxation methods have already been applied to the solution of four problems of (i) extension 
and (ii) flexure of flat elastic plates, in which (a) displacement or (b) traction is specified 
at the boundary. Here the method is adapted to the case in which the two types of boundary 
condition are mixed, where photo-elastic methods are difficult to apply. 

Two examples are treated by relaxation methods, and the results obtained indicate that 
this method may be a valuable alternative in engineering problems. 


Introduction 

1. Part VII A of the series 'Relaxation methods applied to engineering problems' 
(Fox & Southwell 1941) was concerned with the solution of problems involving 
flexure or extension of flat elastic plates, for which the governing equations are 




biharmonic. Four cases were considered, differing in respect of the boundary 
conditions, namely: 

1 (a). Problems of extension in which the boundary conditions specify displacements 
u, v in two perpendicular directions. 

1 (b). Problems of flexure in which the boundary conditions specify the boundary 
tractions. 

2 (a). Problems of extension in which the boundary tractions are specified. 

2 (b). Problems of flexure in which the boundary displacements are specified. 

The governing equations for Case 1 (a) are (in plane stress) 


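$$
\left.
\begin{aligned}
\frac{\partial}{\partial x}\left(\frac{\partial u}{\partial x}+\frac{\partial v}{\partial y}\right)+\frac{1-\sigma}{1+\sigma}\left[\nabla^2 u+X\right]&=0,\\
\frac{\partial}{\partial y}\left(\frac{\partial u}{\partial x}+\frac{\partial v}{\partial y}\right)+\frac{1-\sigma}{1+\sigma}\left[\nabla^2 v+Y\right]&=0,
\end{aligned}
\right\}
\tag{1}
$$

where X, Y are body forces. In Case 2 (b) a single equation 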

$$\nabla^4 w = Z \tag{2}$$

relates the transverse displacement w with the intensity Z of applied transverse 
loading. It was shown in the paper cited that problem 1 (b) can be formulated in 
equations of the type (1), and that a similar analogy holds between problems 
2 (a), 2 (b). 

2. When the boundary conditions are 'mixed' (displacements specified at some 
points, tractions at others), the problem is much more difficult both for analysis 
and for (photo-elastic) experiment. A technique for solving problems of this kind 
would accordingly have value, and an accuracy of, say, 5 % would be sufficient 
for most engineering purposes. The boundary conditions, where they specify 
tractions, present difficulties, especially at a curved boundary; but recent developments 
of the relaxation method have gone some way towards overcoming these 
difficulties, though at some cost in accuracy compared with what is attainable when 
the boundaries are rectangular. 

Here, two problems of the kind are solved: the first, an example treated by 
Timoshenko (1934, § 43) in which the boundary is a square; the second, an example 
having a circular boundary, of which the analytical solution is known. Satisfactory 
results are obtained in both. 


Description of method 


3. In the absence of body forces the governing equations are 


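$$
\left.
\begin{aligned}
\frac{\partial}{\partial x}\left(\frac{\partial u}{\partial x}+\frac{\partial v}{\partial y}\right)+\frac{1-\sigma}{1+\sigma}\nabla^2 u&=0,\\
\frac{\partial}{\partial y}\left(\frac{\partial u}{\partial x}+\frac{\partial v}{\partial y}\right)+\frac{1-\sigma}{1+\sigma}\nabla^2 v&=0.
\end{aligned}
\right\}
\tag{3}
$$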




On the boundary, u and v are given at some points; the boundary traction at others, 
i.e. specified values of 

$$
\left.
\begin{aligned}
X_\nu &= X_x\cos(x,\nu)+X_y\sin(x,\nu),\\
Y_\nu &= X_y\cos(x,\nu)+Y_y\sin(x,\nu).
\end{aligned}
\right\}
\tag{4}
$$

Expressing these in terms of u, v by means of the stress-strain relations gives 
equations (5), which serve as boundary conditions for points where $X_\nu$, $Y_\nu$ are specified. 

Together with the governing equations (3), the boundary conditions (thus 
expressed in u and v) define the problem completely. They could be used for problems 
of class 2 (a), in which boundary traction is specified everywhere; but the method 
of Part VIIA is usually simpler and accordingly more accurate. 







Figure 1 


As an example we take the problem considered by Timoshenko: a square plate 
of side 2b is stretched by forces acting on two opposite ends, the forces being 
distributed according to a parabolic law (figure 1). Here the boundary conditions 
(5) are simple: we have 

$$
\begin{aligned}
(x,\nu) &= \tfrac{1}{2}\pi \ \text{on } PQ,\quad 0 \ \text{on } QS,\\
X_\nu &= (1-y^2/b^2) \ \text{on } QS,\quad 0 \ \text{on } PQ,\\
Y_\nu &= 0 \ \text{on all boundaries.}
\end{aligned}
$$







Figure 2 


(6) 


(7) 


( 8 ) 


When 0 is on a boundary, say PQ, the points 2, 5 and 6, lying outside the field, are 
'fictitious' and may be eliminated from (8) by means of the boundary conditions (7). 
At 0, these are 

$$
\left.
\begin{aligned}
v_1-v_3+u_2-u_4&=0,\\
v_2-v_4+\sigma(u_1-u_3)&=0,
\end{aligned}
\right\}
$$

giving $u_2$, $v_2$ in terms of points on and internal to the boundary. Application of the 
boundary conditions at points 1 and 3 gives u and v at 5 and 6 respectively. The 
results of elimination are 

{2-cr( 1 + cr)}{u^-^u^) +2(1(I-cr) (v^^ - v^) ~ Uq {&- 2cr{2 + cr)} = 0,1 

\ ( 9 ) 

- 2cr{V;^ + Vq) + 4:V^ - 2cr(% ~ - 4( 1 - cr) = 0, J 

and similar equations hold for points on vertical boundaries. We are thus left with 
two equations for two unknowns, u and v, at every nodal point, including boundary 
points: and a solution can be obtained by relaxation in the usual way. 
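
The way in which two equations for u and v at every nodal point lend themselves to relaxation can be sketched in code. The sketch below is not the paper's equation (8) or its hand procedure: it assumes ordinary central differences for equations (3) on a square net, prescribes displacements on the whole boundary (the simplest case, without fictitious points), and replaces the selective liquidation of residuals by systematic Gauss-Seidel sweeps. The function names, the value σ = 0.3 and the test field are all assumptions made for illustration.

```python
def relax_plane_stress(u, v, sigma=0.3, sweeps=2000):
    """Gauss-Seidel sweeps for equations (3) on a square net of mesh-length a.

    u[i][j], v[i][j] are displacements at x = i*a, y = j*a. Values on the
    outermost rows and columns are treated as prescribed and left unchanged.
    """
    k = (1.0 - sigma) / (1.0 + sigma)
    n = len(u)
    diag = 2.0 * (1.0 + k) + 2.0 * k
    for _ in range(sweeps):
        for i in range(1, n - 1):
            for j in range(1, n - 1):
                # central-difference estimate of a^2 * d2v/dxdy
                v_xy = (v[i + 1][j + 1] - v[i - 1][j + 1]
                        - v[i + 1][j - 1] + v[i - 1][j - 1]) / 4.0
                u[i][j] = ((1.0 + k) * (u[i + 1][j] + u[i - 1][j])
                           + k * (u[i][j + 1] + u[i][j - 1]) + v_xy) / diag
                # central-difference estimate of a^2 * d2u/dxdy
                u_xy = (u[i + 1][j + 1] - u[i - 1][j + 1]
                        - u[i + 1][j - 1] + u[i - 1][j - 1]) / 4.0
                v[i][j] = (k * (v[i + 1][j] + v[i - 1][j])
                           + (1.0 + k) * (v[i][j + 1] + v[i][j - 1]) + u_xy) / diag
    return u, v


def exact(i, j):
    """A uniform-strain field, which satisfies equations (3) exactly."""
    return 0.1 * i + 0.2 * j, -0.05 * i + 0.3 * j


n = 9  # eight mesh-lengths along each side
u = [[0.0] * n for _ in range(n)]
v = [[0.0] * n for _ in range(n)]
for i in range(n):
    for j in range(n):
        if i in (0, n - 1) or j in (0, n - 1):
            u[i][j], v[i][j] = exact(i, j)

relax_plane_stress(u, v)
max_error = max(
    max(abs(u[i][j] - exact(i, j)[0]), abs(v[i][j] - exact(i, j)[1]))
    for i in range(n) for j in range(n)
)
print(f"largest departure from the uniform-strain field: {max_error:.2e}")
```

A uniform-strain field is used as the check because it satisfies the finite-difference equations exactly, so any remaining error measures only how far the relaxation has converged. Traction boundaries would need the fictitious-point elimination described in the text.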




One-eighth of the side of the square was taken as the mesh-length, and u and v 
deduced at nodal points. By symmetry, only one-quarter of the plate required 
consideration, the conditions on Ox being v = 0, ∂u/∂y = 0, and on Oy, u = 0, 
∂v/∂x = 0. The stress was computed along Oy, and its distribution is given in 
curve III of figure 3. 

Timoshenko used a strain-energy method, and obtained two successive approximations 
to this stress-distribution, the better one of which is shown in curve II of 
figure 3. Curve I shows the magnitude of the parabolic loading. Even with this 
coarse net, the agreement with Timoshenko's result is close. 


4. When the boundaries are curved, the practical difficulties are greater, and 
simplifying assumptions have to be made. The problem considered (figure 4) was 
a circular plate of radius 6, with a concentric hole of radius 2. Equal radial displacements 
are imposed round the hole, and no traction acts on the outer boundary. 

By symmetry, only one-eighth of the plate requires consideration. The governing 
equations are equations (3), and the boundary conditions are: 


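$$
\begin{aligned}
&\text{on } AB, && u=\cos\theta,\quad v=\sin\theta,\\
&\text{on } CD, && 2\left\{\frac{\partial u}{\partial x}+\sigma\frac{\partial v}{\partial y}\right\}\cos\theta+(1-\sigma)\left\{\frac{\partial v}{\partial x}+\frac{\partial u}{\partial y}\right\}\sin\theta=0,\\
& && 2\left\{\frac{\partial v}{\partial y}+\sigma\frac{\partial u}{\partial x}\right\}\sin\theta+(1-\sigma)\left\{\frac{\partial v}{\partial x}+\frac{\partial u}{\partial y}\right\}\cos\theta=0,\\
&\text{on } BD, && v=0,\quad \frac{\partial u}{\partial y}=0,\\
&\text{on } AC, && u=v.
\end{aligned}
\tag{10}
$$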


Figure 6 shows the first net investigated. Internal points whose finite-difference 
formulae involve external points are at A, B, C, D; points 1, 2, 3, 4 are fictitious, 
and P and Q lie on the boundary. Elimination of these fictitious points calls for some 
rather tedious computation, and approximations are used to reduce the labour without 
affecting the accuracy too greatly. No fixed rule can be stated, since the proper 
approximations depend largely on the position of the boundary with respect to 
the mesh lines. The following approximations were used in figure 6: 

Regarding P as a fictitious point, then 

2a(|^) = Zup-~4=Uj^+UE, 

Similar formulae for v derivatives and subsequent substitution in (10), give Up, Vp 
in terms of internal points. The boundary conditions at L are satisfied by 


d 

“a 

^ =^Us-Us, 2a(| 

s) ‘HI 

u\ 

-I 

\d. 


yjL 

yjs 




with similar formulae for v derivatives. Substitution in (10) gives Ug, Vg in terms of 
internal points. 

At M we have 


/a 

- ===Ui-Uc, 

2a(| 

;) 

& 

1 

11 

\3 



I//m \o: 

y/4 


and at Q, 






Figure 3 

Figure 4 

Figure 5 







with similar formulae for v derivatives. Substitution in (10) gives v^, Uq, but the 
values at points 2 and 1 are still left undetermined, and for them we assume a parabolic 
continuation of u and v outside the field. The requisite type of formula is 

^ ■“ S'Ujp “i" . 

Substitution of these fictitious values in the governing equation (8) gives two 
simultaneous equations at every internal point. (Points P and Q could have been 
regarded as internal points; but for them the finite-difference formulae involve 
points which are too far outside the field to permit accurate or easy elimination.) 



Figure 7 




The resulting equations are complicated, but many of the coefficients are small 
enough to be neglected in relaxation, though they must of course be included in the 
calculation of residuals. 

The results of relaxation on this net are given in figure 6, with analytical values, 
for comparison, to the right of the nodal points. 

5. A finer net was investigated, with results which are given in figure 7; their 
closeness to the analytical solution, especially near the outer boundary, is very 
satisfactory. Near the inner boundary, on which u and v are given, the accuracy is 
not so good. The method adopted near this boundary was that of Part VII A, that 
is, u and v were regarded as continuing linearly across the boundary. The accuracy 
would probably be improved by taking u, v to be parabolic across the boundary. 
In the general case in which a central point 0 is surrounded by eight points (figure 8) 
whose distances from 0 are $a_r$ (r = 1 to 8), the finite-difference equivalents of equations 
(3) become 


Oj+aj 


4 




I 2(1-(t) ^ 1+0- K , '»7\M 

Vl O3/ "a + \“2 “J “6 + «7 \»5 »7/ “«+ «81«6 “s/ 

Kffls «2«4 i \«6«8 «5«7/ 

fa , I 2(1-(r) M ^ «8\ ^ l-fcr /us ^ l-fcr /u^ ^ ttgX 

"(oja, OiUj j “Vgag aga,/ 



Figure 8 


The equations for points near the curved boundary are then obtained by putting 
$a_r$ equal to the standard mesh length a or a√2 for those points lying within the 
boundary, and to the measured distances for those points lying on the boundary, 
where the values of u and v are specified. 

These formulae, though lengthy, are not more cumbersome than those already 
used at the outer boundary, and might have been applied with advantage; but this 
was not done, because the purpose of the investigation was to see how well the 
boundary conditions (10) could be satisfied and used in the relaxation process. The 
accuracy attainable would seem to be sufficient for all practical purposes. 

Grateful acknowledgement is made to Dr J. R. Green, for collaboration in the 
devising of the method; to Mr D. N. de G. Allen, for the original suggestion, and to 
Miss G. Vaisey, for the computation of the second problem; to Dr R. V. Southwell, 
for overall supervision. 


References 

Fox, L. & Southwell, R. V. 1941 Phil. Trans. A, 239, 419. 
Timoshenko, S. 1934 Theory of elasticity. New York: McGraw-Hill Co. 


Vertical transport of heat by turbulence in the atmosphere 
By C. H. B. Priestley and W. C. Swinbank 
(Communicated by E. Gold, F.R.S. Received 21 June 1946. Read 27 February 1947) 


A necessary consequence of the classical theory of the turbulent transfer of heat in the 
atmosphere is that the flux of heat is in the direction from high to low potential temperature, 
and this normally involves the flux being from low to high actual temperature. On examination 
this is shown to be consistent with the second law of thermodynamics. 

However, the theory assumes that an eddy may be regarded as a normal sample of the 
population at its level of origin; but when account is taken of buoyancy effects in a necessarily 
heterogeneous atmosphere it is clear that this assumption is incorrect. It also becomes 
apparent in the discussion that a definition of the concept 'level of origin' of an eddy more 
rigid than that given in the classical theory is necessary, and this has been given. It is 
shown that a component of heat flux, here called 'convective turbulence', must result from 
the buoyancy forces. This is always directed upwards and is thus opposed to the general 
downward transfer of mechanical turbulence. 

Records of the fluctuations in temperature at fixed levels show amplitudes which cannot be 
explained by mechanical turbulence. The traces further indicate that convective turbulence 
may at times completely dominate the direction of the flow of heat, a conclusion which is 
supported by reference to some synoptic examples. This result helps to explain the normal 
increase of potential temperature with height. 

Dimensional considerations provide a rough measure of the dependence of the buoyancy 
flux on the lapse rate and amplitude of fluctuations, and also allow the total flux to be written 
in the classical form K ∂θ/∂z, where K, which may be positive or negative, will depend 
upon the synoptic situation. Some of the implications in synoptic and general theoretical 
meteorology are discussed in detail. The turbulent transfer of physical quantities other 
than heat is briefly considered. 


1. Introduction 

The vertical transport of heat in the atmosphere is mainly effected by the following 
mechanisms: eddy or turbulent diffusion, the mass movement of air in subsidence, 
convexion and upslope motion at fronts, radiation and the transfer of latent heat of 
vaporization of water by the vertical interchange of water vapour. Transfer of heat 
by molecular diffusion can be regarded as negligible. This paper will be concerned 
primarily with the first mechanism, and the discussion will be restricted as far 
as possible to occasions when mass transfer of air and latent heat effects are not 
operative. 

The chief agent in the interchange of radiant energy in the lower atmosphere is 
water vapour, the activity of which is all in the infra-red part of the spectrum. This 
interchange has been discussed by Brunt (1929), who concludes that the radiative 
transfer of heat in the lower atmosphere, while enormously greater than molecular 
transfer, can in general be neglected in comparison with turbulent transfer. Brunt’s 
work was based on Hettner’s values of the coefficients of absorption. Since recent 
work has shown that these coefficients are too high, it appears that water vapour 
should play a yet smaller part in its effect on the redistribution of temperature than 
was previously supposed. 

Recent work in America on the water-vapour spectrum has been incorporated in 
an atmospheric radiation chart designed by Elsasser (1942). Work with this chart 
has shown that, at levels in the atmosphere away from the surface, radiative 
transfer is quite negligible compared with accepted values for the flux of heat by 
eddy diffusion. This conclusion is confirmed by Elsasser himself and Ashburn (1940). 

The effect of radiation is to carry heat from high to low temperature and, as will 
be shown later, is thus in general opposed to the direction of transfer by eddy diffusion. 
But the evidence quoted above shows that, over short intervals at least, 
radiative transfer is completely outweighed by eddy transfer. 


2. The classical theory of eddy diffusion 

The vertical transport of heat by eddies in the atmosphere has been discussed 
by Taylor (1915) in a treatment later modified by Brunt (1929, 1939). They considered 
the transfer of heat by eddies across a horizontal surface in the atmosphere 
at height $z$ (strictly an isobaric surface should be considered, but the approximation 
is amply justifiable). The pressure, environmental temperature and density at this 
level are denoted by $p$, $T_0(z)$ and $\rho_0(z)$ respectively. 

An eddy which crosses this level with upward vertical velocity $w$ is regarded as 
having a density $\rho$ and temperature 

$$T_0(z) + T',$$ 

where $T'$ may be termed the ‘temperature anomaly’. The upward flux of heat across 
the surface per unit area per unit time is then 

$$c_v\left(\overline{\rho w}\,T_0 + \overline{\rho w T'}\right),$$ 

$c_v$ being the specific heat of air at constant volume. The bars denote mean values 
over the area. Since there can be no net transfer of mass across an isobaric surface, 
the average value of $\rho w$ is zero when the area and the time considered are both 
sufficiently great, and the flux of heat reduces to 

$$c_v\,\overline{\rho w T'}.$$ 

In this expression the variation of density between eddies crossing the datum level 
has been allowed for. But whereas $w$ and $T'$ assume both positive and negative values, 
$\rho$ is an essentially positive quantity which in practice varies by only a small percentage 
of its mean value. The existence of an appreciable flux depends ultimately 
on the existence of a correlation between $w$ and $T'$, and with sufficient accuracy 
the flux may be represented as 

$$\rho_0 c_v\,\overline{wT'}. \qquad (1)$$ 

Professor Taylor and Professor Cowling have pointed out, in correspondence, 
that this is not the only flux to be considered. There is also a mean rate of working 
$p\bar{w}$ when there exists a mean velocity $\bar{w}$ through the pressure surface. Writing $\rho'$ 
for the ‘density anomaly’ defined in the same way as $T'$, then in a constant-pressure 
surface 

$$\frac{\rho'}{\rho_0} + \frac{T'}{T_0} = 0.$$ 

Now 

$$\overline{\rho w} = \rho_0\bar{w} + \overline{\rho' w} = 0,$$ 

so that 

$$\bar{w} = -\frac{\overline{\rho' w}}{\rho_0} = \frac{\overline{wT'}}{T_0},$$ 

and the flux of mechanical energy is 

$$p\bar{w} = \frac{p}{T_0}\,\overline{wT'} = \rho_0 (c_p - c_v)\,\overline{wT'}$$ 

($c_p$ is the specific heat of air at constant pressure). When the two fluxes are combined, 
the total flux of energy is accordingly 

$$\rho_0 c_p\,\overline{wT'}. \qquad (2)$$ 

The classical theory regards an eddy as having originated at a depth $l$ below the 
surface, where the temperature is $T_0 - l\,\frac{\partial T}{\partial z}$, $-\frac{\partial T}{\partial z}$ being the lapse rate of temperature, 
assumed constant in the layer $z$ to $z-l$. The eddy is assumed to be an average 
sample of the air at its level of origin, and to move adiabatically during its transfer 
to the level $z$. On these assumptions the temperature anomaly $T'$ is equal to 
$-l\left(\frac{\partial T}{\partial z}+\Gamma\right)$, where $\Gamma$ is the dry-adiabatic lapse rate of temperature, and the 
upward flux of energy across the surface is then 

$$-\rho_0 c_p\,\overline{wl}\left(\frac{\partial T}{\partial z}+\Gamma\right).$$ 

The product $wl$ is supposed essentially positive for both upward and downward 
moving eddies, since $w$ and $l$ have the same sign for a given eddy. The mean value 
of this product $\overline{wl}$ is denoted by $K$, called the coefficient of eddy diffusivity. The 
flux of energy, written as 

$$F = -\rho_0 c_p K\left(\frac{\partial T}{\partial z}+\Gamma\right), \qquad (3)$$ 

is thus necessarily from high towards low values of potential temperature, since, 
if $\theta$ = potential temperature, 

$$\frac{1}{\theta}\frac{\partial \theta}{\partial z} = \frac{1}{T}\left(\frac{\partial T}{\partial z}+\Gamma\right).$$ 

3. Some aspects of the classical theory 

There are two aspects of the effects of the turbulent transfer of heat that can be 
examined in the light of recognized principles of thermodynamics and mechanics. 

Equation (3) shows that the transfer of heat by eddy diffusion is from regions of 
high to regions of low potential temperature. A characteristic feature of upper-air 
temperature soundings is the universal decrease of temperature with height throughout 
the troposphere (except occasionally in the layers near the ground), and the 
stability of the air with respect to the dry-adiabatic lapse rate. It is readily seen 
that this latter effect implies increase of dry-bulb potential temperature with 
height. Thus the general effect of eddy diffusion throughout the troposphere, in the 
absence of evaporation and condensation processes, is to transfer heat downwards 
from regions of low to regions of higher temperature. This may at first sight appear 
to be a contravention of the well-established Second Law of Thermodynamics. The 
apparent difficulty can be resolved when it is realized that turbulent transfer in the 
free atmosphere derives from the existence of vertical wind shear; and wind shear is 
an effect of horizontal temperature gradients which are ultimately derived from 
heat which has been transferred upwards from regions of high to regions of low 
temperature. Thus the downward transfer of heat by eddy diffusion against the 
temperature gradient may be regarded as one stage in a process which, viewed as 
a whole, does not conflict with thermodynamical principles. 

There is also an aspect of the effect of turbulent diffusion of heat which appears to 
conflict with accepted principles of mechanics, but which again can be resolved in 
terms of the wind shear. If one considers a layer of the atmosphere wherein the lapse 
rate of temperature is less than the dry-adiabatic, and excludes cases of condensation 
and evaporation, then the ultimate condition of the layer as a result of eddy transfer 
of heat will be one of potential temperature uniform with height. The gravitational 
potential energy of the layer in this state is greater than in the initial state, and this 
would appear to conflict with the minimum potential energy principle of mechanics. 
The work necessary to effect this increase in gravitational potential is done by the 
wind shearing stresses, and when a vertical wind shear exists it can be regarded as 
a form of potential energy. As shown by Petterssen & Swinbank (unpublished), the 
wind shear across a layer of uniform potential temperature is very small, and so 
the work done by the shearing stresses to increase the potential energy of the layer 
reduces the wind shear potential energy. Thus when all forms of potential energy are 
considered, the consequence of eddy transfer of heat is not necessarily an increase 
of total potential energy. Indeed Richardson (1920) has used as a principle that the 
work against gravitational forces in turbulent mixing is done by the shearing 
stresses. 

On the other hand, it remains a problem to explain the almost universal increase 
of potential temperature with height in the free atmosphere. The classical theory of 
eddy transfer presents a powerful mechanism which constantly strives to destroy 
this increase. The theoretical determination of the time necessary to approach such 
a state (it would clearly take infinite time to achieve it completely) is difficult, but 
rough approximation shows that layers of a few thousand feet in depth, under the 
undisturbed influence of turbulent mixing, should approach the adiabatic state in 
a time of the order of magnitude of hours, assuming accepted values of $K$ in the free 
atmosphere of order $6 \times 10^{4}$ c.g.s. (see Brunt 1939, p. 256 or Haurwitz 1941, p. 212). 
Occasions when large-scale vertical motion, and thereby condensation effects, can 
be excluded from consideration for considerable periods are fairly frequent. Thus a 
lapse rate approaching the adiabatic through appreciable depths of the troposphere 
should be a phenomenon of fairly frequent observation. But a feature of upper-air 
temperature soundings is the singular lack of such distributions of temperature 
away from the surface layers. 

Certain other criticisms of the classical theory may be made on logical grounds. 
Equation (2) shows that a flux of energy can only result when the temperature at 
the level considered is subject to the fluctuations represented by $T'$. Thus when 
a flux of energy through the atmosphere is operative, the temperature at any level 
must fluctuate, both in time and in the horizontal plane. In spite of this necessary 
variation the classical theory, though purporting to assume that an eddy starts as 
a ‘normal’ sample of its environment, in fact regards it as an ‘average’ sample. 
‘Normal’ implies randomness of selection at the level of origin so that a normal 
sample would be subject to the fluctuations appropriate to that level. These fluctuations 
are not allowed for in the construction of the equation of flux, and the consequences 
of this neglect will be considered in the next section. 

A further difficulty lies in the use of the word ‘originate’. Taylor (1915) describes 
an eddy as breaking away from the surrounding air, thereby implying that at 
the time when it can be considered a normal sample of its surroundings it is also 
(vertically) at rest relative to them. This assumption was later discarded when the 
possibility of different quantities having different diffusion coefficients was discussed 
in greater detail. If, however, it be admitted that an eddy is typical of its environment 
at different levels according to the property under consideration, then there 
is no reason to suppose that it is vertically at rest at any of these levels. It therefore 
does not follow that $w$, the upward component of vertical velocity, and $l$, the vertical 
distance travelled upwards since the eddy was last a normal specimen of its 
surroundings, are necessarily of the same sign. It is quite possible for an eddy to maintain 
its identity and to traverse the datum level successively in opposite directions 
without, at intermediate times, going through stages where it is typical of its 
surroundings with regard to each transferable property in turn. In fact there is 
no reason why it should not do so according to any theory which attributes no 
special significance to the element being instantaneously at rest in the vertical. 


4. Modification of the theory 

A modified formulation of the theory of flux will now be given in an attempt to 
overcome the logical difficulties of the previous section, by adhering to the eddy 
mixing length method of approach while giving to the concepts ‘level of origin’ 
and ‘normal sample of surroundings ’ a more exact physical definition than has been 
given hitherto. 

Proceeding from expression (2), consider the eddy as having last been at rest (in 
the vertical) at a level $l$ below the datum level $z$. It follows immediately from this 
definition, by geometrical requirement independent of physical arguments, that $w$ 
and $l$ are of the same sign for every eddy, whatever the nature of the turbulent 
motion. When last at rest the eddy may be supposed to have had an excess temperature 
$T''$ over the average value $T_0(z-l)$ of its surroundings. The lifetime of an 
eddy, as in the classical theory, is assumed short in comparison with the time scale 
of any net average rise or fall of temperature at any level. Then, if the eddy moves 
adiabatically through the distance $l$, 

$$T_0(z) + T' = T_0(z-l) + T'' - \Gamma l,$$ 

and, assuming $\partial T_0/\partial z$ to be constant over the range of the mixing length, 

$$T' = T'' - l\left(\frac{\partial T_0}{\partial z}+\Gamma\right). \qquad (4)$$ 

It should be noted that $T'$ is greater than, equal to, or less than $T''$ according as the 
lapse rate is respectively greater than, equal to, or less than $\Gamma$. 

Substituting from (4) in (2), the flux of energy across the datum surface is seen 
to be 

$$F = \rho_0 c_p\left[-\overline{wl}\left(\frac{\partial T_0}{\partial z}+\Gamma\right) + \overline{wT''}\right],$$ 

which may be written 

$$F = \rho_0 c_p\left[-K\left(\frac{\partial T_0}{\partial z}+\Gamma\right) + \overline{wT''}\right], \qquad (5)$$ 

where $K$ is now an essentially positive quantity. 
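The separation of the flux in equation (5) into two terms can be sketched numerically. In the sketch below the function `flux_terms`, the list `sample` of hypothetical eddies, and the values of density, specific heat and lapse rate are all assumptions chosen only for illustration; nothing here is measured data.

```python
# Sketch of equation (5): F = rho0*cp*[ -K*(dT0/dz + Gamma) + mean(w*T'') ],
# with K = mean(w*l). The eddy sample below is hypothetical.

GAMMA = 9.8e-5  # dry-adiabatic lapse rate, deg C per cm (assumed round value)


def flux_terms(eddies, dT0_dz, rho0=1.2e-3, cp=0.24, gamma=GAMMA):
    """Return the (mechanical, convective) parts of the flux in equation (5).

    eddies is a list of (w, l, t_excess) tuples: vertical velocity in cm/sec,
    mixing length in cm (same sign as w), and excess temperature in deg C
    when the eddy was last at rest.
    """
    n = len(eddies)
    K = sum(w * l for w, l, _ in eddies) / n            # mean of w*l
    w_t_excess = sum(w * t for w, _, t in eddies) / n   # mean of w*T''
    mechanical = -rho0 * cp * K * (dT0_dz + gamma)
    convective = rho0 * cp * w_t_excess
    return mechanical, convective


# Hypothetical eddies in which the warmer ones tend to rise.
sample = [(50.0, 100.0, 0.2), (-50.0, -100.0, -0.2),
          (30.0, 60.0, 0.1), (-30.0, -60.0, -0.1)]
mechanical, convective = flux_terms(sample, dT0_dz=-6.0e-5)  # stable lapse rate

print(f"mechanical term: {mechanical:+.2e} cal/cm^2/sec")
print(f"convective term: {convective:+.2e} cal/cm^2/sec")
print(f"total flux:      {mechanical + convective:+.2e} cal/cm^2/sec")
```

For this assumed sample the mechanical term is directed downward in the stable layer while the convective term is directed upward, so the sign of the total depends on the relative size of the two.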

The first term in (5) represents a transfer of heat from regions of high to regions of 
low potential temperature and the rate of transfer is proportional to the gradient 
of $T+\Gamma z$. This is essentially the term derived by the earlier theories, though if $K$ 
is to be analyzed by means of its components a somewhat different interpretation 
than heretofore is to be given to the definition of $l$. 

The second term is the result of the additional consideration that eddies starting 
from any level may have temperatures differing from the average value of their 
surroundings. In fact it represents a discrimination in the direction of motion of 
individual eddies, as opposed to the purely indiscriminate mixing given by the first 
term. It must be emphasized that it refers to the correlation between the upward 
velocity of the eddy as it crosses the datum level $z$ and the excess temperature of 
the eddy when it was last at rest. Thus if there is any overall tendency, however slight, 
for the warmer eddies to start to rise, the sign of the second term becomes essentially 
positive and results in a systematic upward flux of heat, irrespective of the sign 
of $\partial\theta/\partial z$. 

Having separated the flux into two terms in (5) it will be convenient to refer to 
them by distinctive names, and the terms ‘mechanical turbulence’ and ‘convective 
turbulence’ are suggested as appropriate, the word turbulence being retained since 
even in the convective form the motion is heterogeneous and the behaviour of 
individual particles cannot be specified. The second term in (5) will also be referred 
to as the ‘buoyancy term’. 

Taylor (unpublished) has recognized that convective forces and dynamical forces 
result in different laws for the heat flux. His treatment, which has been further 
applied by Sutton (unpublished), assumes a flux proportional to $\frac{\partial T}{\partial z}+\Gamma$ through 
a $K$ which may take alternative analytical forms, but the existence of the term called 
the buoyancy term has not been recognized. After this paper was written it was 
pointed out that Ertel (1942) has given an equation similar to (5) above, but his 
main deductions and conclusions differ fundamentally from those here presented. 

Since the buoyancy term $\overline{wT''}$ is not a measurable quantity, the importance of 
the modification to the theory can only be inferred. This may be done directly from 
special observations made in investigations of turbulence, or indirectly from 
synoptic data. The synoptic approach is essentially the more tentative, and therefore 
the less conclusive: but since it is the more important to the meteorologist, and has 
received unduly little attention in previous publications on this subject, it will be 
given first. 

[Note. In comparing the two theories quantitatively, it will frequently be necessary 
to estimate a value of $K$. Where this has to be done the usual practice is followed 
of considering all features of wind, stability, and height above ground, and assigning 
the most appropriate value by reference to standard works on the subject. Specific 
reference is quoted in most cases. This procedure has the disadvantage that two 
theories are under comparison by means of a criterion which strictly applies only to 
one of them. But there appears to be no alternative method. Since the values quoted 
in the reference works are deduced by other applications of the classical theory, 
the procedure followed here at least represents a test of the consistency and ability 
of the classical theory to explain all the observed phenomena.] 



5. Consideration of some synoptic data 

Appeal will here be made to three distinct types of synoptic data. In each case it 
will be shown that, whereas the classical theory fails to account adequately for 
certain observed facts, the additional consideration of convective turbulence will 
provide a qualitative explanation of the observed discrepancies. 

(i) The problem of explaining the average temperature distribution in the troposphere 
has been referred to in § 3. Over a long period of time the action of convective 
turbulence will bring about an increase of potential temperature with height, and 
the rare occurrence of dry-adiabatic layers away from the surface requires no further 
explanation. This result is vital in cloud and rain forecasting, and it may be that the 
effect is of importance even if the normal fluctuations of temperature at, say, 
10,000 ft. are found on investigation to be only of the order of a fraction of a degree. 
Looking a little further, it is possible that an approximate balance between mechanical 
and convective turbulence at very high levels may account for the ability of 
radiative processes to dominate the heat transfer, and so achieve the isothermal 
state characteristic of the lower stratosphere, whereas mechanical turbulence would 
cause a decrease of temperature with height. 

(ii) In the case where a super-adiabatic lapse rate exists from the ground upwards, 
then in that region where the decrease of potential temperature with height gives 
way to an increase the classical theory would predict an accumulation of heat. If 
it is possible to determine two pressure levels at which the lapse rates throughout 
an interval of time remain respectively greater and less than the dry-adiabatic, it 
is possible to assess the amount of heat flowing into the region bounded by these 
levels. 

Aircraft soundings are sufficiently detailed to allow identification of such levels, 
and for a 6 hr. time interval it is frequently possible to determine two such levels 
not more than 16 mb. apart. The gradient of potential temperature at either boundary 
of this layer is usually about $10^{-4}$ °C/cm. If for the lower level, where the 
lapse rate is super-adiabatic, one assumes (Brunt 1939; Haurwitz 1941) a $K$ of 
$10^{5}$ c.g.s. units, then heat will pass into the layer at a rate of about $2.6 \times 10^{-3}$ 
cal./cm.²/sec., which would cause a rate of increase of temperature of the layer of 
about 3° C/hr. Some inflow of heat through the upper boundary should yield a 
still greater rate of warming. 
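The arithmetic linking a heat flux into a thin layer with its rate of warming can be sketched as follows. The function `layer_heating_rate` is a hypothetical helper; the flux of 2.6e-3 cal/cm²/sec and the 16 mb. thickness are used as assumed inputs, and the specific heat and gravity are standard round values rather than figures quoted in the paper.

```python
# Sketch: rate of warming of a layer of given pressure thickness that absorbs
# a given heat flux through one boundary. Inputs are assumed, in c.g.s. units.

G = 981.0   # acceleration of gravity, cm/sec^2
CP = 0.24   # specific heat of air at constant pressure, cal/g/deg C


def layer_heating_rate(flux, thickness_mb, cp=CP, g=G):
    """Warming rate in deg C per hour for a flux in cal/cm^2/sec."""
    mass_per_cm2 = thickness_mb * 1000.0 / g   # 1 mb = 1000 dyn/cm^2
    return flux / (mass_per_cm2 * cp) * 3600.0


rate = layer_heating_rate(flux=2.6e-3, thickness_mb=16.0)
print(f"warming rate of the layer: {rate:.1f} deg C/hr")
```

Under these assumptions the warming comes out at a few degrees per hour, the same order as the rate quoted in the text; a thinner layer or an additional inflow through the upper boundary would raise it.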

An examination was made of a year’s aircraft soundings from Aldergrove near 
Lough Neagh in Northern Ireland, and all cases noted where a super-adiabatic 
lapse rate was sufficiently marked and deep to be immediately noticeable on a 
temperature entropy diagram. Of the 733 occasions when a super-adiabatic lapse 
rate persisted for 6 hr. or more, over 80 % occurred in the 60 mb. nearest to the 
ground with the minimum potential temperature at some height between 100 and 
1500 ft. The observers (pilots of Coastal Command of the Royal Air Force) are 
instructed to report temperature singularities, especially in the lower layers, and the 
instrumental accuracy is such that it is reasonably sure that the potential temperature 
($\theta$) as computed is less than 1.0° F in error.* 

A selection was then made of those cases in which the point of minimum potential 
temperature did not move by more than 16 mb. in 6 hr. By inspection of the synoptic 
charts, situations in which the temperature change due to horizontal advection was 
significant were eliminated. Of the remaining cases, only 64 % showed an increase 
of temperature in the 6 hr. at the level of minimum potential temperature, while 
36 % showed a decrease. The mean increase of temperature over all cases was 
0.6° F/6 hr., which is a very small fraction of that predicted above. One can only 
deduce either that the assumed value of $K$ (which has been estimated from other 
applications of the same theory) is grossly in error, or that the heat brought upwards 
by turbulence through the unstable layer is capable of escaping through the upper 
boundary. Convective turbulence provides such a mechanism. 

Figures from two situations are shown as examples of the working data: 

date                             1 June 1944     2 June 1944                   5 June 1944 
time                             1730   2330     0630   1130   1730   2324     0536   1130   1812 
surface pressure (mb.)           1004   1008     1009   1009   1009   1010      991    996    998 
pressure (mb.) at point (P) 
  of minimum θ                    965    970      980    975    970    976      965    960    950 
temp. at P (°F)                    48     43       43     49     60     47       45     45     45 
condition immediately below P    S.A.   A.       S.A.   A.     S.A.   S.A.     S.A.   S.A.   S.A. 

A. = adiabatic lapse; S.A. = super-adiabatic lapse, as identifiable on a temperature entropy 
diagram. 

(iii) When the lapse rate is less than dry-adiabatic right down to the earth’s surface, 
then the classical theory predicts a flux which is everywhere downwards. In considering 
the changes of temperature in the lowest layers of air and top inches of soil 
the heat quantities involved are then the eddy flux, radiation from the earth’s 
surface, latent heat of vaporization of water condensed on or evaporated from the 
surface, insolation, radiation from clouds, and the change in sensible heat of the 
layers of ground. The number of factors involved, as is so often the case in meteorological 
processes, is so great that a quantitative consideration is difficult. Nevertheless 
this problem is so urgent and important in meteorology that it has been 
attempted for one type of situation in which many of the above quantities can be 
most accurately assessed. 

The winter situation known as ‘anticyclonic gloom’ may persist for days and is 
characterized by extensive cloud of the inversion type at about 2000 ft., surface 
winds of 5 to 10 m.p.h., and a value of $\frac{\partial T}{\partial z}+\Gamma$ of about $6 \times 10^{-5}$ °C/cm. at 50 ft. This 
latter figure is derived from detailed lower-atmosphere soundings at Cardington, 
and confirms that given by Johnson & Heywood (1938) (table III, overcast days 
in December). The presence of the low cloud enables us to arrive at a reasonably 
sound estimate of the quantities of heat involved in radiation. If the cloud temperature 
exceeds that at the ground, which is sometimes the case in these situations, 
and both are assumed to radiate as black bodies, then water vapour between cloud 
and ground approximately halves the black-body radiation interchange, and at 
ordinary temperatures the ground will gain heat at a rate of about 0.1 cal./cm.²/hr./°C 
temperature difference. This difference does not normally exceed 2° C. The estimation 
of the latent heat quantities is made difficult by the lack of experimental data, 
but information received from Rothamsted Experimental Station (for which we are 
grateful to Dr H. L. Penman) indicates that in these situations evaporation may be 
negligible, or may even be negative, i.e. water is condensed on the earth’s surface. 

* It is the practice in the Meteorological Office to measure temperature in °F. 

The flux of heat through the 50 ft. level may be estimated from the classical theory 
by taking $K$ as $2 \times 10^{4}$ (Best (1935), table XIII by inference; Johnson & Heywood 
(1938), tables XV and XXIII). The amount of heat flowing downwards in 2 days 
is then 36 cal./cm.². The heat capacity of 50 ft. of air can be neglected by comparison 
with that of a few inches of soil, and measurements of changes of soil temperature 
show that nearly all the heat should go to warming the top 4 in. This layer should 
then warm by 6° C. Records of earth temperature in the top 8 in. of soil taken over 
the past 2 years at Dunstable and Cardington show that in such situations the 
change in sensible heat of the top layers of the ground in 2 days is almost negligible. 
Thus, when consideration is given to the other heat quantities involved, it appears 
that in fact the flux of energy due to eddying motion must be a small fraction of the 
above 36 cal., and may even be directed upwards. 
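The step from a quantity of heat delivered to the ground to a rise in soil temperature can be sketched as below. The helpers `accumulated_heat` and `soil_warming` are hypothetical, and the volumetric heat capacity of soil (0.6 cal/cm³/°C) is an assumed round value that is not given in the paper; only the 36 cal./cm.², the 2 days and the 4 in. depth are taken from the text.

```python
# Sketch: heat accumulated by a steady downward flux, and the warming of a
# soil layer that absorbs it. The soil heat capacity is an assumed value.

SECONDS_PER_DAY = 86400.0
CM_PER_INCH = 2.54


def accumulated_heat(flux, days):
    """Heat in cal/cm^2 delivered by a steady flux (cal/cm^2/sec) in 'days' days."""
    return flux * days * SECONDS_PER_DAY


def soil_warming(heat, depth_in, volumetric_heat_capacity):
    """Temperature rise in deg C of a soil layer 'depth_in' inches deep."""
    return heat / (depth_in * CM_PER_INCH * volumetric_heat_capacity)


implied_flux = 36.0 / (2.0 * SECONDS_PER_DAY)   # steady flux giving 36 cal/cm^2 in 2 days
warming = soil_warming(36.0, depth_in=4.0, volumetric_heat_capacity=0.6)

print(f"implied mean flux: {implied_flux:.2e} cal/cm^2/sec")
print(f"warming of top 4 in. of soil: {warming:.1f} deg C")
```

With the assumed heat capacity the top 4 in. would warm by roughly 6° C, which is the size of change that the earth-temperature records fail to show.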

The three types of synoptic data thus strongly indicate that the classical theory 
fails to give a complete picture of the turbulent exchange. In each case the second 
term in equation (5) gives a term which acts in the right sense to account for the 
discrepancy. 


6. Variations of temperature at a fixed level 

A direct assessment of the importance of convective turbulence requires a knowledge 
of the magnitude of $T''$. This is not a measurable quantity, but its order of 
magnitude must be the same as that of $T'$, the variation of temperature at a fixed 
level. Observations of $T'$ have recently been taken at Cardington at a site over 
flat open grassland with good exposure. Temperature traces are obtained by means 
of a continuous record of the e.m.f. from sensitive thermocouples (36 s.w.g. copper, 
40 s.w.g. Eureka). The cold junctions of these instruments are immersed in melting 
ice and the aspirated warm junctions, shielded from radiation, provide a continuous 
record of the air temperature at any required height, to which the instrument 
is hoisted by attachment to a balloon cable. 

Corrections to the recorded amplitude for instruments of this type have been 
studied by Bilham (1935). For the instrument in use an initial difference between 
instrument and air temperature is reduced by 90 % in 6 sec., so that for fluctuations 
of period of the order of 10 sec. about one-half of the true amplitude is recorded. 
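The connexion between the 90 % response in 6 sec. and the recording of about one-half of the true amplitude can be checked with a short sketch. It assumes that the instrument behaves as a simple first-order (exponential) lag and that the fluctuation is sinusoidal; both are assumptions made here for illustration and may differ from the treatment in Bilham (1935). The function names are hypothetical.

```python
import math

# Sketch: amplitude response of a thermometer treated as a first-order lag.
# The first-order model and the sinusoidal fluctuation are assumptions.


def time_constant(fraction_removed, elapsed_sec):
    """Time constant of an exponential lag that removes the given fraction
    of an initial temperature difference in elapsed_sec seconds."""
    return -elapsed_sec / math.log(1.0 - fraction_removed)


def recorded_fraction(period_sec, tau_sec):
    """Ratio of recorded to true amplitude for a sinusoidal fluctuation."""
    omega = 2.0 * math.pi / period_sec
    return 1.0 / math.sqrt(1.0 + (omega * tau_sec) ** 2)


tau = time_constant(0.90, 6.0)         # 90 % of a step removed in 6 sec
ratio = recorded_fraction(10.0, tau)   # fluctuation of period 10 sec

print(f"time constant = {tau:.2f} sec, recorded/true amplitude = {ratio:.2f}")
```

On this model the ratio for a 10 sec. period comes out close to one-half, in agreement with the statement in the text; shorter periods would be attenuated more strongly.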

Figure 1 (a), (b) and (c) shows temperature traces typical of certain synoptic 
situations. Time and temperature scales are indicated. 

[Figure 1: temperature traces (a), (b) and (c), each with its time scale and temperature scale.] 

Figure 1. Variation of temperature recorded by a sensitive thermocouple thermograph at 
Cardington, Beds. (a) 4 ft. 6 in. above ground, 1450 to 1650 hr. G.M.T., 27 June 1945. Air in 
rear of cold front, northerly wind 6 to 10 m.p.h., variable amounts of cumuliform cloud, nine-tenths 
at start, clearing gradually to five-tenths. (b) 3700 ft., 1500 G.M.T. on 3 August 1945; 
clear, hot summer afternoon with little wind. (c) 3000 ft., 1400 G.M.T. on 11 September 1945; 
dull and overcast, with moderate southerly winds. Marked advective cooling of air. 

Before discussing the full implications of these traces it is of interest to see to 
what extent their features can be explained by the classical theory of mechanical 
turbulence. If this mechanism is to control the eddy flux of heat, it must at the same 
time account for the distribution of temperature observed at any level. The upper 
limit to the fluctuation produced by this means is $l\left(\frac{\partial T}{\partial z}+\Gamma\right)$, and since 
ultimate mixing is a gradual rather than a discontinuous process, one would expect 
the fluctuations to be appreciably less than this. 

Another upper limit, this time to the physically possible values of the mixing 
length, is set by the fact that heat is being continually diffused at the boundaries 
of the eddy. It is known that the ‘size-spectrum’ of atmospheric eddies contains 
eddies of all sizes down to very small dimensions. Since we are concerned in this 
section with hot and cold parcels of some tens of feet in diameter it is natural to 
suppose that the heat of such parcels is mainly diffused by eddies of smaller size which 
are, nevertheless, very active diffusing agents as compared with molecular processes. 
Scrase (1930) has shown that a large proportion of eddies at a height of 19 m. have 
periods of the order of 1 sec., and further that such eddies account for a value of $K$ 
(for momentum) of $6 \times 10^{4}$ cm.²/sec. 

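The equation for heat conduction with spherical symmetry may be written 

$$\frac{\partial T}{\partial t} = K\left(\frac{\partial^2 T}{\partial r^2} + \frac{2}{r}\frac{\partial T}{\partial r}\right).$$ 

The solution of this equation which satisfies the initial conditions 
$T = T_0$ for $r < R$, $T = 0$ for $r > R$ 
is given by 

$$\frac{T}{T_0} = \tfrac{1}{2}\left[\operatorname{erf}\alpha + \operatorname{erf}\beta\right] - \frac{1}{r}\sqrt{\frac{Kt}{\pi}}\left[e^{-\alpha^2} - e^{-\beta^2}\right],$$ 

where $\alpha = \dfrac{R-r}{2\sqrt{Kt}}$ and $\beta = \dfrac{R+r}{2\sqrt{Kt}}$. 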

If Scrase’s value of K is adopted as being applicable to this scale of heat diffusivity, 
the solution shows that for an eddy of diameter 20 m. the excess temperature at the 
centre will fall to one-third of its initial value in about | min. Half-way from the 
centre to the boundary the fall to one-third will occupy 6 sec. In practice the region 
outside the eddy boundary, or sink region, is subject to fluctuating temperatures 
which will further accelerate the loss of heat of the eddy. 
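The decay of the excess temperature inside such an eddy can be explored numerically. The sketch below evaluates the standard solution of the spherically symmetric conduction equation for a sphere of radius $R$ initially at uniform excess temperature, and finds by bisection the time for the excess to fall to one-third. The diffusivity of 5e4 cm²/sec is an assumed value chosen only for illustration, and the function names are hypothetical.

```python
import math

# Sketch: excess temperature in a sphere of radius R (initially uniform excess,
# zero outside) diffusing with constant diffusivity K. K below is an assumption.


def excess_fraction(r, t, R, K):
    """Fraction T/T0 of the initial excess remaining at radius r after time t."""
    s = 2.0 * math.sqrt(K * t)
    if r == 0.0:
        u = R / s   # limiting form of the solution at the centre
        return math.erf(u) - (2.0 * u / math.sqrt(math.pi)) * math.exp(-u * u)
    a = (R - r) / s
    b = (R + r) / s
    spread = (s / (2.0 * r * math.sqrt(math.pi))) * (math.exp(-a * a) - math.exp(-b * b))
    return 0.5 * (math.erf(a) + math.erf(b)) - spread


def time_to_fraction(r, R, K, target=1.0 / 3.0):
    """Time in sec for the excess at radius r to fall to 'target' (bisection)."""
    lo, hi = 1.0e-6, 1.0e6
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if excess_fraction(r, mid, R, K) > target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


R = 1000.0   # radius in cm of an eddy 20 m in diameter
K = 5.0e4    # assumed eddy diffusivity for heat, cm^2/sec
t_centre = time_to_fraction(0.0, R, K)
t_half = time_to_fraction(0.5 * R, R, K)
print(f"fall to one-third at centre: {t_centre:.1f} sec; half-way out: {t_half:.1f} sec")
```

For this assumed diffusivity both times are a matter of seconds, which is the order of eddy life considered in the following paragraph; since the solution depends only on the product $Kt$, the times scale inversely with the value of $K$ adopted.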

Since $w$ is of the order of 60 cm./sec., an eddy life of only some seconds will result 
in small values of $l$ in the surface layers. It must be admitted that the magnitudes 
used in this calculation are no more than tentative, and that the small eddies may 
not be operative to the same extent at greater distances from the earth’s surface. 
Nevertheless, it seems highly improbable that an I of some hundreds of metres, 
which would be necessary to cause the observed fluctuations, can occur. 

Returning to the illustrated traces, figure 1 (a) shows the variations of temperature 
at 4 ft. 6 in. above the ground for a period of about 2 hr. (1450 to 1650 G.M.T. on 27 June 
1945). The trace was taken a few hours after the passage of a cold front, with a light 
(6 to 10 m.p.h.) northerly wind and variable amounts of cumuliform cloud clearing 
from nine-tenths shortly after the start of the period. Allowing for Bilham’s correction 
factor we may deduce representative (root mean square) values of $T'$ of 1.5° C 
while the temperature was approximately steady, but 2° C and 0.6° C while the 
temperature was respectively rising and falling rapidly, with 0.5° C during the 
initial overcast period. 

Although detailed information as to the temperature profile in the lowest layers 
was not available at Cardington on this occasion, reference to investigations by 
Johnson & Heywood (1938) and Best (1935) over similar sites permits a good estimate 
of the significant elements to be made. It is a fact of observation from Cardington 
that lapses greater than 4° C between screen level and 350 ft. have not been observed, 
so that the peaks and troughs of the temperature trace could only be due to eddies 
coming from very close to the surface and from several hundreds of feet respectively. 
On the basis of the computations made from Scrase's figures this explanation is 
most unlikely. (Note that the troughs are not noticeably broader than the peaks, 
as would then be the case.) Its improbability may be further demonstrated by 
watching drifting clouds of smoke near the ground, for it may be seen that only 
occasional 'puffs' rise or fall more than a few feet even in very turbulent conditions. 
Furthermore, Best (1935, table XIII) arrives at a value of K of about $10^{3}$ at 4 ft. 6 in. 
on clear days, and with his representative value of w (table XXXIII) of 50 cm./sec. 
appropriate to winds of 6 to 10 m.p.h. one is not justified in assessing the value of l 
at more than about 1 m. Sutton (unpublished) deduces a yet much smaller value 
of K in these conditions. A lapse rate of 50Γ is a generous estimate for this level 
(Johnson & Heywood 1938, table III; Best 1935, table V). One may therefore 
quote 1° C as a very generous upper limit to the total range of fluctuations due to 
mechanical turbulence, ignoring the accelerating effect of the fluctuations in the 
sharing of heat between eddies and their surroundings. The actual range of 4° C 
cannot be explained on the basis of the transport of average eddies from different 
levels. 
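
The order-of-magnitude argument of this paragraph can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the classical relation K = w × l, takes the dry-adiabatic lapse rate as roughly 1° C per 100 m (a standard value, not quoted in the text), and reads the total range as the difference between eddies arriving from a distance l above and below the level of observation. The function names are hypothetical.

```python
# Rough check of the mixing-length estimate at 4 ft. 6 in.
# Assumptions (not from the paper): K = w * l, and a dry-adiabatic
# lapse rate of about 1 degC per 100 m, i.e. 1e-4 degC/cm.

DRY_ADIABATIC_DEGC_PER_CM = 1.0e-4  # assumed standard value


def mixing_length_cm(k_cm2_per_s: float, w_cm_per_s: float) -> float:
    """Mixing length implied by K = w * l."""
    return k_cm2_per_s / w_cm_per_s


def fluctuation_range_degc(l_cm: float, gradient_degc_per_cm: float) -> float:
    """Total range if eddies arrive from +l and -l (hypothetical reading)."""
    return 2.0 * l_cm * gradient_degc_per_cm


l_from_best = mixing_length_cm(1.0e3, 50.0)            # Best's K and w
gradient = 50 * DRY_ADIABATIC_DEGC_PER_CM              # a lapse rate of 50 Gamma
range_upper = fluctuation_range_degc(100.0, gradient)  # generous l of 1 m

print(f"l from K/w: {l_from_best:.0f} cm")
print(f"upper limit to range: {range_upper:.1f} degC (observed: about 4 degC)")
```

With these inputs K/w gives a mixing length well under the 1 m allowed in the text, and even the generous 1 m yields a range of only about 1° C.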

Figure 1 (b), obtained at a height of 3700 ft. about 1500 G.M.T. on 3 August 1945, 
is typical of the fluctuations on a clear hot summer afternoon with little wind. 
Representative values of T' from this trace are 0.5° C and the lapse rate was so close 
to the dry-adiabatic that a mixing length of 1600 ft. would be required to account for 
the fluctuations. Nothing is known of the values of w at this height, but an assumption 
of 100 cm./sec. would lead to an eddy-life of 8 min. and a value of K probably 
well in excess of $10^{6}$, which can hardly be accepted. A smaller value of w, say 
20 cm./sec., would imply an eddy-life of about 40 min., which must again be rejected 
on the basis of earlier calculations. 
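
The two eddy-lives quoted for example 1 (b) follow from dividing the required mixing length by the assumed vertical velocity. The sketch below reproduces them, on the assumptions that the eddy-life is simply l/w and that K = w × l; the helper names are illustrative and not taken from the paper.

```python
# Eddy-life and implied K for a mixing length of 1600 ft.
# Assumptions: eddy-life = l / w and K = w * l (classical mixing-length relations).

FT_TO_CM = 30.48


def eddy_life_min(l_ft: float, w_cm_per_s: float) -> float:
    """Time in minutes for an eddy to travel one mixing length at speed w."""
    return l_ft * FT_TO_CM / w_cm_per_s / 60.0


def eddy_diffusivity_cm2_per_s(l_ft: float, w_cm_per_s: float) -> float:
    """K implied by K = w * l, in cm^2/sec."""
    return w_cm_per_s * l_ft * FT_TO_CM


for w in (100.0, 20.0):
    life = eddy_life_min(1600.0, w)
    k = eddy_diffusivity_cm2_per_s(1600.0, w)
    print(f"w = {w:5.0f} cm/sec: eddy-life = {life:4.1f} min, K = {k:.1e} cm^2/sec")
```

The lives come out at roughly 8 min. and 40 min., as stated, and the faster eddy implies a K of several times $10^{6}$ cm.²/sec.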

7. The buoyancy term 

It appears that mechanical turbulence, in failing to account for the observed 
fluctuations, must be equally inadequate to give a complete representation of the 
flux of heat. It remains to estimate whether the buoyancy term, i.e. the second term 
on the right-hand side of equation (6), is of significant magnitude compared with the 
first term. This estimation merely involves a rearrangement of the arguments of the 
previous section, using the same examples and figures. 

It is not possible to measure T'' directly, but the appropriate values of l and 
∂T₀/∂z, when substituted in equation (4), show that T'' cannot differ in order of 
magnitude from T'. Observations of T' can therefore be used in assessing the 
magnitude of wT''. 

In example 1 (a) the estimated magnitudes are accordingly: 

K: $10^{3}$ cm.²/sec. or considerably less (Best 1935, table XIII). 

∂T₀/∂z + Γ: 0.005° C/cm. 

K(∂T₀/∂z + Γ): 5° C × cm./sec. or much less. 

T'': 2° C. 

w: 50 cm./sec. 

Thus even with the very generous magnitudes allowed for K and ∂T₀/∂z + Γ it requires 
only a very small correlation between w and T'' to produce a term of comparable 
magnitude, and the buoyancy flux may be several times greater than K(∂T₀/∂z + Γ). 

In example 1 (b), ∂T₀/∂z can be estimated from synoptic observations, and we have 

K: $10^{5}$ cm.²/sec. [a generous estimate for light winds, see Brunt (1939) 
and Haurwitz (1941)]. 

∂T₀/∂z + Γ: $10^{-5}$° C/cm. 

K(∂T₀/∂z + Γ): 1° C × cm./sec. 

T'': 0.5° C. 

w is not known, but it is clear that the buoyancy term is potentially the greater. 

So far as absolute magnitudes are concerned, considerations of the heat balance 
in the lower layers indicate that the upward flux in example 1 (a) is of the order of 
… K(∂T₀/∂z + Γ) is only 4 cal./cm.²/hr., but a 
figure of 16 would be provided by the wT'' term provided that the correlation coefficient 
between w and T'' were as high as 0.2. Such a correlation, which implies that 
only 4 % of the total variance of w is associated with variations in T'', is a very modest 
estimate. A similar correlation coefficient in 1 (b), assuming a representative w of 
50 cm./sec., would give a flux at 3700 ft. of 4 cal./cm.²/hr., which would represent 
a very important contribution to the heat balance. 
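
The comparison of the two terms does not depend on the density or specific heat of the air, since both are multiplied by the same factor. The sketch below therefore works with the ratio only. It assumes that the buoyancy term may be written as r × w × T'', with r the correlation coefficient, and uses Best's representative w of 50 cm./sec. together with a gradient of 0.005° C/cm. (about fifty times the dry-adiabatic lapse rate); the function name is hypothetical.

```python
# Ratio of the buoyancy term r*w*T'' to the gradient term K*(dT/dz + Gamma)
# for example 1 (a). The product form r*w*T'' is an assumed reading of wT''.


def buoyancy_to_gradient_ratio(r: float, w: float, t_fluct: float,
                               k: float, gradient: float) -> float:
    """Ratio of the two flux terms; units cancel (both in degC * cm/sec)."""
    return (r * w * t_fluct) / (k * gradient)


r = 0.2                      # correlation coefficient between w and T''
ratio = buoyancy_to_gradient_ratio(r, w=50.0, t_fluct=2.0, k=1.0e3, gradient=0.005)
variance_fraction = r ** 2   # share of the variance of w associated with T''

print(f"buoyancy term / gradient term = {ratio:.1f}")
print(f"variance fraction = {100 * variance_fraction:.0f} %")
```

The ratio of 4 agrees with the quoted figures of 16 and 4 cal./cm.²/hr., and a correlation of 0.2 corresponds to 4 % of the variance.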

Thus a wide variety of synoptic evidence points to the conclusion that the buoyancy 
term wT'' is not only important but, in many situations, may be dominant in the 
heat flux. Away from the earth's surface the assumption that the magnitudes of 
T'' and T' are equal becomes more than ever justifiable, but w cannot readily be 
measured and so it is desirable to find some expression for the buoyancy flux which 
does not contain w explicitly. To do this further assumptions are necessary, and 
these may best be made from dimensional considerations. In the extreme case when 
the motion due to buoyancy predominates, an element which has moved upwards 
a distance z from its rest position will have an upward acceleration 


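$$w\frac{dw}{dz} \propto \frac{g}{T_0}\left[T'' \pm z\left|\frac{\partial T_0}{\partial z}+\Gamma\right|\right],$$

the sign being + or - according as the lapse rate is greater or less than the adiabatic, 
and the sign of proportionality being used to allow for the gradualness of the mixing 
process. From considerations of conduction in the previous section it may be supposed 
that the path L is short enough for the variations in $T_0$ and $\partial T_0/\partial z$ to be ignored 
when integrating, whence 

$$w^{2} \propto \frac{g}{T_0}\,L\left[T'' \pm \tfrac{1}{2}L\left|\frac{\partial T_0}{\partial z}+\Gamma\right|\right].$$

The path L will be determined by the magnitude of T'' and the lapse rate, and so will 
be assumed proportional to the length defined by 

$$\frac{T''}{\left|\dfrac{\partial T_0}{\partial z}+\Gamma\right|}.$$

From this it follows that 

$$w^{2} \propto \frac{g}{T_0}\,\frac{T''^{2}}{\left|\dfrac{\partial T_0}{\partial z}+\Gamma\right|},$$

and the rate of upward flux due to buoyancy 

$$wT'' \propto \left(\frac{g}{T_0}\right)^{1/2}\frac{T''^{2}}{\left|\dfrac{\partial T_0}{\partial z}+\Gamma\right|^{1/2}}. \qquad (6)$$

Formula (6) and its derivation are not applicable when the lapse rate is very close 
to the dry-adiabatic. 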
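
The scaling of the velocity with the temperature excess and the lapse rate can be illustrated numerically. The sketch below is a check under stated assumptions rather than the authors' calculation: it takes the acceleration as w dw/dz = (g/T₀)[T'' ± z|∂T₀/∂z + Γ|] with the constant of proportionality set to unity, integrates over a path equal to T''/|∂T₀/∂z + Γ|, and uses g = 981 cm./sec.² and T₀ = 288° K as illustrative values. The function name is hypothetical.

```python
# Numerical check that w^2 scales as T''^2 / |dT0/dz + Gamma|.
# Assumed form: w dw/dz = (g/T0) * (T'' +/- z*|gamma|), integrated from 0 to
# L = T''/|gamma|; the proportionality constant is taken as 1 for illustration.


def w_squared(t_excess: float, gamma_abs: float, unstable: bool,
              g: float = 981.0, t0: float = 288.0, steps: int = 1000) -> float:
    """Return w^2 (cm^2/sec^2) after a path L = T''/|gamma| (midpoint rule)."""
    path = t_excess / gamma_abs
    sign = 1.0 if unstable else -1.0
    dz = path / steps
    half_w2 = 0.0
    for i in range(steps):
        z = (i + 0.5) * dz
        half_w2 += (g / t0) * (t_excess + sign * z * gamma_abs) * dz
    return 2.0 * half_w2


base = w_squared(1.0, 1.0e-4, unstable=False)
print(f"doubling T'':       w^2 x {w_squared(2.0, 1.0e-4, False) / base:.2f}")
print(f"quadrupling |gamma|: w^2 x {w_squared(1.0, 4.0e-4, False) / base:.2f}")
```

Doubling T'' multiplies w² by four and quadrupling the gradient divides it by four, in either sign of stability; only the numerical constant differs between the two cases.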

It is of some interest to see how the buoyancy term may be expected to behave in 
the region where the entities may be approximated by simple power laws in terms 
of the height. Reference to Johnson & Heywood (1938), figure 16, shows that on 
sunny June days the fluctuations fall off approximately as the square root of height, 
while from the first part of table III it appears that ∂T₀/∂z + Γ will fall off as some 
power between 1.6 and 2. Thus in such conditions the buoyancy term will fall off 
only very slowly with height. By measuring the temperature trends at different 
levels Sutton (unpublished) also deduces that the upward flux under these conditions 
shows only a very gradual decrease with height. 
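
The slow decrease with height can be made explicit by combining the two power laws. The sketch below assumes that the buoyancy flux varies as T''² divided by the square root of |∂T₀/∂z + Γ|, which is one reading of formula (6), and that the fluctuations vary as the inverse square root of height; the function name is illustrative.

```python
# Height dependence of the buoyancy flux when T'' ~ z**a and
# |dT0/dz + Gamma| ~ z**b, assuming flux ~ T''**2 / |gradient|**0.5.


def buoyancy_flux_height_exponent(fluctuation_exponent: float,
                                  gradient_exponent: float) -> float:
    """Exponent p in flux ~ z**p."""
    return 2.0 * fluctuation_exponent - 0.5 * gradient_exponent


a = -0.5  # fluctuations fall off as the square root of height
for b in (-1.6, -2.0):
    p = buoyancy_flux_height_exponent(a, b)
    ratio = 10.0 ** p  # flux at ten times the height, relative to the lower level
    print(f"gradient ~ z^{b}: flux ~ z^{p:+.2f}, tenfold height ratio {ratio:.2f}")
```

For a gradient falling off as the power 1.6 the flux varies as the power -0.2 of height, and for the power 2 it is independent of height.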




8. Combination of mechanical and buoyancy flux 


In the preceding sections a sharp distinction has been drawn between mechanical 
and convective turbulence, as represented by the two terms in equation (6). This 
was done in order to emphasize the importance of the new term, but in reality the 
distinction is rather artificial. The two effects are necessary concomitants of each 
other, and are in many respects inseparable. This may be demonstrated by considering 
the extreme case in which buoyancy forces alone are operative, and in which each 
element is allowed to move without interference from other elements. If its motion 
is adiabatic, the element will then execute an approximately simple-harmonic 
motion with semi-range equal to the buoyancy length (defined below) and no net 
flux of heat will result. In order to account for a flux of heat, allowance must be 
made for the mutual interaction of the elements in the sharing of heat content and 
in interference with relative motions. It is known that mechanical turbulence will 
increase this mutual interaction many times over its magnitude under ordinary 
molecular processes; so that, even in cases where the flux of heat is predominantly 
controlled by convective turbulence, the mechanical turbulence plays an essential 
part in ensuring that heat carried upwards is shared by the upper layers of air and 
is not subsequently brought down again by a reversal of the buoyancy force on the 
same elements. 
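
The behaviour of an isolated element can be illustrated by direct integration. The sketch below is a minimal model under stated assumptions, not the authors' calculation: the element starts at rest with excess temperature T'' in a layer whose lapse rate is less than adiabatic, its acceleration is taken as (g/T₀)(T'' - γz) with γ = |∂T₀/∂z + Γ|, and the values g = 981 cm./sec.², T₀ = 288° K, T'' = 1° C and γ = 10⁻⁴° C/cm. are illustrative. The function name is hypothetical.

```python
# An isolated adiabatic element in a stable layer: simple-harmonic motion
# with semi-range T''/gamma and no net transport of heat.
import math


def simulate_parcel(t_excess=1.0, gamma=1.0e-4, g=981.0, t0=288.0,
                    n_periods=5, steps_per_period=2000):
    """Return (semi_range_cm, mean of w * excess) over whole oscillations."""
    omega = math.sqrt(g * gamma / t0)            # angular frequency
    dt = 2.0 * math.pi / omega / steps_per_period
    z, w = 0.0, 0.0                              # at rest at the rest position
    z_min = z_max = 0.0
    flux_sum = 0.0
    n_steps = n_periods * steps_per_period
    for _ in range(n_steps):
        excess_old = t_excess - gamma * z
        w += (g / t0) * excess_old * dt          # semi-implicit Euler: velocity first
        z += w * dt                              # then position
        excess_new = t_excess - gamma * z
        flux_sum += w * 0.5 * (excess_old + excess_new)
        z_min, z_max = min(z_min, z), max(z_max, z)
    return 0.5 * (z_max - z_min), flux_sum / n_steps


semi_range, mean_flux = simulate_parcel()
buoyancy_length = 1.0 / 1.0e-4                   # T'' / gamma, in cm

print(f"semi-range / buoyancy length = {semi_range / buoyancy_length:.3f}")
print(f"mean w * excess = {mean_flux:.2e} degC cm/sec")
```

The semi-range equals T''/γ to within the accuracy of the integration, and the mean product of velocity and temperature excess over whole oscillations is negligible compared with its peak value.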


The mathematical link between the two terms can be illustrated by the introduction 
of a length 

$$\frac{T''}{\left|\dfrac{\partial T_0}{\partial z}+\Gamma\right|}. \qquad (7)$$

This length may be termed the ‘buoyancy length’ to distinguish it from the more 
familiar concept of mixing length. When the lapse rate is less than adiabatic, the 
buoyancy length has an immediate physical significance as representing the distance 
through which a buoyant element can move adiabatically before losing the net 
upward force upon it. 

Substituting from (7) in (6), the rate of upward flux becomes 




which may be written 



( 8 ) 


an expression analogous to equation (3) for the classical theory. For lapse rates less 
than adiabatic, K' is capable of assuming both positive and negative values, so that 
the direction of flux is not necessarily determined by the gradient of potential 
temperature. K' will depend, among other things, on the amplitude of the tem¬ 
perature fluctuations at a fixed level. 

Whereas, in preceding sections, examples and arguments have been adduced to 
show that the flux of heat may often be opposed to that indicated by the gradient 
of potential temperature, there is good evidence that in certain situations that 
gradient does in fact coincide with the direction of heat flux. For example, when air is 
being cooled by advection over cold ground, records of earth temperature at Cardington 
have shown that a usual rate of warming of the ground is a mean rise of 
temperature of 0.5° C/hr. in the top 8 in. of soil. This represents a very substantial 
rate of transfer of heat from air to ground and, if both mechanical and convective 
turbulence are operative in such conditions, it might be expected that the latter 
would be very small since its contribution to the heat flux is always upwards. 
Accordingly temperature traces have been made in the lowest few thousand feet 
on several such occasions. The trace shown at figure 1 (c), which is typical, shows 
the temperature behaviour at 3000 ft. on 11 September 1945, when marked advective 
cooling of the air was in progress, accompanied by overcast skies, and the top 8 in. 
of soil were being warmed at a mean rate of over 0.6° C/hr. Similar traces were 
obtained at 2, 4, 5 and 6000 ft. 

Comparison of the trace with those shown in figure 1 (a) and (b) shows a notable 
absence of fluctuations and indicates that the upward flux of heat by convective 
turbulence is much less than in the examples studied in these figures, thus confirming 
the expectation above. Here, therefore, it seems that mechanical turbulence 
dominates the transfer of heat, and the gradient of potential temperature is in the 
direction of heat flow. 

It is of interest to estimate the value of K' necessary to transport heat downwards 
fast enough to warm the earth at the rate indicated above. The heat capacity of the 
top layers of the soil at Cardington has been measured; it is approximately 
0.6 cal./c.c. for the degree of wetness occurring on 11 September 1945. Thus, with 
∂T₀/∂z + Γ approximately $10^{-4}$° C/cm., we find K' to be about $6 \times 10^{4}$ c.g.s. units. 
This compares with Taylor's (1915) estimate of $3 \times 10^{3}$ over the sea (Great Banks) 
in a similar wind, but with a marked inversion replacing the isothermal condition 
at Cardington. 
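
The estimate of K' can be retraced as a simple heat budget. The sketch below is a hedged reconstruction: it equates the heat gained by the top 8 in. of soil to a downward flux of the form ρ c_p K' (∂T₀/∂z + Γ), and assumes an air density of 1.2 × 10⁻³ g./c.c. and a specific heat of 0.24 cal./g./° C, neither of which is quoted in the text. The usual warming rate of 0.5° C/hr. is used, and the function name is hypothetical.

```python
# Heat-budget estimate of K' for advective cooling over cold ground.
# Assumed (not from the paper): air density 1.2e-3 g/cm^3, c_p 0.24 cal/g/degC,
# and flux = rho * c_p * K' * (dT0/dz + Gamma).

IN_TO_CM = 2.54


def required_k_prime(soil_heat_capacity=0.6,   # cal/cm^3/degC
                     depth_in=8.0,             # depth of soil warmed, inches
                     warming_rate=0.5,         # degC per hour
                     air_density=1.2e-3,       # g/cm^3 (assumed)
                     air_cp=0.24,              # cal/g/degC (assumed)
                     gradient=1.0e-4):         # degC/cm, isothermal layer
    """Return K' in cm^2/sec needed to supply the observed soil warming."""
    flux = soil_heat_capacity * depth_in * IN_TO_CM * warming_rate / 3600.0
    return flux / (air_density * air_cp * gradient)


k_prime = required_k_prime()
print(f"K' = {k_prime:.1e} cm^2/sec")
```

With these inputs K' comes out a little under $6 \times 10^{4}$ cm.²/sec., of the same order as the value given in the text.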

Thus in situations of advective cooling the flux of heat is directed downwards, 
K' being positive and of considerable magnitude. In fine weather in summer the 
lapse rate near the ground is greater than adiabatic and K' will be large and positive 
there, but may change sign in the region where the lapse rate becomes less than 
adiabatic with the flux of heat still directed upwards. When cold air is being warmed 
by advection the evidence points to a considerable flux of heat into regions, well 
away from the surface, where the lapse rate is stable, and thus indicates a large 
negative value of K'. This coefficient is then seen to depend very largely on the 
synoptic situation. The classical theory also stresses this dependence of K, but the 
considerations presented here indicate that not only the magnitude but also the 
sign of K' is a function of the synoptic situation. 





9. General appreciation and extension to other properties 

In the extreme types of situation characterized by advective cooling and warming 
of air it seems that mechanical and convective turbulence respectively dominates the 
vertical heat flux. In the synoptic situations ranging between these extremes it 
seems likely that the one type of turbulence will gradually yield in importance to 
the other, and the various situations will be characterized according to the extent 
of predominance of the one or the other mechanism. It is possible that in certain 
'quiet' types of weather a state of balance between the two may be approached. 
It is not possible to categorize, from this point of view, the different situations that 
occur without farther extensive experimental investigation. This investigation 
should cover several heights at all times of day in a wide variety of synoptic situations, 
and should embrace different regions, with special attention to the nature of 
the underlying surface. At the same time an instrument might be devised to measure 
wT' directly, and thus the total flux as shown by expression (2). 

From the theoretical standpoint there is the need for extension of the theory and 
observations to properties other than temperature. The arguments of § 6 applied 
to the quantity T + Γz are equally valid for any other physical property which may 
be considered as conserved in vertical motion up to the time at which an eddy loses its 
individuality by mixing processes. Examples of such properties are momentum in a 
given direction, specific humidity, dust content, and the salinity of the ocean. Now in 
a turbulent regime every property is likely to be subject at all levels to fluctuations 
similar to those represented by T''. Thus for each property whose transfer by 
turbulence is considered, the possibility of the existence of the discriminatory second 
term in (6) must be borne in mind. This applies immediately to any property which 
depends upon the density, or which affects the density, of the medium. In such cases 
the potential magnitude of the second term must, if possible, be estimated in relation 
to that of the first. From the fact that each property which evidences a discrimination 
is likely to do so to a different extent, it follows that the measure of transport 
may be expected to be different for various properties; and that a pronounced 
discrimination in the case of one property will affect the transport of a second property 
if the two are for any reason correlated. So far as is known this paper provides 
the first real physical justification for the use of different values of K for the transport 
by turbulence of different properties. 

It also follows from expression (6) and the definition of l that properties which are 
subject to no discrimination in the direction of transport, and are uncorrelated with 
any property which is so subject, will be transported by the same mechanism, as 
represented by the first term, with the same value of K. This result depends on the 
validity of regarding the mixing process as discontinuous, which is justifiable 
provided that the mechanical disruption of the eddy is the predominant factor in 
mixing, as opposed to the gradual transfer of the property from one eddy to another 
by molecular processes. 




In connexion with other properties it is of interest to note that a continuous record 
of the wet-bulb temperature was obtained synchronously with the dry-bulb trace 
of figure 1 (a). The similarity of the peaks and troughs in the two traces was extremely 
striking, but consideration of the implied fluctuations in humidity has been left 
for future work. 

We wish to express our gratitude to the Director of the Meteorological Office for 
permission to publish this paper and for the use of observational material, and to 
members of the Meteorological Research Committee for criticisms of the first draft 
of the paper. Our thanks are also due to Mr H. Bateman and his staff at the Balloon 
Development Establishment at Cardington for assistance in the instrumental work, 
and for co-operation in the collection of data which have largely provided the material 
basis for this paper. 


References 

Ashburn, 0. M. 1940 Bull. Amer. Met. Soc. 21, 205. 

Best, A. C. 1935 Geophys. Mem. no. 65. 

Bilham, E. G. 1935 Quart. J. Roy. Met. Soc. 61, 259. 

Brunt, D. 1929 Proc. Roy. Soc. A, 124, 201. 

Brunt, D. 1939 Physical and dynamical meteorology, 2nd ed. Camb. Univ. Press. 

Elsasser, W. M. 1942 Harv. Met. Stud. no. 6. 

Ertel, H. 1942 Met. Z. 59, Heft 8, p. 260. 

Haurwitz, B. 1941 Dynamic meteorology. New York: McGraw Hill. 

Johnson, N. K. & Heywood, G. S. P. 1938 Geophys. Mem. no. 77. 

Petterssen, S. & Swinbank, W. C. Large-scale turbulence in the free atmosphere 
(unpublished). 

Richardson, L. F. 1920 Proc. Roy. Soc. A, 97, 354. 

Scrase, F. J. 1930 Geophys. Mem. no. 52. 

Sutton, O. G. Convection in the atmosphere near the ground (unpublished). 

Taylor, G. I. 1915 Phil. Trans. A, 215, 1. 

Taylor, G. I. Convection from a line source in a side wind (unpublished). 




INDEX TO VOLUME 189 (A) 


Acock, G. P., Garner, W. E., Milsted, J. & Willavoys, H. J. The dehydration of 
ammonium, potassium and some mixed alums, 508. 

Alexandroff, P. On the dimension of normal spaces, 11. 

Atmosphere. Vertical transport of heat by turbulence (Priestley & Swinbank), 543. 

Atmospheric tides in the ionosphere. I. Solar tides in the F₂ region (Martyn), 241. 

Biharmonic problems. Mixed boundary conditions in the relaxational treatment (plane 
strain or stress) (Fox), 535. 

Blair, G. W. S., Veinoglou, B. C. & Caffyn, J. E. Limitations of the Newtonian time 
scale in relation to non-equilibrium rheological states and a theory of 
quasi-properties, 69. 

Bleaney, B. & Penrose, R. P. The inversion spectrum of ammonia at centimetre 
wave-lengths, 368. 

Bristow, J. R. Kinetic boundary friction, 88. 

Buckingham, R. A. & Corner, J. Tables of second virial and low-pressure 
Joule-Thomson coefficients for intermolecular potentials with exponential repulsion, 118. 

Bunn, C. W. & Garner, E. V. The crystal structures of two polyamides ('nylons'), 39. 

Burnett, G. M. & Melville, H. W. Determination of the velocity coefficients for 
polymerization processes. I. The direct photopolymerization of vinyl acetate. II. The 
photosensitized polymerization of vinyl acetate by benzoyl peroxide. III. Effect of 
environment on polymerization processes, 456, 481, 494. 

Caffyn, J. E. See Blair, Veinoglou & Caffyn. 

Corner, J. See Buckingham & Corner. 

Crawford, B. H. Visual adaptation in relation to brief conditioning stimuli [Abstract], 
284. 

Crystal structure of barium titanium. Temperature changes (Megaw), 261. 

Crystal structures of two polyamides ('nylons') (Bunn & Garner), 39. 

Dehydration of ammonium, potassium and some mixed alums (Acock, Garner, Milsted 
& Willavoys), 508. 

Determination of the velocity coefficients for polymerization processes. I, II, and III 
(Burnett & Melville), 456, 481, 494. 

Dimension of normal spaces (Alexandroff), 11. 

Effect of crossed electric magnetic fields on the helium spectrum (Foster & Pounder), 287. 

Environment. Effect on polymerization processes (Burnett & Melville), 494. 

Explosives, rate of detonation (Jones), 415. 

Fine structure of the lines of the H₂ spectrum which go down to the 2p ³Π states (Foster 
& Richardson), 176. 

Fine structure of the spectral lines of H₂: measurements with a reflexion echelon (Foster 
& Richardson), 149. 

Foster, E. W. & Richardson, Sir Owen. The fine structure of the lines of the H₂ spectrum 
which go down to the 2p ³Π states, 176. 

Foster, E. W. & Richardson, Sir Owen. The fine structure of the spectral lines of H₂: 
measurements with a reflexion echelon, 149. 

Foster, J. S. & Pounder, E. R. Effect of crossed electric magnetic fields on the helium 
spectrum, 287. 

Fox, L. Mixed boundary conditions in the relaxational treatment of biharmonic 
problems (plane strain or stress), 535. 

Fragmentation of shell cases (Mott), 300. 

Garner, E. V. See Bunn & Garner. 

Garner, W. E. See Acock, Garner, Milsted & Willavoys. 

Gases after the electric discharge has passed. The surprising amount of energy which 
can be collected from (Rayleigh), 296. 

Gaydon, A. G. & Whittingham, G. The spectra of flames containing oxides of sulphur, 313. 

General kinetic theory of liquids. II. Equilibrium properties (Green), 103. 

Gold. Slow neutron-induced activity (Siegbahn), 527. 

Green, H. S. A general kinetic theory of liquids. II. Equilibrium properties, 103. 

Grew, K. E. Thermal diffusion in mixtures of the inert gases, 402. 

Harish-Chandra. Infinite irreducible representations of the Lorentz group, 372. 

Hartley, Sir Harold. Antoine Laurent Lavoisier, 427. 

Helium spectrum, effect of crossed electric magnetic fields (Foster & Pounder), 287. 

Inert gases. Thermal diffusion in mixtures of the (Grew), 402. 

Infinite irreducible representations of the Lorentz group (Harish-Chandra), 372. 
Ionosphere, atmospheric tides (Martyn), 241. 

Ionosphere, radiative equilibrium (Woolley), 218. 

Interaction of steam and charcoal at low pressures (Strickland-Constable), 1. 

Inversion spectrum of ammonia at centimetre wave-lengths (Bleaney & Penrose), 368. 

Jones, H. A theory of the dependence of the rate of detonation of solid explosives on 
the diameter of the charge, 415. 

Kinetic boundary friction (Bristow), 88. 

Lavoisier, Antoine Laurent (Hartley), 427. 

Limitations of the Newtonian time scale in relation to non-equilibrium rheological states 
and a theory of quasi-properties (Blair, Veinoglou & Caffyn), 69. 

Liquids. A general kinetic theory. II. Equilibrium properties (Green), 103. 

Lorentz group. Infinite irreducible representations of the (Harish-Chandra), 372. 

Martyn, D. F. Atmospheric tides in the ionosphere. I. Solar tides in the F₂ region, 241. 
Megaw, H. D. Temperature changes in the crystal structure of barium titanium oxide, 
261. 

Melville, H. W. See Burnett & Melville. 

Merton, Sir Thomas. On a method of increasing contrast in microscopy, 309. 

Method of increasing contrast in microscopy (Merton), 309. 

Microscopy. On a method of increasing contrast in (Merton), 309. 

Milsted, J. See Acock, Garner, Milsted & Willavoys. 

Mixed boundary conditions in the relaxational treatment of biharmonic problems (plane 
strain or stress) (Fox), 535. 

Mott, N. F. Fragmentation of shell cases, 300. 

Newtonian time scale. Limitations in relation to non-equilibrium rheological states and 
a theory of quasi-properties (Blair, Veinoglou & Caffyn), 69. 

Oblique reflexion of very long wireless waves from the ionosphere (Wilkes), 130. 

Penrose, R. P. See Bleaney & Penrose. 

Photopolymerization of vinyl acetate (Burnett & Melville), 456. 

Photosensitized polymerization of vinyl acetate by benzoyl peroxide (Burnett & 
Melville), 481. 

Polyamides. Crystal structure (Bunn & Garner), 39. 

Polymerization processes. Determination of the velocity coefficients. I, II and III 
(Burnett & Melville), 456, 481, 494. 

Pounder, E. R. See Foster & Pounder. 

Priestley, C. H. B. & Swinbank, W. C. Vertical transport of heat by turbulence in the 
atmosphere, 543. 

Radiative equilibrium in the ionosphere (Woolley), 218. 

Rayleigh, Lord. The surprising amount of energy which can be collected from gases 
after the electric discharge has passed, 296. 

Relaxational treatment of biharmonic problems (plane strain or stress). Mixed boundary 
conditions (Fox), 535. 

Richardson, Sir Owen. See Foster & Richardson. 

Shell cases. Fragmentation of (Mott), 300. 

Siegbahn, K. Slow neutron-induced activity in gold, 527. 

Simple group of order 25920 (Todd), 326. 

Slow neutron-induced activity in gold (Siegbahn), 527. 

Solar tides in the F₂ region. Atmospheric tides in the ionosphere. I. (Martyn), 241. 

Spectra of flames containing oxides of sulphur (Gaydon & Whittingham), 313. 

Steam and charcoal. Interaction at low pressures (Strickland-Constable), 1. 

Strickland-Constable, R. E. Interaction of steam and charcoal at low pressures, 1. 

Surprising amount of energy which can be collected from gases after the electric discharge 
has passed (Rayleigh), 296. 

Swinbank, W. C. See Priestley & Swinbank. 

Tables of second virial and low pressure Joule-Thomson coefficients for intermolecular 
potentials with exponential repulsion (Buckingham & Corner), 118. 

Temperature changes in the crystal structure of barium titanium oxide (Megaw), 261. 

Theory of the dependence of the rate of detonation of solid explosives on the diameter 
of the charge (Jones), 415. 

Thermal diffusion in mixtures of the inert gases (Grew), 402. 

Todd, J. A. On the simple group of order 25920, 326. 

Turbulence in the atmosphere. Vertical transport of heat by (Priestley & Swinbank), 543. 

Veinoglou, B. C. See Blair, Veinoglou & Caffyn. 

Velocity coefficients for polymerization processes. Determination I, II and III (Burnett 
& Melville), 456, 481, 494. 

Vertical transport of heat by turbulence in the atmosphere (Priestley & Swinbank), 543. 

Vinyl acetate. The direct photopolymerization (Burnett & Melville), 456. 

Vinyl acetate. The photosensitized polymerization by benzoyl peroxide (Burnett 
& Melville), 481. 

Visual adaptation in relation to brief conditioning stimuli [Abstract] (Crawford), 284. 

Whittingham, G. See Gaydon & Whittingham. 

Wilkes, M. V. The oblique reflexion of very long wireless waves from the ionosphere, 130. 

Willavoys, H. J. See Acock, Garner, Milsted & Willavoys. 

Wireless Waves. Oblique reflexion of waves from the ionosphere (Wilkes), 130. 

Woolley, R. v. d. R. Radiative equilibrium in the ionosphere, 218. 


END OF THE ONE HUNDRED AND EIGHTY-NINTH VOLUME (SERIES A) 










